RLHF & DPO Explained (In Simple Terms!)

  • Published: Jan 2, 2025

Comments • 12

  • @pamelamadingdong 1 month ago

    The fact that you gave concrete examples really helped me get through this! Thank you for the great video.

  • @BijouBakson 17 days ago

    Thank you

  • @joshuatettey7771 26 days ago

    Great video

  • @liberate7604 6 months ago

    Great video. Is it better to use KTO as the optimizer for binary classification?

    • @EntryPointAI 6 months ago

      I couldn't say for sure. Binary classification is a fairly simple task, so I would start with supervised fine-tuning.

  • 5 months ago

    Awesome. Thanks

  • @VerdonTrigance 6 months ago

    Hey! Thanks for the video! I've never used these techniques, but what I really want to do is train a base or chat LLM like Llama or Phi-3 on some big text (The Lord of the Rings, for example). All the techniques I've seen so far require a properly prepared dataset, but who can do that, and how? Ask every possible question and answer them all as well? That's impossible! Do you know how I can prepare a dataset to train a model on later?

    • @EntryPointAI 6 months ago

      Besides including the big text in a model's pretraining, you can fine-tune on it using empty prompts, which will make the model more likely to respond in a style similar to the writing. That doesn't necessarily make it an expert on the contents. In order to answer questions about a corpus, the typical approach is to chunk it up and use RAG. I have another video on the difference between RAG and fine-tuning.
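The two ideas in the reply above (empty-prompt fine-tuning records and chunking a corpus) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the author's method. The function names, the word-based chunk sizes, and the `prompt`/`completion` field names are all assumptions; the record schema in particular depends on the fine-tuning tool you use.

```python
import json


def chunk_text(text: str, chunk_size: int = 200, overlap: int = 20) -> list[str]:
    """Split text into overlapping chunks of at most `chunk_size` words."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def empty_prompt_records(chunks: list[str]) -> list[dict]:
    """Pair each chunk with an empty prompt (hypothetical field names)."""
    return [{"prompt": "", "completion": chunk} for chunk in chunks]


def to_jsonl(records: list[dict]) -> str:
    """Serialize records as JSON Lines, one record per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


if __name__ == "__main__":
    sample = "In a hole in the ground there lived a hobbit. " * 50
    pieces = chunk_text(sample, chunk_size=100, overlap=10)
    print(to_jsonl(empty_prompt_records(pieces))[:200])
```

The same chunks could instead be embedded and indexed for RAG; only the empty-prompt pairing step is specific to the fine-tuning route.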

    • @iasplay224 6 months ago

      Thank you for the info. It was very well explained for an introduction.

  • @MarshallRoy-h9e 3 months ago

    Melisa Branch

  • @priscillaleapman2367 3 months ago

    Martin Shirley Jackson Kenneth Allen Mary