Effective Instruction Tuning: Data & Methods

  • Published: Oct 21, 2024

Comments • 1

  • @labsanta
    1 year ago +5

    1. What is the topic of the recording?
    - The topic is effective instruction tuning: data and methods.
    2. What is the goal of instruction tuning?
    - The goal of instruction tuning is to teach a model to follow instructions and to perform new, unseen tasks from instructions it had not seen at training time.
    3. How is instruction tuning different from alignment tuning?
    - Alignment tuning is a type of instruction tuning that includes more open-ended generation and creative tasks than traditional instruction tuning methods and incorporates an element of human feedback.
    4. What are some methods used to achieve stronger instruction tuning?
    - The five methods used to achieve stronger instruction tuning are scaling to 1,800 tasks, adding Chain-of-Thought fine-tuning data (sketched below), enriching the diversity of tasks from existing datasets, using input inversion, and balancing the different sources of data.
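    A minimal sketch of what a Chain-of-Thought fine-tuning example might look like; the field names and prompt phrasing are illustrative assumptions, not the exact format used in the talk.

    ```python
    # Hypothetical sketch of a Chain-of-Thought fine-tuning example.
    # Field names and phrasing are illustrative assumptions.

    def format_cot_example(question: str, rationale: str, answer: str) -> dict:
        """Pack a question, a step-by-step rationale, and the final answer
        into one input/target pair for supervised fine-tuning."""
        return {
            "input": f"{question}\nLet's think step by step.",
            "target": f"{rationale} So the answer is {answer}.",
        }

    example = format_cot_example(
        question="A farmer has 3 pens with 4 sheep each. How many sheep in total?",
        rationale="Each pen holds 4 sheep and there are 3 pens, so 3 * 4 = 12.",
        answer="12",
    )
    print(example["input"])
    print(example["target"])
    ```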
    5. What is the Flan Collection?
    - The Flan Collection is a collection of instruction tuning datasets that aim to improve a model's ability to understand and follow instructions.
    1. How many tasks are in the collections of prior works, such as the PromptSource/T0 set of tasks and the Natural Instructions V2 set of tasks?
    - There are approximately 1,800 tasks.
    2. What is the effect of scaling fine-tuning tasks to an unprecedented number of tasks?
    - Models benefit from having more tasks, with the exception of the very smallest model.
    - Although there are some diminishing returns, models get the most performance when all tasks are included.
    - There is room to increase the diversity and number of tasks for fine-tuning, especially for larger models.
    3. What is input inversion on existing data sets, and is it effective?
    - Input inversion means permuting the order of components in a data set to create different input-target pairs and instructions.
    - This creates greater variety in tasks from the same data set.
    - It is still beneficial even with 1,800 tasks and the existing diversity.
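    A minimal sketch of input inversion, assuming a simple question-answer data set; the task phrasings are illustrative assumptions, not the exact instructions from the talk.

    ```python
    # Input inversion: permute the components of an existing (question, answer)
    # data set to derive new tasks from the same data.

    def invert_qa_pairs(pairs):
        """Yield the original task and its inversion for each (q, a) pair."""
        for question, answer in pairs:
            # Original direction: answer the question.
            yield {"input": f"Answer the question: {question}", "target": answer}
            # Inverted direction: write a question for the given answer.
            yield {"input": f"Write a question whose answer is: {answer}",
                   "target": question}

    for ex in invert_qa_pairs([("What is the capital of France?", "Paris")]):
        print(ex)
    ```

    One data set thus yields two distinct tasks, which is why inversion still adds variety on top of 1,800 existing tasks.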
    4. What is mixing template types at training?
    - Mixing template types means formulating the same data set with different templates.
    - This can be done in a zero-shot setting, where no exemplars are provided, or in a few-shot setting, where one or more exemplars are provided.
    - The first exemplar helps the model see the pattern of how it should respond to the question.
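    A minimal sketch of rendering the same underlying example with a zero-shot template and with a few-shot template; the template strings are illustrative assumptions.

    ```python
    # Mixing template types: one example can be formatted zero-shot
    # (no exemplars) or few-shot (an exemplar shows the answer pattern).

    ZERO_SHOT = "Q: {question}\nA:"
    FEW_SHOT = ("Q: {ex_q}\nA: {ex_a}\n\n"   # the exemplar shows the pattern
                "Q: {question}\nA:")

    def render(question, exemplar=None):
        """Render a zero-shot prompt, or a few-shot prompt if an
        (exemplar_question, exemplar_answer) pair is supplied."""
        if exemplar is None:
            return ZERO_SHOT.format(question=question)
        ex_q, ex_a = exemplar
        return FEW_SHOT.format(ex_q=ex_q, ex_a=ex_a, question=question)

    print(render("What is 2 + 2?"))
    print(render("What is 2 + 2?", exemplar=("What is 1 + 1?", "2")))
    ```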
    5. Is there a limit to the number of tasks that can be added to a model?
    - Gains show diminishing returns after roughly 200 to 400 tasks, but larger models have more capacity to benefit from more tasks.
    - There is still room to increase the diversity and number of tasks for fine-tuning, especially for larger models.
    1. What is the difference between zero-shot and few-shot templates?
    - Zero-shot templates include no exemplars in the prompt, while few-shot templates include one or more worked examples.
    - Zero-shot templates rely on the model's general knowledge, while few-shot templates use specific examples to show the model the expected pattern.
    - Zero-shot templates are generally less accurate than few-shot templates.
    2. What was the purpose of the experiment discussed in the transcript?
    - The experiment aimed to test the trade-off in performance between zero-shot and few-shot templates.
    - The experiment tested the value of incorporating a mixture of both zero-shot and few-shot templates.
    - The experiment aimed to find a more informed final weighting of the different sources of data.
    3. What was the surprising finding of the experiment?
    - The mixture of both zero-shot and few-shot templates improved results compared to using only one type.
    - Incorporating both zero-shot and few-shot templates improved task diversity and performance in both zero-shot and few-shot evaluations.
    - Performance did not decrease, as might have been expected, when both zero-shot and few-shot templates were included.
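    A minimal sketch of how such a mixture might be built, assuming each training example is independently assigned a template type; the 50/50 ratio is an illustrative assumption, not the ratio used in the experiment.

    ```python
    import random

    def assign_template_types(examples, few_shot_rate=0.5, seed=0):
        """Tag each example with the template type it will be rendered with."""
        rng = random.Random(seed)
        return [
            {**ex, "template": "few_shot" if rng.random() < few_shot_rate
                               else "zero_shot"}
            for ex in examples
        ]

    mixed = assign_template_types([{"question": "What is 2 + 2?"}] * 4)
    print([ex["template"] for ex in mixed])
    ```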
    4. How was the balance of data achieved in the experiment?
    - The experiment ablated subsets of data by source or set of tasks, such as dialogue, program synthesis, and Chain of Thought.
    - The subsets of data were ranked by importance, which led to a more informed final weighting.
    - This more informed final weighting produced better results than omitting any source of data or weighting all sources equally.
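    A minimal sketch of weighted sampling across data sources; the source names come from the answer above, but the weights are illustrative assumptions standing in for the more informed final weighting.

    ```python
    import random

    SOURCE_WEIGHTS = {            # hypothetical relative mixture weights
        "dialogue": 0.2,
        "program_synthesis": 0.3,
        "chain_of_thought": 0.5,
    }

    def sample_source(rng: random.Random) -> str:
        """Draw a data source in proportion to its mixture weight."""
        names = list(SOURCE_WEIGHTS)
        weights = [SOURCE_WEIGHTS[n] for n in names]
        return rng.choices(names, weights=weights, k=1)[0]

    rng = random.Random(0)
    print([sample_source(rng) for _ in range(5)])
    ```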
    5. What were the final results of the experiment?
    - The final Flan-T5 XL generally got the best results on both held-in and held-out tasks.
    - Removing any one of the methods described generally decreased performance.
    - Fine-tuning the same base model on different subsets of the instruction-tuning data, using the methods described, generally led to a decent boost in performance.