L19.5.2.5 GPT-v3: Language Models are Few-Shot Learners

  • Published: 11 Sep 2024
  • Sebastian's books: sebastianrasch...
    Slides: sebastianrasch...
    -------
    This video is part of my Introduction to Deep Learning course.
    Next video: • L19.5.2.6 BART: Combi...
    The complete playlist: • Intro to Deep Learning...
    A handy overview page with links to the materials: sebastianrasch...
    -------
    If you want to be notified about future videos, please consider subscribing to my channel: / sebastianraschka

Comments • 5

  • @niranjansitapure4740 • 1 year ago

    This series has been super informative for getting a semi-deep dive into the concepts of different transformer and NLP models. Super helpful!

  • @jonathansum9084 • 3 years ago +1

    Hi. Thank you for uploading the video.
    I have a question: in the zero-shot, one-shot, or few-shot setting, does GPT perform gradient updates the way fine-tuning does?
    If not, does that mean it does not learn anything from the shots and they are only used to test its performance?

    • @SebastianRaschka • 3 years ago +4

      Good question. I think you are referring to the few-shot examples during testing/inference? I am 99% sure that GPT-3 does not do any gradient descent updates when it uses the examples in the context.
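
To make that concrete, here is a minimal sketch, in Python with the Hugging Face transformers library, of what "examples in the context" means. GPT-3's weights are not openly available, so GPT-2 is used here purely as a stand-in, and the prompt follows the translation example from the GPT-3 paper. The few-shot examples are simply prepended to the prompt; the model only runs forward passes, with no gradient updates.

    # Sketch only: GPT-2 stands in for GPT-3, whose weights are not public.
    import torch
    from transformers import GPT2LMHeadModel, GPT2Tokenizer

    tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    model.eval()  # inference only: no optimizer, no loss, no backward pass

    # Zero-shot: the task description and the query alone.
    zero_shot = "Translate English to French:\ncheese =>"

    # Few-shot: the same task, but with worked examples placed in the context
    # (prompt format adapted from the GPT-3 paper).
    few_shot = (
        "Translate English to French:\n"
        "sea otter => loutre de mer\n"
        "peppermint => menthe poivrée\n"
        "cheese =>"
    )

    for name, prompt in [("zero-shot", zero_shot), ("few-shot", few_shot)]:
        inputs = tokenizer(prompt, return_tensors="pt")
        with torch.no_grad():  # the weights are never updated
            out = model.generate(
                **inputs,
                max_new_tokens=5,
                do_sample=False,
                pad_token_id=tokenizer.eos_token_id,
            )
        completion = tokenizer.decode(out[0][inputs["input_ids"].shape[1]:])
        print(name, "->", repr(completion))

The only difference between the two settings is the text the model is conditioned on; GPT-2 will not translate nearly as well as GPT-3, but the mechanics are the same.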

  • @yarasultan3433 • 25 days ago

    good

  • @davidlearnforus • 1 year ago

    Thanks for the great video! I struggle to understand how 0-, 1-, or few-shot training (or is it just testing?) lets the model make better predictions in the few-shot case versus the 0-shot case if the weights are not updated. The only thing I can think of would be somehow combining a backpropagated gradient with every new shot. How would a 0-shot and a few-shot example otherwise even be comparable? Wouldn't everything just be 0-shot?
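
This question gets at the crux: a few-shot prompt improves predictions not by updating the weights but by changing what the model is conditioned on. Below is a minimal sketch (Python with Hugging Face transformers, again with GPT-2 standing in for GPT-3) showing that the next-token distribution shifts purely through the forward pass while the parameters stay bit-for-bit identical.

    # Sketch only: GPT-2 stands in for GPT-3 (no public weights for GPT-3).
    import torch
    from transformers import GPT2LMHeadModel, GPT2Tokenizer

    tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    model.eval()

    zero_shot = "Translate English to French:\ncheese =>"
    few_shot = (
        "Translate English to French:\n"
        "sea otter => loutre de mer\n"
        "cheese =>"
    )

    # Snapshot one weight tensor so we can verify nothing was updated.
    weights_before = model.transformer.wte.weight.detach().clone()

    for name, prompt in [("zero-shot", zero_shot), ("few-shot", few_shot)]:
        ids = tokenizer(prompt, return_tensors="pt")["input_ids"]
        with torch.no_grad():                       # forward pass only
            next_token_logits = model(ids).logits[0, -1]
        probs = torch.softmax(next_token_logits, dim=-1)
        top = torch.topk(probs, k=5)
        tokens = [tokenizer.decode([i]) for i in top.indices.tolist()]
        print(name, list(zip(tokens, [round(p, 4) for p in top.values.tolist()])))

    # The conditioning (the prompt) changed; the parameters did not.
    print("weights unchanged:", torch.equal(weights_before, model.transformer.wte.weight))

In both settings the prediction comes from the same frozen parameters; only the context tokens differ, which is why the GPT-3 paper calls this "in-context learning" rather than fine-tuning.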