How Can We Generate BETTER Sequences with LLMs?

  • Published: Sep 20, 2024
  • We know that LLMs are trained to predict the next token. When we decode the output sequence, we condition on the prompt tokens and the previously generated tokens to predict the next one. With greedy decoding or multinomial sampling, we use those predictions to emit the next token autoregressively. But is that the sequence we are actually looking for, given the prompt? Do we really care about the probability of each next token on its own? What we want is the whole sequence that maximizes the probability conditioned on the prompt, not each token maximized separately.
    So let's look at why the next-token probability is not the quantity we ultimately care about, and how we can do better than autoregressively committing to the most likely next token at each step.
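
To make this concrete, here is a minimal, self-contained Python sketch. The toy `NEXT_TOKEN_PROBS` table is a made-up stand-in for a real model (an assumption for illustration only, not anything from the video): greedy decoding commits to the locally most probable token at every step, while a simple beam search keeps the few best partial sequences and can end up with a higher probability for the whole sequence.

```python
import math

# Hypothetical toy "model": next-token probabilities given the last token.
# Vocabulary: "<s>", "A", "B", "C", "<eos>". Purely illustrative numbers.
NEXT_TOKEN_PROBS = {
    "<s>": {"A": 0.55, "B": 0.45},
    "A":   {"<eos>": 0.4, "C": 0.6},
    "B":   {"<eos>": 0.95, "C": 0.05},
    "C":   {"<eos>": 1.0},
}

def greedy_decode(start="<s>", max_len=5):
    """Pick the single most probable next token at every step."""
    seq, logp, tok = [start], 0.0, start
    for _ in range(max_len):
        probs = NEXT_TOKEN_PROBS[tok]
        tok = max(probs, key=probs.get)        # locally best token
        logp += math.log(probs[tok])
        seq.append(tok)
        if tok == "<eos>":
            break
    return seq, logp

def beam_search(start="<s>", beam_width=2, max_len=5):
    """Keep the beam_width highest-scoring partial sequences at each step
    (simplified: finished hypotheses are set aside and compared at the end)."""
    beams = [([start], 0.0)]                   # (tokens, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, logp in beams:
            for tok, p in NEXT_TOKEN_PROBS[seq[-1]].items():
                new_seq, new_logp = seq + [tok], logp + math.log(p)
                if tok == "<eos>":
                    finished.append((new_seq, new_logp))
                else:
                    candidates.append((new_seq, new_logp))
        if not candidates:
            break
        # Prune: keep only the best beam_width partial sequences.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    # Return the complete sequence with the highest total log-probability.
    return max(finished, key=lambda c: c[1])

if __name__ == "__main__":
    g_seq, g_logp = greedy_decode()
    b_seq, b_logp = beam_search()
    print("greedy:", g_seq, "log-prob:", round(g_logp, 3))
    print("beam  :", b_seq, "log-prob:", round(b_logp, 3))
```

In this toy example, greedy decoding picks "A" first because it is locally more probable (0.55 vs 0.45) and ends with a sequence of probability 0.33, while beam search finds "<s> B <eos>" with probability about 0.43. Beam search is just one common way to search closer to the sequence-level optimum; the video may discuss other decoding strategies as well.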
