Transformer Architecture Explained from Scratch with Detailed Math Examples

  • Published: 10 Oct 2024
  • Watch now and learn the mathematics behind the Transformer Architecture!
    In this video, we dive into the Transformer Architecture and explore its components, including token embeddings, positional encoding, attention, and feed-forward blocks. You'll learn about the encoder-decoder architecture, how it is used in sequence-to-sequence tasks, and why multi-head attention matters in the Transformer model.
    Contents Covered
    Introduction to the Transformer Architecture
    Token Embeddings and Positional Encoding
    Attention Mechanism, Multi-Head Attention, and Its Variants (see the sketch after this list)
    Feed-Forward Blocks and Layer Normalization
    Training and Inference Process
    Decoding Strategy and Greedy Decoding
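    To make the listed math concrete, here is a minimal NumPy sketch of three of the pieces above: sinusoidal positional encoding, scaled dot-product attention, and a simplified layer normalization (without the learned scale and shift). It follows the standard formulations from the original Transformer paper; the dimensions and function names are illustrative and not taken from the video's code.

        import numpy as np

        def positional_encoding(seq_len, d_model):
            # PE(pos, 2i)   = sin(pos / 10000^(2i / d_model))
            # PE(pos, 2i+1) = cos(pos / 10000^(2i / d_model))
            pos = np.arange(seq_len)[:, None]                  # (seq_len, 1)
            i = np.arange(d_model // 2)[None, :]               # (1, d_model/2)
            angles = pos / np.power(10000, (2 * i) / d_model)  # (seq_len, d_model/2)
            pe = np.zeros((seq_len, d_model))
            pe[:, 0::2] = np.sin(angles)
            pe[:, 1::2] = np.cos(angles)
            return pe

        def scaled_dot_product_attention(Q, K, V):
            # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
            d_k = Q.shape[-1]
            scores = Q @ K.T / np.sqrt(d_k)                    # (n_q, n_k)
            weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
            weights /= weights.sum(axis=-1, keepdims=True)     # row-wise softmax
            return weights @ V                                 # (n_q, d_v)

        def layer_norm(x, eps=1e-5):
            # Normalize each token vector to zero mean and unit variance
            # (the real layer also learns a per-dimension scale and shift).
            mean = x.mean(axis=-1, keepdims=True)
            std = x.std(axis=-1, keepdims=True)
            return (x - mean) / (std + eps)

        # Toy example: 4 tokens, model dimension 8, self-attention with a residual connection
        x = np.random.randn(4, 8) + positional_encoding(4, 8)
        out = layer_norm(x + scaled_dot_product_attention(x, x, x))
        print(out.shape)  # (4, 8)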
    Key Takeaways:
    Learn the Transformer Architecture and its various components
    Understand the importance of multi-head attention and how it's used in the model
    Discover how the feed-forward blocks and layer normalization work
    Understand the training and inference process
    Learn about different decoding strategies, starting with greedy decoding (see the sketch below)
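    As a companion to the decoding discussion, here is a minimal sketch of greedy decoding for an encoder-decoder model: at every step, the single most probable next token is chosen. The decode_step function is a hypothetical stand-in for a decoder interface, not the API used in the video.

        import numpy as np

        def greedy_decode(decode_step, encoder_out, bos_id, eos_id, max_len=50):
            # decode_step(encoder_out, tokens) is assumed (hypothetically) to return
            # the logits over the vocabulary for the next position.
            tokens = [bos_id]
            for _ in range(max_len):
                logits = decode_step(encoder_out, tokens)  # (vocab_size,)
                next_id = int(np.argmax(logits))           # greedy: pick the argmax
                tokens.append(next_id)
                if next_id == eos_id:                      # stop at end-of-sequence
                    break
            return tokens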
    What's Next: In the next video, we'll implement the Transformer Architecture for translation tasks. Stay tuned!
    Subscribe to our channel for more AI and machine learning tutorials!
    Join this channel to get access to perks:
    / @neuralhackswithvasanth
    Important Links:
    GitHub Repo: github.com/Vas...
    For further discussions, please join the following Telegram group:
    Telegram Group Link: t.me/nhv4949
    You can also connect with me on the following socials:
    Gmail: vasanth51430@gmail.com
    LinkedIn: / vasanthengineer4949

Comments • 8

  • @agamergen • 3 days ago • +1

    Please keep going. I love your creativity

  • @SowmyaRao-d9g • 3 days ago • +1

    Very detailed and you explained it very well.

  • @openai.deepaksingh • 2 days ago

    This was awesome. It cleared a lot of my doubts. I hope this channel keeps bringing such videos.

  • @agamergen • 3 days ago • +1

    As always, you chew it very well so we can swallow it easily ❤

  • @prashlovessamosa • 3 days ago • +1

    Thanks Anna

  • @ShivamPradhan-c1x • 1 day ago

    Can you add the resource link for this video?

  • @NavdeepVarshney-ep4ck • 16 hours ago

    Can I get your number? Are you a researcher?