Vision Transformer Basics

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)

Transformers (how LLMs work) explained visually | DL5

Making Cookies For Santa

Warfare | Official Trailer HD | A24

where i have been.

Vision Transformer (ViT) - An Image is Worth 16x16 Words: Transformers for Image Recognition

AI Bites

Просмотров 7 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 27 дек 2024

Комментарии •

@suke933 2 года назад ⁺¹
Once again the better way of illustrating the recent knowledge. Thanks a lot.
@BiranchiNarayanNayak 2 года назад ⁺¹
Excellent explanation. i love it.
@VikashVerma-c3v Год назад ⁺¹
Nice explanation
@AIBites Год назад
Thank you Vikash! 😊
@sdsgnitromax8632 4 года назад
Hi! Could you please elaborate on task transfer? You gave an example of classification of dogs and cats as task 1 and task 2 of horses and elephants. How does knowledge transfer work here?
@AIBites 4 года назад
Thanks for your comments. To elaborate on that, appearance wise all are 4 legged creatures. So the knowledge that classes horses and elephants are similar to classes dogs and cats should be transferred from task 1 to task 2. I spoke more from the perspective of meta learning where we train in episodes and each episode is a task. Hope it make sense now. Or perhaps the example wasn't the best.
@lisabecker3246 3 года назад
Thanks for the great video! Do you mean BERT instead of BIRT when you mention the class token?
@AIBites 3 года назад
Yes, thats a good spot Lisa. I meant BERT! :)
@user-or7ji5hv8y 4 года назад ⁺²
this is a very good presentation
@AIBites 4 года назад
Thank you very much!
@vipulmehra1925 3 года назад
How to carry regression with Vision Transformer?
@AIBites 3 года назад
Thanks for your question. It is the same as how you do with any neural network or CNN architecture. Instead of training your output with a softmax cross-entropy you can train with a L1 or L2 loss.
@godwinrayan4110 3 года назад
Great video! Would be nice if you could also post one about DETR and deformable DETR:)
@AIBites 3 года назад ⁺¹
Thanks Godwin.. I have a video on DETR. Will do them at some point :)
@Holman57 4 года назад
Very cool
@AIBites 4 года назад
Thank you very much!
@navinbondade5365 3 года назад
great video, can. you making coding video
@AIBites 3 года назад
ya sure, in the future videos! ☺

Следующие

Автовоспроизведение

Vision Transformer Basics

Vision Transformer Basics

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)

Transformers (how LLMs work) explained visually | DL5

Transformers (how LLMs work) explained visually | DL5

Making Cookies For Santa

Making Cookies For Santa

Warfare | Official Trailer HD | A24

Warfare | Official Trailer HD | A24

where i have been.

where i have been.

AMAD WORLD CLASS! MAN CITY 1-2 MAN UTD GOLDBRIDGE MATCH REACTION

AMAD WORLD CLASS! MAN CITY 1-2 MAN UTD GOLDBRIDGE MATCH REACTION

Vision Transformer (ViT) - An image is worth 16x16 words | Paper Explained

Vision Transformer (ViT) - An image is worth 16x16 words | Paper Explained

Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"

Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"

Attention in transformers, visually explained | DL6

Attention in transformers, visually explained | DL6

DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained)

DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained)

CoAtNet: Marrying Convolution and Attention for All Data Sizes

CoAtNet: Marrying Convolution and Attention for All Data Sizes

LLM-Based Reasoning: Opportunities and Pitfalls (LAVA Workshop in ACCV 2024)

LLM-Based Reasoning: Opportunities and Pitfalls (LAVA Workshop in ACCV 2024)

Vision Transformer and its Applications

Vision Transformer and its Applications

The Dome Paradox: A Loophole in Newton's Laws

The Dome Paradox: A Loophole in Newton's Laws

Transformers in Vision: From Zero to Hero

Transformers in Vision: From Zero to Hero

有同感的宝妈宝爸们吗？感觉看见太奶了！！#看一遍笑一遍 #宝爸带娃 #人类幼崽 #亲子日常 #露兮粑粑

有同感的宝妈宝爸们吗？感觉看见太奶了！！#看一遍笑一遍 #宝爸带娃 #人类幼崽 #亲子日常 #露兮粑粑

Момент падения самолета вблизи Актау

Момент падения самолета вблизи Актау

Dad Vs Son Telepathy TEST

Dad Vs Son Telepathy TEST

Что говорят официальные лица и что известно об основных версиях крушения самолета в Актау

Что говорят официальные лица и что известно об основных версиях крушения самолета в Актау

СКИНЫ НА ГРАНАТЫ! ОБЗОР НОВЫХ СКИНОВ В ОБНОВЛЕНИИ STANDOFF 2 0.32.0 KITSUNE DREAMS

СКИНЫ НА ГРАНАТЫ! ОБЗОР НОВЫХ СКИНОВ В ОБНОВЛЕНИИ STANDOFF 2 0.32.0 KITSUNE DREAMS

НАША ПЕРВАЯ ТАЧКА В ЯПОНИИ! Встреча с Королём Дрифта!

НАША ПЕРВАЯ ТАЧКА В ЯПОНИИ! Встреча с Королём Дрифта!

Столкнул Двух Героев В РЕЖИМЕ БОГА!😰 Трент против Зомби!

Столкнул Двух Героев В РЕЖИМЕ БОГА!😰 Трент против Зомби!

Incredibox Sprunki . Игра в кальмара -попробуй пройти стеклянный мост !

Incredibox Sprunki . Игра в кальмара -попробуй пройти стеклянный мост !