This is the only video around that REALLY EXPLAINS the transformer! I immensely appreciate your step by step approach and the use of the example. Thank you so much 🙏🙏🙏
Glad it was helpful!
exactly
Truly, I went through several Medium blogs and videos as well, but this lecture gave me immense clarity on each step of the Transformer. Thank you!
I had watched 3 or 4 videos about transformers before this tutorial. Finally, this tutorial made me understand the concept of transformers. Thanks for your complete and clear explanations and your illustrative example. Specially, your description about query, key and value was really helpful.
You're very welcome!
Very well explained. Most people did not explain the transformer the way you did. You made it easy for a new student to learn. Thanks!
Glad it helped
Very nice high level description of Transformer
Glad you think so!
You're a life saver. Thank you sooo much. I've tried GPT, tried different articles, but it's only now that I'm getting the whole concept
I'm glad I could help! 😊
Well explained. Before watching this video I was very confused about how transformers work, but your video helped me a lot.
Glad my video is helpful!
I accidentally came across this video; very well explained. You are doing an excellent job.
Glad it was helpful!
Ma'am, we are eagerly hoping for a comprehensive Machine Learning and Computer Vision playlist. Your teaching style is unmatched, and I truly wish your channel reaches 100 million subscribers! 🌟
Thank you so much for your incredibly kind words and support!🙂 Creating a comprehensive Machine Learning and Computer Vision playlist is an excellent idea, and I'll definitely consider it for future content.
Great explanation Aarohi. Thank you.
Glad it was helpful!
Very well explained. Even with such a niche viewer base, please keep making more of these.
Thank you, I will
So nicely explained. Thank you so much!
Welcome!
Can you please let us know the input to the masked multi-head attention? You just said "decoder". Can you please explain? Thanks.
Thank you very much for explaining and breaking it down 😀 So far, your explanation is the easiest to understand compared to other channels. Thank you very much for making this video and sharing it with everyone ❤
Glad it was helpful!
Thank you for explaining so well.
You're very welcome!
This is a fantastic, very good explanation.
Thank you so much for the good explanation.
Glad it was helpful!
Great video, ma'am. Could you please clarify what you said at 22:20 once again? I think there was a bit of confusion there.
same here
Wow.. you are amazing. Thank you for the clear explanation
You're very welcome!
The best explanation of the transformer that I have found on the internet. Can you please make a detailed, long video on transformers with theory, mathematics, and more examples? I am not clear about the linear and softmax layers and what is done after that, how training happens, and how transformers work on test data. Can you please make a detailed video on this?
I will try to make it after finishing the pipelined work.
@@CodeWithAarohi Thanks will wait for the detailed transformer video :)
Best explanation. I watched multiple videos, but this one made the concept clear. Keep it up!
Glad to hear that
Very Good Video Ma'am, Love from Gujarat, Keep it up
Thanks a lot
Very well explained! I could instantly grasp the concept. Thank you, Miss!
Glad it was helpful!
excellent explanation madam... thank you so much
Thanks and welcome
Nice explanation of such a complex topic.
Thanks!
Well Explained
Thanks!
Best video ever, explaining the concepts in a really lucid way, ma'am. Thanks a lot, please keep posting. I subscribed 😊🎉
Thanks and welcome
Great explanation! Keep uploading such nice informative content.
Thank you, I will
lovely and deep explanation provided
Glad it was helpful!
Hello Ma’am
Your AI and Data Science content is consistently impressive! Thanks for making complex concepts so accessible. Keep up the great work! 🚀 #ArtificialIntelligence #DataScience #ImpressiveContent 👏👍
Thank you!
you explained very nicely
Thank you so much 🙂
Great Explanation mam
Glad you liked it
Really very nice explanation ma'am!
Glad my video is helpful!
Great Explanation, Thanks
Glad it was helpful!
Nice tutorial
Thanks
Your video is good and the explanation is excellent. The only negative I felt was the background noise; please use a better mic with noise cancellation. Thank you once again for this video.
Noted! I will take care of the noise :)
Well explained . Thank you 🙏
Glad it was helpful!
Hi, good explanation, but at the end, when you explained what the input to the decoder's masked multi-head attention would be, you fumbled and didn't explain it clearly. The rest of the video was very good, though.
Thank you for the feedback!
During inference, the decoder starts from a start-of-sequence token, and after that it consumes its own previously generated output as input, while the encoder output reaches the decoder through the cross-attention layer. The decoder generates one word at a time.
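The generation loop described above can be sketched in a few lines of NumPy. This is only a toy illustration of the autoregressive idea, not the video's code: `decoder_step` is a made-up stub standing in for the real masked self-attention, cross-attention, and feed-forward layers, and the vocabulary is invented.

```python
import numpy as np

# Toy vocabulary and a stub "decoder step". A real decoder would run masked
# self-attention over `generated`, cross-attention over `enc_out`, and a
# feed-forward block; here a simple stub stands in for all of that.
VOCAB = ["<sos>", "hello", "world", "<eos>"]

def decoder_step(enc_out, generated):
    step = len(generated)
    scores = enc_out.sum() * np.ones(len(VOCAB))
    scores[min(step, len(VOCAB) - 1)] += 1.0  # make the "next" word score highest
    return scores

def greedy_decode(enc_out, max_len=5):
    generated = ["<sos>"]                     # decoding starts from a start token
    while len(generated) < max_len:
        scores = decoder_step(enc_out, generated)
        next_word = VOCAB[int(np.argmax(scores))]
        generated.append(next_word)           # feed the model's own output back in
        if next_word == "<eos>":
            break
    return generated

enc_out = np.ones((3, 4))  # pretend encoder output for a 3-token source sentence
print(greedy_decode(enc_out))  # → ['<sos>', 'hello', 'world', '<eos>']
```

The key point is the loop structure: the encoder output stays fixed while the decoder's own input grows by one token per step.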
Can you please make a detailed video explaining the Attention is all you need research paper line by line, thanks in advance :)
Noted!
Great work mam
Thanks a lot
The best
Thank you!
Just amazing explanation 👌
Thanks a lot 😊
Not clear about the input of the masked attention layer.
can you please explain 22:07 onward
She is not going to reply; she only replies to happy praise comments and ignores questions, lol... I know she messed up at the end and didn't know what to say, but overall it was a nice attempt. Additionally, she entirely skipped CROSS ATTENTION and just mumbled around the concept without introducing the terminology.
Great
Thanks!
Thanks for making such an informative video. Please could you make a video on the transformer for image classification or image segmentation applications.
Will cover that soon
This explanation is nice. Can you do a practical video on how to implement this transformer model for sentiment analysis in Python?
Noted!
Question about query, key, value dimensionality
Given that:
the query is a word that is looking for other words to pay attention to, and
the key is a word that is being looked at by other words,
shouldn't the query and key be vectors whose size is the same as the number of input tokens? That way, when there is a dot product between the query and the key, the querying word could be correctly (positionally) dot-producted with the key to get the self-attention value for that word.
The dimensionality of query, key, and value vectors in transformers is a hyperparameter, not directly tied to the number of input tokens. The dot product operation between query and key vectors allows the model to capture relationships and dependencies between tokens, while positional information is often handled separately through positional embeddings.
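A small NumPy sketch makes the shape argument in the reply concrete. The sizes here (5 tokens, model width 16, key width 8) are made up for illustration; the point is that the per-token score matrix comes out `(n_tokens, n_tokens)` even though each query/key vector has length `d_k`, not `n_tokens`.

```python
import numpy as np

n_tokens, d_model, d_k = 5, 16, 8   # d_k is a hyperparameter, not tied to n_tokens

rng = np.random.default_rng(0)
X = rng.standard_normal((n_tokens, d_model))   # one embedding row per token
W_q = rng.standard_normal((d_model, d_k))      # learned projection matrices
W_k = rng.standard_normal((d_model, d_k))

Q, K = X @ W_q, X @ W_k                        # each row: one token's query/key vector
scores = Q @ K.T / np.sqrt(d_k)                # scaled dot-product attention scores

# Every token is scored against every token via the matrix product,
# so positional pairing happens through the (n_tokens, n_tokens) score
# matrix, not through the length of the individual vectors.
print(Q.shape, K.shape, scores.shape)          # (5, 8) (5, 8) (5, 5)
```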
excellent explanation
Glad you liked it!
Thanks Aarohi 😇
Glad it helped!
Very high level, but perfect!
Thanks!
Very well explained
Thanks for liking
Nice explanation Ma'am.
Thank you! 🙂
Hello, and thank you so much. One question: I don't understand where the numbers in the word embedding and positional encoding come from.
It's great. I have only one query: what is the input of the masked multi-head attention? It's not clear to me; kindly guide me on it.
Great Content
Thanks!
I think maybe the input to the masked multi-head attention was not explained correctly.
Thank you for your message. Please share in detail.
can you please upload the presentation
Thank you. The concept has been explained very well. Could you please also explain how these query, key and value vectors are calculated?
Sure, Will cover that in a separate video.
Thanks. The concept was explained very well. Could you please add one custom example (e.g. finding similar questions) using Transformers?
Will try
Ma'am, can you please make a video on classification using multi-head attention with a custom dataset?
Will try
Could you make a video on image classification for vision transformer, madam ?
Sure, soon
Thank you so much
Welcome!
I didn't understand what the input to the masked multi-head self-attention layer in the decoder is. Can you please explain it to me?
In the Transformer decoder, the masked multi-head self-attention layer takes three inputs: Queries (Q), Keys (K), and Values (V).
Queries (Q): These are vectors representing the current positions in the sequence. They are used to determine how much attention each position should give to other positions.
Keys (K): These are vectors representing all positions in the sequence. They are used to calculate the attention scores between the current position (represented by the query) and all other positions.
Values (V): These are vectors containing information from all positions in the sequence. The values are combined based on the attention scores to produce the output for the current position.
The masking in the self-attention mechanism ensures that during training, a position cannot attend to future positions, preventing information leakage from the future.
In short, the masked multi-head self-attention layer helps the decoder focus on relevant parts of the input sequence while generating the output sequence, and the masking ensures it doesn't cheat by looking at future information during training.
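The masking described in that reply can be shown in a short NumPy sketch. This is a single-head, didactic version (real transformers use multiple heads and learned projections inside a framework like PyTorch); the dimensions and random weights are illustrative only.

```python
import numpy as np

def masked_self_attention(X, W_q, W_k, W_v):
    """Single-head self-attention with a causal mask (didactic sketch)."""
    Q, K, V = X @ W_q, X @ W_k, X @ W_v
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    n = scores.shape[0]
    # Causal mask: position i may only attend to positions <= i,
    # so future positions get a score of -inf before the softmax.
    mask = np.triu(np.ones((n, n), dtype=bool), k=1)
    scores[mask] = -np.inf
    # Row-wise softmax; exp(-inf) = 0, so future positions get zero weight.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

rng = np.random.default_rng(1)
n, d = 4, 6
X = rng.standard_normal((n, d))
W = [rng.standard_normal((d, d)) for _ in range(3)]
out, weights = masked_self_attention(X, *W)
print(np.round(weights, 2))  # upper triangle is all zeros: no peeking ahead
```

Printing `weights` shows a lower-triangular matrix: each row sums to 1, and everything above the diagonal is exactly zero, which is the "no cheating by looking at future tokens" property.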
Can you please make a video on bert?
I will try!
Can you also talk about the purpose of the 'feed forward' layer? It looks like it's only there to add non-linearity. Is that right?
Yes, you can say that, but maybe also to further transform the representations coming out of the key, query, and value attention step.
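For concreteness, the position-wise feed-forward block being discussed is just two linear layers with a ReLU in between, applied to every token independently. This NumPy sketch uses made-up sizes (`d_model=8`, `d_ff=32`); in the original paper the inner width `d_ff` is larger than `d_model`.

```python
import numpy as np

def feed_forward(X, W1, b1, W2, b2):
    """Position-wise feed-forward block: same weights applied to each token row."""
    hidden = np.maximum(0, X @ W1 + b1)   # ReLU supplies the non-linearity
    return hidden @ W2 + b2

d_model, d_ff, n = 8, 32, 3               # d_ff expands, then projects back down
rng = np.random.default_rng(2)
X = rng.standard_normal((n, d_model))
W1, b1 = rng.standard_normal((d_model, d_ff)), np.zeros(d_ff)
W2, b2 = rng.standard_normal((d_ff, d_model)), np.zeros(d_model)

out = feed_forward(X, W1, b1, W2, b2)
print(out.shape)   # (3, 8): same shape in and out, one vector per token
```

Because the block mixes nothing across positions, processing one token alone gives the same result as processing it inside the batch, which is what "position-wise" means.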
Could you explain with python code which would be more practical. Thanks for sharing your knowledge
Sure, will cover that soon.
thank you mam
Most welcome 😊
Hello ma'am, is this transformer concept the same as for transformers in NLP?
The concepts of transformers in computer vision and transformers in natural language processing (NLP) are related but not quite the same.
Our ma'am must have learned from you too, but I couldn't understand anything in her class.
Ohh... did you understand it from the video?
@CodeWithAarohi Yes, I have an exam tomorrow. Thank you!
May Allah keep you happy ♥️
Good luck for your exam 😊
Doing phenomenal work.
Thanks!
I thought it was about transformers in CV, but all the explanations were in NLP.
I recommend you understand this video first and then check this one: ruclips.net/video/tkZMj1VKD9s/видео.html. After watching these two videos, you will properly understand the concept of transformers used in computer vision. Transformers in CV are based on the idea of transformers in NLP, so it's better for understanding if you learn them in the order I suggested.
How can I get the PDFs, ma'am?
Gonna tell my kids this was optimus prime.
Haha, I love it! Optimus Prime has some serious competition now :)
Why don't you try to explain in Hindi? We can understand English, but we struggle to go from English to imagination for a new topic.
Hindi tutorial: ruclips.net/video/uJhVLjZfmo8/видео.html
Please use a mic; the background noise is irritating.
Noted! Thanks for the feedback.
Speaking in Hindi would be better.
Sorry for the inconvenience.
Thank you mam
Most welcome 😊