Cross Attention in Transformers | 100 Days Of Deep Learning | CampusX

  • Published: 28 Nov 2024

Comments • 102

  • @himanikumar7979
    @himanikumar7979 3 months ago +29

    Accidentally discovered this channel, and now it is my go-to channel for every DS/ML-related query! Kudos to you!! 🙌🏻

  • @turugasairam2886
    @turugasairam2886 3 months ago +14

    Thanks

    • @Shisuiii69
      @Shisuiii69 3 months ago

      Respect 🙌🏼

  • @sharangkulkarni1759
    @sharangkulkarni1759 3 months ago +7

    I have finished the playlist and watched some videos 3-4 times. It's poetry.

  • @faugno-1516
    @faugno-1516 3 months ago +13

    In my opinion, the best data science teacher on all of YouTube. I have completed the full ML and NLP playlists, and I am following this DL playlist in parallel with sir. It has really increased my knowledge and skills; many thanks to Nitish Sir for this amazing content. I regularly visit this channel and playlist for the next video on the Transformer architecture, and today I completed this one with full notes and some additional research. Super excited for the next video, sir.

    • @meeturiajaykumar.2384
      @meeturiajaykumar.2384 3 months ago +1

      Same here. Please complete the playlist ASAP, sir 🙇🙇🙏🙏🙏🙏

  • @sidhantatripathy9462
    @sidhantatripathy9462 3 months ago +4

    CampusX is a great platform for machine learning, deep learning, and other data science-related topics 👍

  • @jooeeemusic7963
    @jooeeemusic7963 3 months ago +5

    We are all in a pipeline of learning transformers from Nitish sir ❤️

  • @ersushantkashyap
    @ersushantkashyap 3 months ago +2

    Finally completed all 82 deep learning videos in 20-22 days, and I am now following along in parallel with you. By the end of 2024, almost 100 days from today, I have decided to finish your Machine Learning, EDA, Python, and Project playlists.
    Though I have almost 5-6 years of Power BI experience, I will still buy your 3000-rupee course. I know it was created under your leadership, so it will surely have something I am not aware of.
    As always, thanks again.

  • @wilfredomartel7781
    @wilfredomartel7781 20 days ago

    A course with deep explanations of, and experiments with, the attention mechanism is needed.

  • @MoosaMemon.
    @MoosaMemon. 3 months ago +8

    I had a feeling that you were going to post this today, as I had just watched your video on masked multi-head attention xD. I was happy that I had been done with the playlist for a while, but here comes the new one 😅

  • @Muslimplays.
    @Muslimplays. 3 months ago

    This is truly mind-blowing. I will definitely recommend this channel to all my friends.

  • @rischiraj786
    @rischiraj786 3 months ago +2

    Really great playlist 👏

  • @planetforengineers7176
    @planetforengineers7176 3 days ago

    Very nice explanation, thank you so much, sir.

  • @Mjjjyyy
    @Mjjjyyy 3 months ago +1

    I'd suggest you start a Data Engineering playlist as well; it is much needed.

  • @NickMaverick4
    @NickMaverick4 3 months ago +2

    Sir, please make a video on "Is data science dying?" A lot of videos on this are appearing on YouTube. Please give us some clarity. We follow you because we like your teaching style, which makes the topics easy to understand. Please make a video and let us know what changes we should make in our preparation.

  • @whothefisyash
    @whothefisyash 3 months ago +25

    Please complete it as early as possible.

    • @syedmansoor6067
      @syedmansoor6067 3 months ago +1

      Yeah please

    • @Shisuiii69
      @Shisuiii69 3 months ago

      Great things take time, brother.
      Just believe in Nitish Sir; we all know he is the 🐐

  • @BadBoy-yb5pq
    @BadBoy-yb5pq 3 months ago +1

    Please upload a video about "How to read a research paper and understand it", breaking down the mathematics, etc. PLEAAAASSSSSEEEEEEE

  • @jagadeeshmemories8760
    @jagadeeshmemories8760 3 months ago +2

    Each video is a diamond, sir ❤❤❤🎉

  • @Ahm77887
    @Ahm77887 3 months ago

    Thanks for this kind of deep-dive, knowledgeable content 😍❤

  • @1111Shahad
    @1111Shahad 3 months ago +1

    Thanks, Nitish

  • @Shisuiii69
    @Shisuiii69 3 months ago +2

    Sir, please also make code projects as examples for the encoder-decoder, the attention mechanism, and transformers; it would be very helpful.

  • @meherunfarzana
    @meherunfarzana 3 months ago +1

    YAYYY MY COMMENTS WORKED! NEW VIDEO! PLEASE KEEP RELEASING!

  • @VikrantPundir-gk6qz
    @VikrantPundir-gk6qz 7 days ago

    Thank you so much sir

  • @sharadsisodiya3853
    @sharadsisodiya3853 3 months ago +3

    Sir, it is good that we are moving ahead. I want to know: do we have any coding sessions on transformers?
    This series is going well, but some practical coding sessions are needed to really understand how it works. Please take this as a suggestion.

  • @meeturiajaykumar.2384
    @meeturiajaykumar.2384 3 months ago

    Sir, please complete this playlist ASAP. We are super excited to learn about LLMs and build related projects.

  • @charanpoojary4804
    @charanpoojary4804 14 days ago

    Thank you sir

  • @rishabhchoudhary0
    @rishabhchoudhary0 3 months ago

    Thank you for such a great explanation. Can you tell us when you will upload the Decoder Architecture video?

  • @technicalhouse9820
    @technicalhouse9820 2 months ago

    Sir, love you.
    From Pakistan

  • @Fazalenglish
    @Fazalenglish 3 months ago

    Please, sir, once the transformer videos are complete, make some projects so we can see their practical usage.

  • @SpotifyUnchained
    @SpotifyUnchained 3 months ago

    🙌🙌🙌 Thank you so much for each and every video; please try to plan a video on the Llama models' architecture too, sir.

  • @awe-nir-baan
    @awe-nir-baan 1 month ago

    Fascinating!

  • @SamiUllah-ql9my
    @SamiUllah-ql9my 3 months ago +4

    Sir, when are you launching the Deep Learning for NLP course?

  • @mohitmehndiratta5576
    @mohitmehndiratta5576 3 months ago

    Really helpful! ❤

  • @vinayakbhat9530
    @vinayakbhat9530 2 months ago

    excellent

  • @isBongNikky
    @isBongNikky 3 months ago +1

    As always great video

  • @AIMLVaibhavPawar
    @AIMLVaibhavPawar 3 months ago

    Please complete this playlist as soon as possible

  • @princekhunt1
    @princekhunt1 3 months ago +2

    Please complete the series sir

  • @salaarkhan8481
    @salaarkhan8481 3 months ago +5

    Please complete this playlist as soon as possible

    • @lomash_irl
      @lomash_irl 3 months ago +3

      Creating impactful material takes time

  • @shabirbhat1346
    @shabirbhat1346 3 months ago

    @campusX Hi! I hope you're doing well. First, I want to thank you for your videos; they've been incredibly informative. I have a request: could you create a few videos about Retrieval-Augmented Generation (RAG)? It would be great if you could explain it from the basics, including what RAG is, how it works, and details about vector databases. Thanks!

  • @waheedweins
    @waheedweins 3 months ago

    thanks for your helpful content...

  • @hammry_pommter
    @hammry_pommter 3 months ago

    For the first time since 10th grade, someone has compelled me to write in Hindi... nice playlist

  • @prasenjitsaha7217
    @prasenjitsaha7217 3 months ago +1

    How many concepts are there to learn before going to actual transformer architecture?

  • @abhisheksaurav
    @abhisheksaurav 3 months ago +1

    Sir, when are you launching the deep learning course?

  • @not_amanullah
    @not_amanullah 3 months ago

    This is helpful 🖤🤗

  • @zerotohero1002
    @zerotohero1002 3 months ago

    thank god sirji

  • @electricalengineer5540
    @electricalengineer5540 3 months ago

    legend is back

  • @arpitpathak7276
    @arpitpathak7276 3 months ago

    Thank you, sir ❤

  • @not_amanullah
    @not_amanullah 3 months ago

    Thanks ❤

  • @shobhitsingh6330
    @shobhitsingh6330 3 months ago

    What is the difference between this deep learning series and the Deep Learning for Computer Vision series that you offer on your channel as a paid course?

  • @abbasahmad6643
    @abbasahmad6643 3 months ago

    Sir, Please make a detailed video on the graph transformer.

  • @growithindia
    @growithindia 24 days ago

    How do we do cross-attention during inference, given that we do not know the future words? Do we again do the same thing we did for masked attention?

  • @muhammadikram375
    @muhammadikram375 3 months ago +1

    Sir, please complete the MLOps playlist 😢

  • @sarmadafzalj
    @sarmadafzalj 9 days ago

    @campusX
    I'm a little confused by this video. As I understand it, GPTs are trained with unsupervised learning, which means we don't have labels. Then how are we passing in the English-to-Hindi translation? Is it that the training data has to be curated this way?

  • @md.yasinarafat17
    @md.yasinarafat17 3 months ago

    ❤ Thanks, sir...

  • @RohitKumarGuptarkg
    @RohitKumarGuptarkg 3 months ago

    Sir, please clarify one thing. Are the encoder K, V static for all decoder layers, i.e. do we use the same K, V from the encoder's last layer? Or do the encoder K, V also evolve with the previous decoder layers?

  • @mayank5549
    @mayank5549 3 months ago

    Sir, please complete the deep learning playlist ASAP. Please, sir, it's a request.

  • @SanjayGupta-sv7vv
    @SanjayGupta-sv7vv 3 months ago

    Sir, could you please make a detailed video on Mojo vs Python?
    Will Mojo take over from Python?

  • @bhagatpandey369
    @bhagatpandey369 3 months ago

    thank you so much sir

  • @arifkhan-jz9vf
    @arifkhan-jz9vf 3 months ago

    Sir, which course is this part of?

  • @velugucharan8096
    @velugucharan8096 3 months ago +1

    Sir, we want the YOLO architecture.

  • @itsmovies24
    @itsmovies24 3 months ago

    Sir, please complete this playlist ASAP.

  • @Vishal-vb8og
    @Vishal-vb8og 2 months ago

    Hi Sir,
    Can you please share the link to the notes used in this playlist? It will help us revise the concepts quickly in the future.
    Thanks

  • @zerotohero1002
    @zerotohero1002 3 months ago

    thank you so much

  • @parometakarmmakar6620
    @parometakarmmakar6620 6 days ago

    Hello sir,
    Would it be possible to apply for a job after mastering only Power BI and SQL? Do you think it would be sufficient to secure a job?

  • @yamansaini6379
    @yamansaini6379 3 months ago

    Finally, one more gem 💎

  • @satyabharadwaj7779
    @satyabharadwaj7779 3 months ago

    I still can't figure out how the output tokens are known beforehand. Is this how the architecture works during training? There's no way to know the length of the output for a given input in advance. Could you explain in more depth how token "generation" happens? In the example you quoted, if the task itself is to translate an English sentence into Hindi, how does the decoder know which set of tokens to correlate with the input tokens?

    • @RohitKumarGuptarkg
      @RohitKumarGuptarkg 3 months ago

      During inference, the tokens are generated sequentially. In the first timestep, the encoder K, V interact with the start-of-sentence token. In the next timestep, the encoder K, V interact with the start-of-sentence token plus the first decoder output token. This goes on until the decoder outputs the end-of-sentence token.
      During training, as taught in the previous video, the decoder input is the known target sequence from the data, so the computation can be parallelized. We don't feed the decoder's actual output back in as input for the next step; we use the ground-truth token we know from the data.
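
      A minimal sketch of the two regimes described in the reply above. The function `toy_next_token` is a hypothetical stand-in for a real encoder-decoder model (it simply upper-cases the source, one token per step), and the `<sos>`/`<eos>` marker strings are assumptions for illustration, not names taken from the video.

```python
from typing import Callable, List, Tuple

SOS, EOS = "<sos>", "<eos>"  # assumed marker strings, for illustration only


def toy_next_token(source: List[str], prefix: List[str]) -> str:
    """Hypothetical model: given the source and the decoder prefix
    (starting with SOS), return the next target token."""
    target = [tok.upper() for tok in source] + [EOS]
    return target[len(prefix) - 1]


def greedy_decode(source: List[str],
                  next_token: Callable[[List[str], List[str]], str],
                  max_len: int = 20) -> List[str]:
    """Inference: generate one token at a time, feeding each prediction
    back in as decoder input until EOS (or max_len) is reached."""
    prefix = [SOS]
    while len(prefix) <= max_len:
        tok = next_token(source, prefix)
        if tok == EOS:
            break
        prefix.append(tok)
    return prefix[1:]  # drop the SOS marker


def teacher_forced_pairs(target: List[str]) -> Tuple[List[str], List[str]]:
    """Training: the decoder input is the ground-truth target shifted
    right by one position; the labels are the target followed by EOS."""
    return [SOS] + target, target + [EOS]


generated = greedy_decode(["we", "are", "friends"], toy_next_token)
decoder_input, labels = teacher_forced_pairs(["a", "b", "c"])
print(generated)
print(decoder_input, labels)
```

      In training, every position of `decoder_input` is available at once, which is what makes parallel computation possible; in `greedy_decode`, each step has to wait for the previous one.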

    • @campusx-official
      @campusx-official 3 months ago

      During training, the Hindi sentence is already available. How this works during inference, I will explain in a separate video.

    • @satyabharadwaj7779
      @satyabharadwaj7779 3 months ago

      @campusx-official I see. So this process is during training, got it!

    • @josebgeorge227
      @josebgeorge227 3 months ago

      @satyabharadwaj7779 So, sir, during training, are we using masking in the cross-attention section as explained in the previous video?

  • @arjunpaudel9278
    @arjunpaudel9278 3 months ago

    Why does the query vector come from the output sequence (Hindi), while the key and value vectors come from the input sequence (English)?
    As I understand it, the output sequence queries the input sequence, asking how similar each Hindi token is to each English token. The weights come from the dot product between the query and the key, and the value vectors are then used to compute the weighted sum.
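
    A small sketch of the computation this comment describes, using scaled dot-product attention with a softmax over the scores. It assumes the learned projection matrices are the identity (a simplification for illustration), and the numbers in `decoder_states` and `encoder_states` are made up.

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]


def softmax(row: List[float]) -> List[float]:
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries: Matrix, keys: Matrix,
                    values: Matrix) -> Tuple[Matrix, Matrix]:
    """Queries come from the decoder (output side); keys and values
    come from the encoder (input side)."""
    d_k = len(keys[0])
    weights = []
    for q in queries:
        # Similarity of this output token to every input token.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights.append(softmax(scores))
    d_v = len(values[0])
    # Each output row is a weighted sum of the encoder value vectors.
    outputs = [[sum(w * v[j] for w, v in zip(w_row, values))
                for j in range(d_v)]
               for w_row in weights]
    return outputs, weights


# Made-up numbers: 3 decoder (Hindi) tokens, 4 encoder (English) tokens.
decoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
encoder_states = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [-1.0, 0.0]]

outputs, weights = cross_attention(decoder_states, encoder_states,
                                   encoder_states)
for row in weights:
    print([round(w, 3) for w in row])
```

    The weight matrix has one row per output token and one column per input token, so the output keeps the length of the decoder sequence regardless of the input length.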

  • @ghostofuchiha8124
    @ghostofuchiha8124 3 months ago

    Hi Nitish, just a question not related to this video.
    I want to know how an ML model handles data once it is deployed in production. When we build a model, we scale the data, remove nulls, transform it, and then use it, but how does all this happen in an already deployed model? Everyday real-world data will be uncleaned. Please help, I am really confused. I can build ML, DL, and transformer models, but I am confused about how data preprocessing is handled after the model is deployed.
    Basically, how is all the preprocessing captured in the model so it can be used after deployment? Is it through ColumnTransformers and pipelines, are there other steps, or does it fall under the MLOps umbrella?

    • @campusx-official
      @campusx-official 3 months ago

      ruclips.net/video/xOccYkgRV4Q/видео.html

    • @ghostofuchiha8124
      @ghostofuchiha8124 3 months ago

      @campusx-official But how will it handle dropping columns? There is nothing in the pipeline that drops useless columns automatically, since during testing we provided only the required values, not all the columns present in the original dataset. How do I add a column-dropping step to the pipeline?

    • @josebgeorge227
      @josebgeorge227 3 months ago

      @ghostofuchiha8124 You can create a custom column transformer class that deletes the extra columns. You can use the TransformerMixin and BaseEstimator classes from the sklearn.base module.
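
      One possible sketch of the suggestion above, assuming the raw data arrives as a pandas DataFrame. The class name `ColumnDropper`, its `columns_to_drop` parameter, and the column names in the example are hypothetical; only `BaseEstimator`, `TransformerMixin`, `Pipeline`, and `StandardScaler` are scikit-learn's own names.

```python
import pandas as pd
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


class ColumnDropper(BaseEstimator, TransformerMixin):
    """Hypothetical custom transformer that drops unwanted columns."""

    def __init__(self, columns_to_drop=None):
        # Store the parameter unchanged so scikit-learn can clone the step.
        self.columns_to_drop = columns_to_drop

    def fit(self, X, y=None):
        # Nothing to learn; dropping columns is stateless.
        return self

    def transform(self, X):
        cols = self.columns_to_drop or []
        # errors="ignore" tolerates inputs that lack some of the columns.
        return X.drop(columns=cols, errors="ignore")


# Made-up example data with one column the model should never see.
raw = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "age": [20.0, 30.0, 40.0],
    "income": [1.0, 2.0, 3.0],
})

pipeline = Pipeline([
    ("drop", ColumnDropper(columns_to_drop=["customer_id"])),
    ("scale", StandardScaler()),
])

scaled = pipeline.fit_transform(raw)
kept = list(ColumnDropper(["customer_id"]).transform(raw).columns)
print(kept, scaled.shape)
```

      Because the drop step lives inside the pipeline, the same fitted object can be saved and applied to raw production rows that still carry the extra columns.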

  • @Duke-m4v
    @Duke-m4v 3 months ago

    Sir, is your DSMP 1.0 course good for data analysts?

  • @YSH-RA0
    @YSH-RA0 3 months ago

    Hello sir, I want my NLP model to work on voice without converting the voice to text... it should process the voice itself rather than the text.

  • @deebafarheen2270
    @deebafarheen2270 3 months ago

    Is the deep learning playlist completed, or is it still going on?

  • @AyushSingh-rx4iv
    @AyushSingh-rx4iv 3 months ago

    Today I made it to the comments so early ❤

  • @VijayPratapYadav-to7vn
    @VijayPratapYadav-to7vn 3 months ago

    Please upload the slides of the notes 🙏

  • @ozairkhan7285
    @ozairkhan7285 3 months ago

    Sir, Is there any chance for app deployment in 2024 for free?

  • @TechSpot56
    @TechSpot56 11 days ago

    🥰🥰🥰

  • @NickMaverick4
    @NickMaverick4 3 months ago

    Sir, please launch a free Streamlit course ❤

  • @neeleshsethi9914
    @neeleshsethi9914 3 months ago

    @CampusX When can we expect the paid Gen AI masterclass?

  • @aditygupta3434
    @aditygupta3434 3 months ago

    Please, I request that you remove the membership requirement on the Maths for Machine Learning and Deep Learning playlist.
    If you are not able to do that, please launch a free Maths for Deep Learning and Machine Learning playlist.
    Topics: 1. Statistics, 2. Linear Algebra, 3. Probability, 4. Derivatives

  • @SamiUllah-ql9my
    @SamiUllah-ql9my 3 months ago

    Sir, I am from Pakistan and I want to enroll in your Deep Learning for NLP course.

  • @Asifusain
    @Asifusain 3 months ago

    Finally

  • @KumR
    @KumR 3 months ago

    23

  • @WIN_1306
    @WIN_1306 3 months ago

    i am the 300th person to like this!!!!1

  • @zeroPunisher
    @zeroPunisher 3 months ago

    first 🥰
