What is RAG? (Retrieval Augmented Generation)

Поделиться
HTML-код
  • Опубликовано: 21 ноя 2024

Комментарии • 231

  • @dwoodlock
    @dwoodlock  7 месяцев назад +41

    Since this video turned out to be so successful and several people requested for me to do a deep dive / demo, here it is! Looking forward to reading your comments and hope you enjoy this one too. ruclips.net/video/P8tOjiYEFqU/видео.html

  • @hussamcheema
    @hussamcheema 8 месяцев назад +64

    One of the best explanation of RAG on RUclips. Thanks Don.

    • @NicolaiDufva
      @NicolaiDufva 8 месяцев назад +5

      I agree. Most other explanations are either way too detailed with live coding that muddles the information or way too high-level talking about how the LLM retrieves the additional data (which it doesn't! it is given to it via the prompt!)

  • @longship44
    @longship44 7 месяцев назад +8

    This is one of the best explanations of large language Models and the value of utilizing RAG I have seen. Don, you are an outstanding communicator. Thank you for taking the time to put this together.

  • @CodeVeda
    @CodeVeda 9 месяцев назад +72

    Finally someone is explaining with an real time example. Otherwise everyone else takes an example of fruits (apple, oranges etc) or movie names etc.

  • @eahmedshendy
    @eahmedshendy 9 месяцев назад +33

    Not confusing at all, just simple and get to the point explanation, thank you.

  • @slov1ker583
    @slov1ker583 Месяц назад +2

    10:18 wasnt confusing at all, your diagram was very helpful sir

  • @RyanRosario
    @RyanRosario Месяц назад +3

    This is much better than the IBM video. They make the assumption that the LLM is transparent and trained by the developer and that the prompt is a mix of word prompt and numerical vectors that are from a different embedding, which just seems wrong. This makes a lot more sense. Prompt -> Vector Database + Nearest Neighbors -> Top k -> Paragraphs ->

  • @BAZ82
    @BAZ82 8 месяцев назад +7

    I found your video to be the most accessible and informative introduction to RAG, especially for those new to this topic.

  • @abhiumn
    @abhiumn 20 дней назад +1

    You explained it very well even for an audience already from ML/NLP background.

  • @aryankushwaha9306
    @aryankushwaha9306 7 месяцев назад +2

    one of the best explanation i ever found. Now I finally understand what RAG is and thank you so much Mr. Don

  • @MrNewAmerican
    @MrNewAmerican 9 месяцев назад +10

    This is probably the best tutorial I have watched. Period. What an amazing teacher!

  • @YousefSharrab
    @YousefSharrab 9 месяцев назад +7

    Thank you. You are the first to explain RAG well. I have hear about a lot without understanding what does it mean.

  • @m.abdullahfiaz9635
    @m.abdullahfiaz9635 7 месяцев назад +2

    Thanks Prof. Don Woodlock you have explained exactly the same as I need to understand about my current project every concept maps to the practical part of project. Please deliver your knowledge more about advance and complex topics.👍

  • @eniniyathamizha2049
    @eniniyathamizha2049 Месяц назад +2

    Fantastic Simple and lucid. You are a wonderful teacher

  • @pragmaticgeek4616
    @pragmaticgeek4616 2 месяца назад +1

    Probably the best ever explanation on RAG

  • @steffenmuller2888
    @steffenmuller2888 8 месяцев назад +1

    I was looking for a general explanation to the RAG topic and you provide it very well! Now, I understand that the quality of RAG systems strongly depend on the information retrieval from the vector database. I will try to implement a RAG system on my own to learn something about it. Thank you very much!

  • @bryanbimantaka
    @bryanbimantaka 8 месяцев назад +2

    WOW! The simplest yet the best explanation! It's easy to understand for a beginner like me.
    THANK YOU!

  • @vinayakminde1090
    @vinayakminde1090 9 месяцев назад +4

    By far this is the most simple explaination for RAG I have came across. Amazing.
    Looking forward to next videos in series.

  • @jasonkey7063
    @jasonkey7063 8 месяцев назад +2

    Great explanation. I believe this has a big market for developers in small towns. Such an easy product to create and sell.

  • @MrFrubez
    @MrFrubez 8 месяцев назад +2

    Such a great explanation of RAG. It really helped me grasp the power of it.

  • @rahulkunal
    @rahulkunal 8 месяцев назад +1

    Thanks for such a simple explanation of the RAG Architecture Concepts.

  • @christopherhunt-walker6294
    @christopherhunt-walker6294 7 месяцев назад +6

    Wow he has explained this really clearly. This is the missing link for me between LLMs and making them actually useful for my projects. Thank you!

  • @Mar-or6wi
    @Mar-or6wi Месяц назад +1

    This great explanation on RAG enlightened me, thank you so much for this. It is very educational, warm and nicely delivered.

  • @chesaku
    @chesaku 7 месяцев назад +2

    Wow.. Job well done. Great and simplistic explanation for such complex topic.

  • @rp0000
    @rp0000 2 месяца назад +1

    The best and most complete explanation I found on RUclips❤

  • @rajmeets9303
    @rajmeets9303 6 месяцев назад +1

    Wow! Wonderful session. I loved the simplicity of explaining RAG. Thanks a lot Don.

  • @endourology
    @endourology 6 месяцев назад +2

    I finally understood what RAG is, including the vector part. Great!

  • @DavidBennell
    @DavidBennell 7 месяцев назад +2

    Great explanation, I have seen a lot of these and people normally go into far too much detail and muddy the water, or are far too abstract, fast and loose, or just get it wrong. I think this is a great level to cover this topic at.

  • @srinidigital4414
    @srinidigital4414 3 месяца назад +1

    You are Simple, Succinct and absolutely effective. I have understood RAG much better now. Thanks a ton !

  • @MuthukumaranPanchalingapuramKo
    @MuthukumaranPanchalingapuramKo 7 месяцев назад +3

    Best content on RAG!! Thank you!

  • @joeytribbiani735
    @joeytribbiani735 8 месяцев назад +2

    the best explanation of rag that've found thank you a lot

  • @coopernelson6947
    @coopernelson6947 7 месяцев назад +2

    Great video. I feel like this is the first time I'm learning stuff that is at the cutting edge. This video was posted 2 months ago, very exciting times

    • @gtarptv_
      @gtarptv_ 6 месяцев назад

      Same here I had no idea that RAG WAS BIG DEAL. I'VE BEEN READING STUFF ON REDDIT WORK PEOPLE TALKING ABOUT THE RAG THIS AND THAT

  • @vuven8930
    @vuven8930 4 месяца назад

    This explanation is absolutely S-Tier... Round of applause for this brother! What a great weaving of concepts.

  • @gt6808a
    @gt6808a 6 месяцев назад +1

    This has been the most helpful video I've found to help me understand how RAG works. Thank you so much for your wonderful explanation!

  • @incognito540
    @incognito540 4 месяца назад

    What a fantastic way to explain a rather complex topic, the multiple complex components in play without miring us in the tech minutia. Refreshing!

  • @johnny1966m
    @johnny1966m 9 месяцев назад +4

    Thank you very much for this video. Now is understand what my colleagues do in work with system documentation handling with use of LLM.:)

  • @itsAlabi
    @itsAlabi 8 месяцев назад

    This is really clear, this will customize the output based on the environment of the user not just on open source data.

  • @wendyhu6988
    @wendyhu6988 3 месяца назад +1

    Very clear explanation for the RAG in real world application!

  • @rsteinmannde
    @rsteinmannde 2 месяца назад +1

    Thanks a lot for this video. It helped me to understand how the question is treated for the vector db, so the whole pre-prompt part. Now finally RAG makes sense to me :) ThumbsUP!

  • @reply2noor
    @reply2noor Месяц назад +2

    Thank you Don , wonderful explanation..

  • @vaidyanathtdakshinamurthy8732
    @vaidyanathtdakshinamurthy8732 6 месяцев назад +1

    Hello Don Sir, thanks for this explanation. You're a blessed master craftsman. Simple and precise description and to the point.

  • @AshisRaj
    @AshisRaj 8 месяцев назад +2

    Excellent explanation Mr. Author

  • @bhaskarmazumdar9478
    @bhaskarmazumdar9478 7 месяцев назад +3

    This is an excellent explanation of the concept. Thank you Don

  • @travelchimps6637
    @travelchimps6637 7 месяцев назад +2

    9:20 not at all confusing, makes perfecf sense the way u exolained it thank you!!!

  • @PR03
    @PR03 8 месяцев назад +1

    great session dear Don. It was very complete, to the point and simply more advanced than other popular videos but of course in simple words. Thank you so much sir. ❤❤

  • @MichaelRuddock
    @MichaelRuddock 9 месяцев назад +3

    Thank you for sharing your knowledge with us, great explanation.

  • @DanielGonzalez-hq4gq
    @DanielGonzalez-hq4gq 28 дней назад +1

    Greatly explained. Quality content. Thank you very much!

  • @CollaborationSimplified
    @CollaborationSimplified 8 месяцев назад +1

    This was great, thank you! I believe this process is what Copilot for Microsoft 365 uses and it is referred to as ‘grounding’. Very helpful 👍

  • @AnuragBandi
    @AnuragBandi 3 месяца назад

    wow! 10:10 Nobody explained this in my past 5 videos about RAG! Thanks Don ❤

  • @MagusArtStudios
    @MagusArtStudios 8 месяцев назад

    I've been doing RAG and not even knowing the definition. Was glad to see I wasn't doing it wrong by injecting it into the end of the prompt.

  • @Tessi42
    @Tessi42 6 месяцев назад

    Thank you very much for this great presentation.
    Tomorrow I will use this video to describe in a few minutes to the decision-makers in my company what this RAG means.
    Many simply do not yet understand the simplicity behind the pattern and therefore the possibilities
    I find it difficult to present it in such simplicity
    Thank you very much🙂

  • @califfa2419
    @califfa2419 6 месяцев назад +1

    what a great explanation of RAG! Thank you

  • @steffimaxion-bergemann3251
    @steffimaxion-bergemann3251 10 дней назад

    that was the best explanation I have seen so far! Thank you very much!

  • @ewarthutton
    @ewarthutton 2 месяца назад +1

    Brilliant video, consise and clear. Many thanks.

  • @davutengin
    @davutengin 5 месяцев назад +1

    Best RAG explanation I think. Thanks.

  • @AhmedSherif-zh4zs
    @AhmedSherif-zh4zs 5 месяцев назад

    Extremely good and simple to understand. This is my first comment to share from ages

  • @easybachha
    @easybachha 9 месяцев назад +2

    Excellent explanation. Exactly what I was looking for! Thank you, Don!

  • @Ak_Seeker
    @Ak_Seeker 9 месяцев назад +4

    Awesome, thanks for the wonderful explanation in simple language

  • @brijeshsingh2103
    @brijeshsingh2103 Месяц назад +1

    So simple explanation 🤓.Thank you.

  • @nomorecramps
    @nomorecramps 5 месяцев назад

    I absolutely loved this explanation, this was so intuitive to understand

  • @ananthasubramanian3453
    @ananthasubramanian3453 2 месяца назад +1

    Greatly appreciated for this wonderfully explained video

  • @ClayBellBrews
    @ClayBellBrews 8 месяцев назад +1

    Great work; would really love to see you dig in on tokens and how they work as well.

  • @nadellaella6416
    @nadellaella6416 7 месяцев назад +3

    Bestt explanation! Thank youu Mr.Don!

  • @GG-uz8us
    @GG-uz8us 2 месяца назад +1

    Best explanation on RAG!

  • @arjbaid2024
    @arjbaid2024 9 месяцев назад +4

    Wonderful explanation of this topic. Thank you!

  • @OsamaAlatraqchi
    @OsamaAlatraqchi 3 месяца назад

    The best explanation ever fro RAG

  • @nexai_official
    @nexai_official 6 месяцев назад +1

    Thank you Don! The explanation was delightful

  • @inaccessiblecardinal9352
    @inaccessiblecardinal9352 10 месяцев назад +2

    Doing RAG stuff right now for work. Just scratching the surface, but very interesting stuff so far. We have a few clients on the horizon who really just need text classification, and the vanilla results from the vector DB might actually be good enough for them. Interesting territory coming fast.

    • @dwoodlock
      @dwoodlock  10 месяцев назад +1

      yes - I have found that pretty small LLMs (like BERT) do just fine for text classification.

  • @Themojii
    @Themojii 6 месяцев назад

    Great explanation of RAG. I subscribed to your channel after watching this. Thank you Don for the great content.

  • @angryITGuy
    @angryITGuy 4 месяца назад

    This was well explained, including the messy diagram .. lol .. thanks for that background!

  • @achen94
    @achen94 8 месяцев назад +1

    Amazing video. Thanks for the great explanation!

  • @kingofartsofficial4431
    @kingofartsofficial4431 9 месяцев назад +3

    Very Good Explanation Sir

  • @zandanshah
    @zandanshah 8 месяцев назад +1

    Good content, please share more.

  • @free8cki
    @free8cki 23 дня назад +1

    Thank you so much. You make it very easy to understand! 😊

  • @govindarajram8553
    @govindarajram8553 6 месяцев назад

    Just I watched from youtube suggestions and you me good explanations on Retrieval augmented generation closure to my use case. Thank you

  • @ComputingAndCoding
    @ComputingAndCoding 6 месяцев назад +1

    Awesome video! It helped explain the concept a great deal. well done👍

  • @infraia
    @infraia 5 месяцев назад

    This is one of the best RAG explanations I’ve seen so far!
    Wss it that hard?
    Well done 🎉

  • @sameenkunwar2231
    @sameenkunwar2231 3 месяца назад +1

    thank you sir for clearing concepts!

  • @LatentSpaceD
    @LatentSpaceD 8 месяцев назад

    Appreciate you and your content. I'm glad I found you again

  • @bigplumppenguin
    @bigplumppenguin 8 месяцев назад +1

    Very good introduction!!!

  • @Chandruhere4u
    @Chandruhere4u 4 месяца назад

    One of the best way of explanation.. thanks for doing this.

  • @sharanaasafiudeen392
    @sharanaasafiudeen392 5 месяцев назад

    Concise and simplified !! Thank you, Don !

  • @peterbedford2610
    @peterbedford2610 9 месяцев назад +1

    Sounds like it is optimizing or creating a more efficient prompt session? I guess "augmentation" is a fairly good description.
    Thank you. I enjoy your teaching style.

  • @miho9453
    @miho9453 14 дней назад +1

    This was so clear. Thank you!

  • @annadehek6533
    @annadehek6533 6 месяцев назад +2

    such a good explanation, thank you so much!!

  • @kerryboyde-preece892
    @kerryboyde-preece892 2 месяца назад +1

    Great explanation, thanks, Don :)

  • @gasfeesofficial3557
    @gasfeesofficial3557 5 месяцев назад +4

    This will simplify the "RAG" stuff -> take a general language model like GPT, Gemini, Llama, and tailor it to answer only things you care about -> sports scores, latest movies, celebrity gossip etc. -> by feeding the model your own data. Data can be real time like from a news website that offers an API or some textbook that is in PDF format.

    • @ikennai274
      @ikennai274 2 месяца назад

      You did well, however your explanation still falls under basic prompting. Yea there is some sort of retrieval in your explanation but it becomes RAG when their is some sort of retrieval that is generative and that is where the concept of vector database comes in.

  • @SrinivasJilla
    @SrinivasJilla 4 месяца назад

    Thanks for an excellent presentation.
    What tools are you using for drawing .. that was an impressive way to present

  • @data9928
    @data9928 Месяц назад +1

    Great video. Appreciate your work!

  • @stephenlii1744
    @stephenlii1744 8 месяцев назад

    it’s a pretty good explanation,thanks Don

  • @dab6726
    @dab6726 4 месяца назад

    Thank you, Don! Could you explain some difference between RAG (Vector DB) and Knowledge graph usage with LLM.

  • @dannysuarez6265
    @dannysuarez6265 9 месяцев назад +1

    Thank you for your great explanation sir!

  • @dharmakelleherauthor
    @dharmakelleherauthor 6 месяцев назад

    Thank you so much! This is a great, easy-to-follow explanation. Coincidentally, I'm having knee surgery tomorrow. LOL.

    • @dwoodlock
      @dwoodlock  5 месяцев назад

      Did you prepare properly??

  • @SuperCatbert
    @SuperCatbert 5 месяцев назад

    thanks. learned about RAG, and liked your style

  • @patiencemugisha19
    @patiencemugisha19 5 месяцев назад

    I was expecting something practical. But this is really helpful

  • @scottt9382
    @scottt9382 3 месяца назад +1

    Cautionary note on loading up your prompts: transformer-based LLMs are quadratic wrt infrastructure. Assuming you have a decent size context window, doubling your prompt size requires 4X as much infrastructure (GPUs, in particular). Leaning into complex prompts is very much a model-maker approaches this issue - but, it can be very impractical for those of us that actually build high ROI, production-grade GenAI workloads. Models are only a backend tool. So, while Anthropic, Google, etc. promote a future with million token context windows, this can be impractical for most 'real' prod workloads.

  • @herculesgixxer
    @herculesgixxer 8 месяцев назад +1

    loved your explanation, thank you

  • @shreyanshkhandelwal6499
    @shreyanshkhandelwal6499 4 месяца назад +1

    Best explanation ever.

  • @anngladyo5668
    @anngladyo5668 Месяц назад +1

    stellar explanation!!

  • @MateoGarcia-rt7xt
    @MateoGarcia-rt7xt 6 месяцев назад

    Thanks for this great explanation, Don!