What is Retrieval-Augmented Generation (RAG)?

Поделиться
HTML-код
  • Опубликовано: 13 май 2024
  • Try RAG with watsonx → ibm.biz/BdMsRT
    Learn more about RAG→ ibm.biz/BdMsRt
    Large language models usually give great answers, but because they're limited to the training data used to create the model. Over time they can become incomplete--or worse, generate answers that are just plain wrong. One way of improving the LLM results is called "retrieval-augmented generation" or RAG. In this video, IBM Senior Research Scientist Marina Danilevsky explains the LLM/RAG framework and how this combination delivers two big advantages, namely: the model gets the most up-to-date and trustworthy facts, and you can see where the model got its info, lending more credibility to what it generates.
    Get started for free on IBM Cloud → ibm.biz/sign-up-now
    Subscribe to see more videos like this in the future → ibm.biz/subscribe-now

Комментарии • 364

  • @xzskywalkersun515
    @xzskywalkersun515 5 месяцев назад +466

    This lecturer should be given credit for such an amazing explanation.

    • @cosmicscattering5499
      @cosmicscattering5499 3 месяца назад +3

      I was thinking the same, she explained this so clearly.

    • @tariqmking
      @tariqmking Месяц назад +1

      Yes this was excellently explained, kudos to her.

    • @brianmi40
      @brianmi40 Месяц назад +6

      Or at least credit for being able to write backwards!

    • @victoriamilhoan512
      @victoriamilhoan512 6 дней назад

      The connection between a human answering a question in real life vs how LLMs (with or without RAG) do it was so helpful!

  • @vt1454
    @vt1454 6 месяцев назад +303

    IBM should start a learning platform. Their videos are so good.

    • @XEQUTE
      @XEQUTE 5 месяцев назад +6

      i think they already do

    • @srinivasreddyt9555
      @srinivasreddyt9555 Месяц назад

      Yes, they have it already. RUclips.

    • @siddheshpgaikwad
      @siddheshpgaikwad 25 дней назад +1

      Its mirrored video, she wrote naturally and video was mirrored later

    • @Hossam_Ahmed_
      @Hossam_Ahmed_ 24 дня назад

      They have skill build but not videos at least most of the content

    • @CaptPicard81
      @CaptPicard81 22 дня назад

      They do, I recently attended a week long AI workshop based on an IBM curriculum

  • @ghtgillen
    @ghtgillen 7 месяцев назад +47

    Your ability to write backwards on the glass is amazing! ;-)

    • @jsonbourne8122
      @jsonbourne8122 6 месяцев назад +20

      They flip the video

    • @Paul-rs4gd
      @Paul-rs4gd 3 месяца назад +8

      @@jsonbourne8122 So obvious, but I did not think of it. My idea was way more complicated!

  • @jordonkash
    @jordonkash 2 месяца назад +29

    4:15 Marina combines the colors of the word prompt to emphasis her point. Nice touch

  • @ericadar
    @ericadar 5 месяцев назад +51

    Marina is a talented teacher. This was brief, clear and enjoyable.

  • @geopopos
    @geopopos 2 месяца назад +40

    I love seeing a large company like IBM invest in educating the public with free content! You all rock!

  • @natoreus
    @natoreus 3 дня назад +1

    I'm sure it was already said, but this video is the most thorough, simple way I've seen RAG explained on YT hands down. Well done.

  • @ntoscano01
    @ntoscano01 4 месяца назад +21

    Very well explained!!! Thank you for your explanation of this. I’m so tired of 45 minute RUclips videos with a college educated professional trying to explain ML topics. If you can’t explain a topic in your own language in 10 minutes or less than you have failed to either understand it yourself or communicate effectively.

  • @vikramn2190
    @vikramn2190 7 месяцев назад +30

    I believe the video is slightly inaccurate. As one of the commenters mentioned, the LLM is frozen and the act of interfacing with external sources and vector datastores is not carried out by the LLM.
    The following is the actual flow:
    Step 1: User makes a prompt
    Step 2: Prompt is converted to a vector embedding
    Step 3: Nearby documents in vector space are selected
    Step 4: Prompt is sent along with selected documents as context
    Step 5: LLM responds with given context
    Please correct me if I'm wrong.

    • @DJ-lo8qj
      @DJ-lo8qj 25 дней назад

      I’m not sure. Looking at OpenAI documentation on RAG, they have a similar flow as demonstrated in this video. I think the retrieval of external data is considered to be part of the LLM (at least per OpenAI)

    • @PlaytimeEntertainment
      @PlaytimeEntertainment 23 дня назад

      I do not think retrieval is part of LLM. LLM is the best model at the end of convergence after training. It can't be modified rather after LLM response you can always use that info for next flow of retrieval

  • @TheAllnun21
    @TheAllnun21 5 месяцев назад +16

    Wow, this is the best beginner's introduction I've seen on RAG!

  • @aam50
    @aam50 5 месяцев назад +17

    That's a really great explanation of RAG in terms most people will understand. I was also sufficiently fascinated by how the writing on glass was done to go hunt down the answer from other comments!

  • @javi_park
    @javi_park 3 месяца назад +29

    hold up - the fact that the board is flipped is the most underrated modern education marvel nobody's talking about

    • @RiaKeenan
      @RiaKeenan 3 месяца назад

      I know, right?!

    • @euseikodak
      @euseikodak 3 месяца назад +4

      Probably they filmed it in front of a glass board and flipped the video on edition later on

    • @politicallyincorrect1705
      @politicallyincorrect1705 3 месяца назад

      Filmed in front of a non-reflective mirror.

    • @TheTomtz
      @TheTomtz Месяц назад

      Just simply write on a glass board ,record it from the other side and laterally flip the image! Simple aa that.. and pls dont distract people from the contents being lectured by thinkin about the process behind the rec🤣

    • @thewallstreetjournal5675
      @thewallstreetjournal5675 Месяц назад

      Is the board fliped or has she been flipped?

  • @maruthuk
    @maruthuk 7 месяцев назад +20

    Loved the simple example to describe how RAG can be used to augment the responses of LLM models.

  • @444Yielding
    @444Yielding 24 дня назад +3

    This video is highly underviewed for as informative as it is!

  • @m.kaschi2741
    @m.kaschi2741 5 месяцев назад +5

    Wow, I opened youtube coming from the ibm blog just to leave a comment. Clearly explained, very good example, and well presented as well!! :) Thank you

  • @ReflectionOcean
    @ReflectionOcean 5 месяцев назад +21

    1. Understanding the challenges with LLMs - 0:36
    2. Introducing Retrieval-Augmented Generation (RAG) to solve LLM issues - 0:18
    3. Using RAG to provide accurate, up-to-date information - 1:26
    4. Demonstrating how RAG uses a content store to improve responses - 3:02
    5. Explaining the three-part prompt in the RAG framework - 4:13
    6. Addressing how RAG keeps LLMs current without retraining - 4:38
    7. Highlighting the use of primary sources to prevent data hallucination - 5:02
    8. Discussing the importance of improving both the retriever and the generative model - 6:01

  • @Lucildor
    @Lucildor 3 месяца назад

    Please keep all these videos coming! They are so easy to understand and straightforward. Muchas gracias!

  • @GregSolon
    @GregSolon 3 месяца назад

    One of the easiest to understand RAG explanations I've seen - thanks.

  • @projectfocrin
    @projectfocrin 5 месяцев назад +5

    Great explanation. Even the pros in the field I have never seen explain like this.

  • @TheMsksk
    @TheMsksk 8 месяцев назад

    Great video as always. Thanks for sharing.

  • @Shailendrashail
    @Shailendrashail 8 месяцев назад +1

    Good Explanation of RAG. Thanks for sharing.

  • @paulaenchina
    @paulaenchina 4 месяца назад +1

    This is the best explanation I have seen so far for RAG! Amazing content!

  • @jyhherng
    @jyhherng 6 месяцев назад +5

    this let's me understand why the embeddings used to generate the vectorstore is a different set from the embeddings of the LLM... Thanks, Marina!

  • @kingvanessa946
    @kingvanessa946 3 месяца назад +1

    For me, this is the most easy-to-understand video to explain RAG!

  • @hamidapremani6151
    @hamidapremani6151 2 месяца назад +1

    The explanation was spot on!
    IBM is the go to platform to learn about new technology with their high quality content explained and illustrated with so much simplicity.

  • @redwinsh258
    @redwinsh258 6 месяцев назад +21

    The interesting part is not retrieval from the internet, but retrieval from long term memory, and with a stated objective that builds on such long term memory, and continually gives it "maintenance" so it's efficient and effective to answer. LLMs are awesome because even though there are many challenges ahead, they sort of give us a hint of what's possible, without them it would be hard to have the motivation to follow the road

  • @past_life_project
    @past_life_project 3 месяца назад

    I have watched many IBM videos and this is the undoubtedly the best ! I will be searching for your videos now Marina!

  • @rujmah
    @rujmah 2 месяца назад

    Brilliant explanation and illustration. Thanks for your hard work putting this presentation together.

  • @user-cd6hp5kc1n
    @user-cd6hp5kc1n 7 месяцев назад +16

    The ability to write backwards, much less cursive writing backwards, is very impressive!

    • @IBMTechnology
      @IBMTechnology  7 месяцев назад +7

      See ibm.biz/write-backwards

    • @jsonbourne8122
      @jsonbourne8122 6 месяцев назад

      Left hand too!

    • @NishanSaliya
      @NishanSaliya 5 месяцев назад

      @@IBMTechnology Thanks .... I was reading comments to check for an answer for that question!

  • @vnaykmar7
    @vnaykmar7 5 месяцев назад +2

    Such an amazing explanation. Thank you ma'am!

  • @afshinkarimi2382
    @afshinkarimi2382 8 месяцев назад

    Great video. Thanks for sharing

  • @rsu82
    @rsu82 День назад

    good explanation, it's very easy to understand. this video is the first one when I search RAG on RUclips. great job ;)

  • @evaiintelligence
    @evaiintelligence 25 дней назад

    Marina has done a great job explaining LLM and RAGs in simple terms.

  • @Anubis2828
    @Anubis2828 2 месяца назад

    Great, simple, quick explanation

  • @kunalsoni7681
    @kunalsoni7681 6 месяцев назад

    Thanks for letting us know about this feature of LLM :)

  • @mstarlingc
    @mstarlingc 5 месяцев назад +1

    Pretty simple explanation, thank you

  • @rvssrkrishna2
    @rvssrkrishna2 2 месяца назад

    Very precise and exact information on RAG in a nutshell. Thank you for saving my time.

  • @francischacko3627
    @francischacko3627 21 день назад

    perfect explanation understood every bit , no lags kept it very interesting ,amazing job

  • @bdouglas
    @bdouglas Месяц назад

    That was excellent, simple, and elegant! Thank you!

  • @rafa1rafa
    @rafa1rafa 5 месяцев назад +2

    Great explanation! The video was very didactic, congratulations!

  • @rockochamp
    @rockochamp 5 месяцев назад +1

    very well executed presentation.
    i had to think twice about how you can write in reverse but then i RAGed my system 2 :)

  • @HimalayJoriwal
    @HimalayJoriwal 2 месяца назад

    Best explanation so far from all the content on internet.

  • @khalidelgazzar
    @khalidelgazzar 5 месяцев назад +2

    Great explanation. Thank you!😊

  • @preciousrose2715
    @preciousrose2715 27 дней назад

    This was such an amazing explanation!

  • @Aryankingz
    @Aryankingz 7 месяцев назад +3

    That's what Knowledge graphs are for, to keep LLMs grounded with a reliable source and up-to-date.

  • @user-im6ub3sf6m
    @user-im6ub3sf6m 3 месяца назад

    Great explanation with an example. Thank you

  • @user-hk5dk9rb6p
    @user-hk5dk9rb6p 4 месяца назад +1

    Fantastic video and explanation. Thank you!

  • @sawyerburnett8319
    @sawyerburnett8319 4 месяца назад +1

    Wow, having a lightbulb moment finally after hearing this mentioned so often. Makes more sense now!

  • @laurentpastorelli1354
    @laurentpastorelli1354 4 месяца назад +1

    Super good and clear, well done!

  • @janhorak8799
    @janhorak8799 2 месяца назад +9

    Did all the speakers have to learn how to write in a mirrored way or is this effect reached by some digital trick?

    • @VlogBySKSK
      @VlogBySKSK 29 дней назад

      There is a digital mirroring technique which is used to show the content this way...

    • @mao-tse-tung
      @mao-tse-tung 21 день назад +2

      She was right handed before the mirror effect

  • @ashwinkumar675
    @ashwinkumar675 20 дней назад

    This is so well explained! Thank you 👍🏻✅

  • @gaemrpaterso-ri2jd
    @gaemrpaterso-ri2jd 8 месяцев назад

    Great video, you guys should do one on promising tech industries

  • @toenytv7946
    @toenytv7946 2 месяца назад +1

    Great down the rabbit hole video. Very deep and understandable. IBM academy worthy in my opinion.

  • @lauther_27
    @lauther_27 5 месяцев назад +1

    Amazing video, thanks IBM ❤

  • @AdarshKumar-kx2cn
    @AdarshKumar-kx2cn 2 месяца назад

    Beautifully explained....thanks

  • @user-bo1kv5zy3w
    @user-bo1kv5zy3w 7 месяцев назад

    Awesome explanation. Love you.

  • @JasonVonHolmes
    @JasonVonHolmes Месяц назад

    This was explained fantastically.

  • @johnmccullough7084
    @johnmccullough7084 6 месяцев назад +1

    Appreciate the succinct explanation. 👍

  • @xdevs23
    @xdevs23 Месяц назад +5

    The entire video I've been wondering how they made the transparent whiteboard

  • @oieieio741
    @oieieio741 5 месяцев назад

    Very Helpful! Great explanation. thx IBM

  • @ericmcnally5128
    @ericmcnally5128 2 месяца назад

    This is a fantastic lesson video.

  • @terencelewis4985
    @terencelewis4985 3 месяца назад

    Excellent explanation!

  • @zuzukouzina-original
    @zuzukouzina-original 3 месяца назад

    Very clear explanation, much respect 🫡

  • @eddisonlewis8099
    @eddisonlewis8099 3 месяца назад

    AWESOME EXPLANATION OF THE CONCEPT RAG

  • @star2k279
    @star2k279 4 месяца назад +1

    Thank you for such a great explanation.

  • @Junglytics
    @Junglytics 3 месяца назад

    Great video, excellent explanation!

  • @sprintwithcarlos
    @sprintwithcarlos 5 месяцев назад

    Great explanation!

  • @PaulGrew-wl7mh
    @PaulGrew-wl7mh Месяц назад

    An amazing explanation that made RAG understandable in about 4:23 minutes!

  • @alexiojunior7867
    @alexiojunior7867 Месяц назад

    wow this was an amazing Explanation ,very easy to understand

  • @aniket_mishr
    @aniket_mishr Месяц назад

    The explanation was very good 💯.

  • @AC-xd7sw
    @AC-xd7sw 4 месяца назад

    Insightful, please more video like this

  • @katsunoi
    @katsunoi 5 месяцев назад

    nice video - great explanation!

  • @Kekko400D
    @Kekko400D 3 месяца назад

    Fantastic explanation, proud to be an IBMer

  • @sk-6032
    @sk-6032 6 дней назад

    Very well explained 🙏🏼👍

  • @mayankbumb7272
    @mayankbumb7272 4 дня назад

    Great explanation

  • @rahulberry4806
    @rahulberry4806 17 дней назад

    thanks for the great explanation

  • @shashankshekharsingh9336
    @shashankshekharsingh9336 14 дней назад

    very good and clear explanation

  • @421sap
    @421sap 5 месяцев назад

    Thank you, Marina Danilevsky ....

  • @sumedhaj9017
    @sumedhaj9017 2 месяца назад

    Amazing explanation! Thank you:)

  • @mohammadsubhan1318
    @mohammadsubhan1318 3 месяца назад

    Nicely explained 👍

  • @deltawhiplash1614
    @deltawhiplash1614 10 дней назад

    This is a really good video thank you for sharing this knowledge

  • @shinemuphy
    @shinemuphy 5 месяцев назад

    Excellent explanation. thx

  • @mohamadhijazi3895
    @mohamadhijazi3895 Месяц назад

    The video is short and consice yet the delivery is very elegant. She might be the best instructor that have teached me. Any idea how the video was created?

  • @yashkhorania3726
    @yashkhorania3726 12 дней назад

    very nicely explained

  • @randomforest_dev
    @randomforest_dev 25 дней назад

    Very good explanation!

  • @carvalhoribeiro
    @carvalhoribeiro Месяц назад

    Amazing work. Thanks for sharing this.

  • @johanhumblet6090
    @johanhumblet6090 4 месяца назад

    very well explained!

  • @MraM23
    @MraM23 3 месяца назад

    Great lessons! Nice of you to step out 🙃 and make such engaging and educative content This is a very useful in helping us in critical thinking. Thank you for sharing this video. 👍
    Current ai models may impose neurotypical norms and expectations based on current data trained on . 🤔
    Curious to see more on how IBM approach the challenges and limitations of Ai

  • @user-xf4vm2gf6g
    @user-xf4vm2gf6g 3 месяца назад

    Excellent ! thank you for sharing this knowledge !

  • @sharingmatters
    @sharingmatters 2 месяца назад

    Well explained!

  • @ayanSaha13291
    @ayanSaha13291 25 дней назад

    Great video! thanks for educating!

  • @vic3sax972
    @vic3sax972 3 месяца назад

    Nice explanation

  • @prasannakulkarni5664
    @prasannakulkarni5664 Месяц назад

    the color coding on your whiteboard is really apt here !

  • @kallamamran
    @kallamamran 4 месяца назад +2

    We also need the models to cross check their own answers with the sources of information before printing out the answer to the user. There is no self control today. Models just say things. "I don't know" is actually a perfectly fine answer sometimes!

  • @MarshallMelnychuk
    @MarshallMelnychuk 8 месяцев назад +9

    Thank you Marina, very helpful and informative video. One question I have is; how do you make these videos like this? Being able to on a screen facing the camera, this is great. What's your secret?

    • @PeterCooperUK
      @PeterCooperUK 7 месяцев назад +10

      Sometimes these are done on transparent "whiteboards" and the video is then flipped horizontally.

    • @ChristopherSmithGPlus
      @ChristopherSmithGPlus 7 месяцев назад

      ruclips.net/video/eVOPDQ5KYso/видео.htmlsi=LADnROL0SF33Hg54

    • @IBMTechnology
      @IBMTechnology  7 месяцев назад +15

      See ibm.biz/write-backwards

    • @lowkeyproducktvt2101
      @lowkeyproducktvt2101 6 месяцев назад +1

      @@IBMTechnology okay now i get it !!!

    • @rayuduaddagarla3857
      @rayuduaddagarla3857 6 месяцев назад +3

      IBM should hire left hand writers so it will right handed after flip 😊

  • @AntenorTeixeira
    @AntenorTeixeira 5 месяцев назад

    That's the best video about RAG that I've watched

  • @paulc465
    @paulc465 5 месяцев назад

    thank you. very informative!

  • @siddharth4251
    @siddharth4251 Месяц назад

    Amazing explanation, finally i understand it.

  • @kd4pba
    @kd4pba 12 дней назад

    Good presentation.

  • @lewi594
    @lewi594 4 месяца назад +1

    This is brilliant and concise, helped make sense of a complex subject..
    Can this be implemented in a small environment with limited computing? Such that the retriever only has access to a closed data source