Agentic RAG: Make Chatting with Docs Smarter

Поделиться
HTML-код
  • Опубликовано: 19 янв 2025

Комментарии • 30

  • @engineerprompt
    @engineerprompt  6 месяцев назад

    Checkout the Advanced RAG course here: prompt-s-site.thinkific.com/courses/rag

    • @criticalnodecapital
      @criticalnodecapital 6 месяцев назад

      thanks.. Can you become the ISHOWSPEED of AI. also are you based in USA or Subcontinent?

    • @engineerprompt
      @engineerprompt  6 месяцев назад

      @@criticalnodecapital haha, that would be a good achievement :D I am based in the USA.

  • @unclecode
    @unclecode 6 месяцев назад +4

    So clear and simple compared to other libraries for building genetic pipelines. Intuitive and feels like it should've been in Hugging Face libraries from the start. Makes other libraries seem overly complex and unnecessary. Easy to create an LLM engine with just a callable class. You can build any structure, with complexity only from yourself, not the library. Not surprising from Hugging Face, just like how fine-tuning models with HF library is intuitive and easy. Love a simple, powerful library that doesn't over-abstract. This is the way. Thanks for sharing.

    • @engineerprompt
      @engineerprompt  6 месяцев назад +2

      Yeah, really like their implementation. Clean and straightforward.

  • @camerongolinsky
    @camerongolinsky 3 месяца назад +1

    Cool idea! When a course comes out focused on csv or databases, then I'll be there!

  • @rocio6454
    @rocio6454 4 месяца назад +1

    Thanks for the video! it would be interesting to see a multiagent and routing approach with 2 sources like a vector store for rag and a sql db, each one with their agents

  • @olympiasaha7165
    @olympiasaha7165 6 месяцев назад +2

    It would have been interesting to see if you would have used GPT-4o as the LLM engine in the traditional RAG method to compare it with the agentic RAG response.

  • @BamiCake
    @BamiCake 6 месяцев назад +3

    In your video the agentic rag takes about 4 times longer (15 sec). Is there a way to speed up agentic rag?

    • @engineerprompt
      @engineerprompt  6 месяцев назад

      Unfortunately, using agents in the loop with take longer than standard RAG since it has to make additional calls to the LLM and do retrieval again. Over time you can cache queries and responses for faster retrieval.

  • @anubisai
    @anubisai 6 месяцев назад +4

    Agentic RAG + Knowledge Graph would be bad ass. Someone steal my idea, please. 😂 🙏

  • @Jonzybeatz
    @Jonzybeatz 3 месяца назад

    thanks for the video.
    I would like to analyze PDF studies of several hundred pages and make summaries to extract insights.
    The problem is that I can't copy/paste the pdf into GPT because it goes beyond the context window.
    Can I use RAG to do this use case?
    The RAG seems to be designed more for answering specific questions from a knowledge base than for synthesizing documents.

  • @nobody84980
    @nobody84980 5 месяцев назад +1

    Why do I need an agent when I can add the agent description as a system prompt

    • @engineerprompt
      @engineerprompt  5 месяцев назад +2

      Agent has the ability to do multiple passes of retrieval if it's not able to find the info in the first pass. If you add this to the system prompt, I will just run once and can't repeat the process with reasoning and Planning.

  • @legendchdou9578
    @legendchdou9578 6 месяцев назад +1

    Great video can we use GROQ API for the LLM?

    • @Parthi97
      @Parthi97 5 месяцев назад

      It depends upon the prompt message you give.. Yes we can utilize GROQ models for simpler agentic RAG process

  • @paragshah2943
    @paragshah2943 6 месяцев назад

    OP, Under what circumstances might you have duplicate chunks? Is it becuase two files that are same with differnt names?

    • @engineerprompt
      @engineerprompt  6 месяцев назад

      Yes, that happens a lot. In big datasets, there can be duplicates.

  • @CreativeEngineering_
    @CreativeEngineering_ 6 месяцев назад +1

    I dont remember the last time I had and issue with hallucinations.

  • @sauxybanana2332
    @sauxybanana2332 6 месяцев назад +1

    how does this compare to graph rag?

    • @iukeay
      @iukeay 5 месяцев назад

      It really depends on your use case.
      GraphRDF is currently ten to twenty times more expensive. Also, depending on the type of data and the type of query, it could be useful for you or not.
      It also increases lag by a very substantial margin. I have not found any startups or ideas implementing graph-lag effectively and usable yet.
      If you do, please keep me in the loop.

  • @barackobama4552
    @barackobama4552 6 месяцев назад

    THANKS!

  • @finalfan321
    @finalfan321 6 месяцев назад

    too technical. where are friendly user interfaces websites/apps?