Retrieval-Augmented Generation chatbot, part 2 - LangChain, Hugging Face, OpenSearch, AWS

  • Published: Sep 10, 2024
  • We'll walk you through the creation of a Retrieval-Augmented Generation (RAG) chatbot using open-source tools and AWS services like LangChain, Hugging Face, Amazon SageMaker, and Amazon OpenSearch Serverless.
    Part 1: • Retrieval-Augmented Ge... - LangChain, Hugging Face, FAISS, Amazon SageMaker, and Amazon Textract.
    ⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at julsimon.subst.... ⭐️⭐️⭐️
    We start by deploying Mistral 7B, a cutting-edge open-source LLM, onto a SageMaker endpoint. Following this, we work with the Reuters dataset, a Hugging Face dataset comprising 20,000 news articles. We break down these articles into smaller sections and apply bge-small, a compact open-source embedding model, to them.
    Next, we proceed to index these sections into an Amazon OpenSearch Serverless vector index, which we then query through LangChain.
    Beyond the RAG demonstration, we also cover some vital yet often overlooked steps related to authentication and security for OpenSearch Serverless.
    - Notebook: gitlab.com/jul...
    - LangChain: www.langchain....
    - Amazon OpenSearch Serverless: docs.aws.amazo...
    - Embedding leaderboard: huggingface.co...
    - Embedding model: huggingface.co...
    - LLM: huggingface.co...
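The chunking step mentioned in the description can be illustrated with a minimal sketch. This is not the code from the notebook: it is a plain-Python stand-in that splits on whitespace, and the function name `chunk_text` and the `chunk_size` and `overlap` parameters are assumptions chosen for illustration. A text-splitting utility from a library such as LangChain would normally do this job.

```python
def chunk_text(text: str, chunk_size: int = 256, overlap: int = 32) -> list[str]:
    """Split text into chunks of at most `chunk_size` words.

    Consecutive chunks share `overlap` words so that a sentence cut at a
    chunk boundary still appears whole in at least one chunk.
    (Hypothetical helper, not taken from the video's notebook.)
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current chunk reaches the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    article = " ".join(f"w{i}" for i in range(10))
    for chunk in chunk_text(article, chunk_size=4, overlap=1):
        print(chunk)
```

Each chunk would then be passed to the embedding model and indexed together with metadata identifying its source article.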

Comments • 21

  • @bhanuchirutha 1 month ago +1

    Great, I agree. Sometimes you have to spend more time on IAM than on the original problem. What a mess.

    • @juliensimonfr 1 month ago

      Yes, even if you know what you're doing, it's difficult to be 100% sure 🤣

  • @pfunnell 10 months ago

    this is great, my son and I have both been working on something similar, each for different use cases, this is going to help both of us, salut!

  • @Martyniqo 2 months ago

    Thanks a lot!

  • @mtin79 10 months ago

    Merci beaucoup! Very helpful 👍🏻

  • @ravindranshanmugam782 10 days ago

    Excellent that you brought up the IAM issue, I am experiencing it. I am running the program in a Jupyter notebook, not on EC2. As an admin user, how can I attach a role (aoss) to achieve this? Thanks.

    • @juliensimonfr 9 days ago

      If you work on your local machine, you need AWS credentials (~/.aws/credentials) with enough permissions, and you need to set all roles explicitly by ARN, e.g. you can't use get_execution_role() in the SageMaker SDK. See ruclips.net/video/K3ngZKF31mc/видео.html
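
To make the point about explicit ARNs concrete, here is a minimal sketch of an OpenSearch Serverless data access policy document that names its principals by ARN. It is not taken from the video: the helper name `build_data_access_policy`, the collection name, and the account ID and role name in the example are all placeholders, and the permission list is one plausible choice rather than a recommendation.

```python
import json


def build_data_access_policy(collection_name: str, principal_arns: list[str]) -> str:
    """Return a data access policy (JSON string) for one collection.

    Hypothetical helper: principals are passed as explicit IAM ARNs,
    which is what you need when no execution role can be looked up.
    """
    policy = [
        {
            "Description": f"Data access for {collection_name}",
            "Rules": [
                {
                    "ResourceType": "collection",
                    "Resource": [f"collection/{collection_name}"],
                    "Permission": [
                        "aoss:CreateCollectionItems",
                        "aoss:DescribeCollectionItems",
                        "aoss:UpdateCollectionItems",
                    ],
                },
                {
                    "ResourceType": "index",
                    "Resource": [f"index/{collection_name}/*"],
                    "Permission": [
                        "aoss:CreateIndex",
                        "aoss:DescribeIndex",
                        "aoss:ReadDocument",
                        "aoss:WriteDocument",
                    ],
                },
            ],
            "Principal": principal_arns,
        }
    ]
    return json.dumps(policy, indent=2)


if __name__ == "__main__":
    # Placeholder account ID and role name, for illustration only.
    print(build_data_access_policy(
        "rag-demo",
        ["arn:aws:iam::123456789012:role/my-notebook-role"],
    ))
```

The resulting string is the kind of document the OpenSearch Serverless API expects when creating an access policy of type `data`. The IAM identity itself also needs permission to call the OpenSearch Serverless data plane.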

  • @Ben-gp5ty 5 months ago

    Julien, when a document in S3 is deleted, I want to trigger a Lambda function to delete the chunks and embeddings in OpenSearch that belong to this document. How do I do so?

    • @juliensimonfr 5 months ago

      Each chunk should have metadata on the source document, which you could use to query and delete.
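
A minimal sketch of that idea, under stated assumptions: each chunk was indexed with the S3 URI of its source document in a metadata field (here the hypothetical `metadata.source.keyword`), and the Lambda function is triggered by S3 `ObjectRemoved` events. The helper names are illustrative. The code only builds the request bodies; sending them with an OpenSearch client is left out, as is pagination for documents with many chunks.

```python
import json
from urllib.parse import unquote_plus


def deleted_sources(event: dict) -> list[str]:
    """Extract s3:// URIs of deleted objects from an S3 event notification."""
    sources = []
    for record in event.get("Records", []):
        if not record.get("eventName", "").startswith("ObjectRemoved"):
            continue
        bucket = record["s3"]["bucket"]["name"]
        # Object keys are URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        sources.append(f"s3://{bucket}/{key}")
    return sources


def source_query(source: str, field: str = "metadata.source.keyword",
                 size: int = 1000) -> dict:
    """Search body matching every chunk that came from `source`.

    `field` is an assumption about how the metadata was mapped at indexing time.
    """
    return {"size": size, "_source": False, "query": {"term": {field: source}}}


def bulk_delete_body(index: str, hits: list[dict]) -> str:
    """Build a newline-delimited bulk body deleting each hit by its _id."""
    lines = [json.dumps({"delete": {"_index": index, "_id": hit["_id"]}})
             for hit in hits]
    return "\n".join(lines) + "\n" if lines else ""


if __name__ == "__main__":
    event = {"Records": [{"eventName": "ObjectRemoved:Delete",
                          "s3": {"bucket": {"name": "my-docs"},
                                 "object": {"key": "reports/q1+summary.pdf"}}}]}
    for src in deleted_sources(event):
        print(json.dumps(source_query(src)))
```

Searching for the matching IDs first and then deleting them in bulk is a deliberate choice here: it works even where a delete-by-query operation is not available.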

  • @TheMrGoodkind 9 months ago

    This is really great! Thank you! If I want to add this RAG-augmented chatbot to my personal website, how would I do that?

  • @WagnerHeleno 8 months ago

    Hi Julien, your video is excellent. I have a question: with this solution (using the OpenSearch service), is it possible to deploy through the Lambda service too?

    • @juliensimonfr 7 months ago

      Hi, serverless inference on AWS is interesting, but no GPUs...

  • @XShollaj 10 months ago

    Thank you Julien! Will there be a tutorial on deploying this in a front-end chat interface?

    • @juliensimonfr 10 months ago +1

      No, I couldn't write UI code to save my life ;) Gradio has a chatbot interface, which would probably be a good place to start: www.gradio.app/docs/chatbot

    • @XShollaj 10 months ago

      @juliensimonfr Thank you! Highest standards for tutorials as always!

  • @sergioquintero4624 9 months ago

    Hi. Can you explain a little more about the cost of this PoC? Thanks

    • @juliensimonfr 7 months ago

      Check the pricing for the AWS services involved :)

  • @caiyu538 10 months ago

    Thumbs up first, then watch.