Retrieval-Augmented Generation chatbot, part 1: LangChain, Hugging Face, FAISS, AWS

RAG But Better: Rerankers with Cohere AI

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

Crumbl Conspiracy Investigation

We SURVIVED A Hurricane!

10 Things You SHOULD Be Buying at Costco in September 2024

Retrieval-Augmented Generation chatbot, part 2 - LangChain, Hugging Face, OpenSearch, AWS

Julien Simon

Просмотров 7 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 10 сен 2024
We'll walk you through the creation of a Retrieval-Augmented Generation (RAG) chatbot using open-source tools and AWS services like LangChain, Hugging Face, Amazon SageMaker, and Amazon OpenSearch Serverless.
Part 1: • Retrieval-Augmented Ge... - LangChain, Hugging Face, FAISS, Amazon SageMaker, and Amazon TextTract.
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at julsimon.subst.... ⭐️⭐️⭐️
We start by deploying Mistral 7B, a cutting-edge open-source LLM, onto a SageMaker endpoint. Following this, we work with the Reuters dataset, a Hugging Face dataset comprising 20,000 news articles. We break down these articles into smaller sections and apply bge-small, a compact open-source embedding model, to them.
Next, we proceed to index these sections into an Amazon OpenSearch Serverless vector index, which we then query through LangChain.
Additionally, aside from the RAG demonstration, we delve into some vital yet often overlooked steps related to authentication and security for OpenSearch Serverless.
- Notebook: gitlab.com/jul...
- LangChain: www.langchain....
- Amazon OpenSearch Serverless: docs.aws.amazo...
- Embedding leaderboard: huggingface.co...
- Embedding model: huggingface.co...
- LLM: huggingface.co...

Комментарии • 21

@bhanuchirutha Месяц назад ⁺¹
great , I agree sometimes you have to spend a lot of time on IAM than the original problem what a mess
@juliensimonfr Месяц назад
Yes, even if you know what you're doing, it's difficult to be 100% sure 🤣
@pfunnell 10 месяцев назад
this is great, my son and I have both been working on something similar, each for different use cases, this is going to help both of us, salut!
@juliensimonfr 10 месяцев назад
Glad I could help!
@Martyniqo 2 месяца назад
Thanks a lot!
@juliensimonfr 2 месяца назад
You're welcome!
@mtin79 10 месяцев назад
Merci beaucoup! Very helpful 👍🏻
@juliensimonfr 10 месяцев назад
You're welcome!
@ravindranshanmugam782 10 дней назад
Excellent to bring out IAM issue, I am experiencing it. I am runnig program in Jy.notebook, not ec2. As admin user, how can I attach role (aoss) to achieve this. Thanks.
@juliensimonfr 9 дней назад
If you work on your local machine, you need AWS credentials (~/.aws/credentials) with enough permissions, and you need to set all roles explicitely with the arn, e.g. you can't use get_execution_role() in the SageMaker SDK. See ruclips.net/video/K3ngZKF31mc/видео.html
@Ben-gp5ty 5 месяцев назад
Julien, if we have a document in S3 that when deleted, i want to trigger a lambda to delete the chunks and embeddings in opensearch belonging to this document. How do I do so ?
@juliensimonfr 5 месяцев назад
Each chunk should have metadata on the source document, which you could use to query and delete.
@TheMrGoodkind 9 месяцев назад
This is really great! Thank you! If I want to add this RAG-augmented chatbot to my personal website, how would I do that?
@WagnerHeleno 8 месяцев назад
Hi Julien, your video is excelente. I have a question, with this solution (using opensearch service) is possible to deploy thought Lambda Service too?
@juliensimonfr 7 месяцев назад
Hi, serverless inference on AWS is interesting, but no GPUs...
@XShollaj 10 месяцев назад
Thank you Julien! Will there be a tutorial deploying this in a front end chat interface ?
@juliensimonfr 10 месяцев назад ⁺¹
no, I couldn't write UI code to save my life ;) Gradio has a chatbot interface, this would probably be a good place to start www.gradio.app/docs/chatbot
@XShollaj 10 месяцев назад
@@juliensimonfr Thank you! Highest standards for tutorials as always!
@sergioquintero4624 9 месяцев назад
Hi. Can you explain a little more about the cost of this PoC ? Thanks
@juliensimonfr 7 месяцев назад
Check the pricing for the AWS services involved :)
@caiyu538 10 месяцев назад
thumb up first and then watch.

Следующие

Автовоспроизведение

Retrieval-Augmented Generation chatbot, part 1: LangChain, Hugging Face, FAISS, AWS

Retrieval-Augmented Generation chatbot, part 1: LangChain, Hugging Face, FAISS, AWS

RAG But Better: Rerankers with Cohere AI

RAG But Better: Rerankers with Cohere AI

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

Crumbl Conspiracy Investigation

Crumbl Conspiracy Investigation

We SURVIVED A Hurricane!

We SURVIVED A Hurricane!

10 Things You SHOULD Be Buying at Costco in September 2024

10 Things You SHOULD Be Buying at Costco in September 2024

nba 2k25 has cured my depression for the moment..

nba 2k25 has cured my depression for the moment..

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

Deploying a Retrieval-Augmented Generation (RAG) in AWS Lambda

Deploying a Retrieval-Augmented Generation (RAG) in AWS Lambda

Aline Paponaud & Pietro Mele - Implementing an open-source RAG with OpenSearch

Aline Paponaud & Pietro Mele – Implementing an open-source RAG with OpenSearch

Setting Up a Amazon Opensearch (ElasticSearch) Cluster with Free Tier

Setting Up a Amazon Opensearch (ElasticSearch) Cluster with Free Tier

Hugging Face LLMs with SageMaker + RAG with Pinecone

Hugging Face LLMs with SageMaker + RAG with Pinecone

Vector Databases simply explained! (Embeddings & Indexes)

Vector Databases simply explained! (Embeddings & Indexes)

Reliable, fully local RAG agents with LLaMA3

Reliable, fully local RAG agents with LLaMA3

SageMaker JumpStart: deploy Hugging Face models in minutes!

SageMaker JumpStart: deploy Hugging Face models in minutes!

What are AI Agents?

What are AI Agents?

Introducing iPhone 16 | Apple

Introducing iPhone 16 | Apple

Вопрос Ребром - Булкин

Вопрос Ребром - Булкин

Кто же всё-таки такой Арсен Маркарян #сатир #пародия #satyr

Кто же всё-таки такой Арсен Маркарян #сатир #пародия #satyr

Москва или Питер? #амирансардаров #эльдарджарахов

Москва или Питер? #амирансардаров #эльдарджарахов

Евгений Кузнецов кого-то нам сильно напоминает🤬🔥 #КХЛ

Евгений Кузнецов кого-то нам сильно напоминает🤬🔥 #КХЛ

Мы ТУДА НЕ ПРОЕДЕМ…! Купили тачку и отправились в путешествие!

Мы ТУДА НЕ ПРОЕДЕМ…! Купили тачку и отправились в путешествие!

爸爸误以为钱生钱，怎料又被儿子套路了！ #funny #萌娃 #comedy

爸爸误以为钱生钱，怎料又被儿子套路了！ #funny #萌娃 #comedy

Apple Event - September 9

Apple Event - September 9