Hi Eduardo, this is a really nice video, thank you. Do you think you could add a citation functionality, so that the user gets reassured about where the information was taken from? Thanks
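Even something simple would help, e.g. appending the sources of the retrieved chunks to the answer (rough sketch only; it assumes each retrieved doc carries a "source" field in its metadata, which may differ in your app):

```python
# Rough sketch: append the sources of the retrieved chunks to the answer.
def answer_with_citations(answer, docs):
    # Assumes each doc has metadata with a "source" key (URL or file path).
    sources = sorted({d.metadata.get("source", "unknown") for d in docs})
    citations = "\n".join(f"- {s}" for s in sources)
    return f"{answer}\n\nSources:\n{citations}"
```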
Very nice. The only challenge with this approach is the total cost of answering each query, and it could run forever in some cases until both LLMs agree or until you get the right relevant information from the search. I think if customers want a 100% guarantee and are not worried about latency, this will work really well.
Indeed, it'll depend on your use case, because in some cases you wouldn't sacrifice the quality of the responses for speed.
Surely this approach becomes more and more viable as the cost of newly released models keeps decreasing by 5x, 10x, etc., as we are currently seeing?
So this multi-shot RAG approach with a new model that's 5x cheaper is still less expensive than a single shot of its more expensive predecessor?
Exactly!
Awesome video. Thank you.
Glad you liked it!
Great video! But how do you break the loop after a few trials if the model gets stuck in an infinite loop during hallucination grading or answer-relevance grading?
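For example, I was imagining something like a retry counter in the graph state (just a sketch; none of these names are from the video):

```python
MAX_RETRIES = 3

def route_after_grading(state: dict, grounded: bool) -> str:
    # Sketch: count regeneration attempts in the state so the graph
    # can't loop forever between grading and generation.
    state["retries"] = state.get("retries", 0) + 1
    if grounded:
        return "useful"
    if state["retries"] >= MAX_RETRIES:
        return "give_up"  # stop regenerating, return the best answer so far
    return "retry"
```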
Thanks, good flow between RAG and web search, thanks!! :)
Thank you. I'm glad you found it interesting!
Great video, very nice
Thank you very much!
I've been searching for a self-correcting system because sometimes the responses I receive from LLMs aren't precise. Thank you so much for your help.
I'm glad it was helpful!
Nice one. Question: what if all the docs are marked as irrelevant chunks by the model? Do you need to query the vector DB again? I guess an improvement may be to include a HyDE model in between to improve the questions and keep trying to get different chunks from the DB?
It'll perform a web search to find the relevant information (the node that has the agent). And yes, HyDE could be an option too.
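Roughly, the routing after document grading looks like this (a simplified sketch, not the exact code from the repo):

```python
# Simplified sketch: if no chunk survived the relevance grading,
# fall back to the web-search node instead of re-querying the vector DB.
def decide_next_step(state: dict) -> str:
    relevant_docs = state.get("documents", [])
    if not relevant_docs:
        # A HyDE-style query rewrite could also be inserted here
        # before re-querying the DB, as you suggested.
        return "web_search"
    return "generate"
```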
Awesome
Thank you!!
Great video! But I have a question I hope you can answer.
Why is it answering so slowly? Is that normal for this architecture, or is there another reason? And can we do something to fix it?
We have 5 LLM calls to generate and grade answers, plus the retriever, plus a web search that runs when the question isn't in the vector store, and we also store the web search results back in the database; all of these steps take time. To make it faster, you can use fewer LLM calls and maybe skip the web search, depending on your use case.
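For example, one way to cut round trips is to grade all retrieved chunks in a single LLM call instead of one call per chunk (illustrative sketch only, assuming a LangChain-style chat model; this is not the repo's code):

```python
# Illustrative: grade every retrieved chunk in one LLM call
# instead of one call per chunk, saving several round trips.
def grade_documents_batched(llm, question: str, docs: list) -> list:
    numbered = "\n".join(f"[{i}] {d}" for i, d in enumerate(docs))
    prompt = (
        f"Question: {question}\n\nChunks:\n{numbered}\n\n"
        "Return the numbers of the chunks relevant to the question, comma-separated."
    )
    reply = llm.invoke(prompt).content
    keep = {int(s) for s in reply.replace(",", " ").split() if s.isdigit()}
    return [d for i, d in enumerate(docs) if i in keep]
```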
Can you make an example using only Local LLMs and Local Agents, so no API Keys (and no costs) are created? That would be amazing!
Yes, I'll have it in mind for the next video!
@@eduardov01 Amazing!!
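In the meantime, the core swap is small, e.g. with Ollama (a sketch, assuming Ollama is running locally and you've already pulled a model with `ollama pull llama3`):

```python
# Rough sketch: swap the API-based LLM for a local one served by Ollama,
# so no API keys or per-call costs are involved.
from langchain_community.chat_models import ChatOllama

llm = ChatOllama(model="llama3", temperature=0)
print(llm.invoke("Is this answer grounded in the documents? Yes or no.").content)
```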
Could you please share the link here?
Nice video!
Any chance to get access to the excalidraw version of the diagram?
Thanks!
I have a free account in Excalidraw and just have 1 session with all my diagrams. But you can get access to the flowchart using this link: github.com/Eduardovasquezn/advanced-rag-app/blob/main/images/rag.png
Is the Tavily API free? Can I use the Google Search Engine instead?
Yes, you can make 1,000 API calls for free every month.
It's also possible to use Google Search as an agent for this. I have a video explaining step-by-step how to use it: ruclips.net/video/ppGRPWpv9Wc/видео.html
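In LangChain, the Tavily tool is just a few lines (a sketch; it expects the TAVILY_API_KEY environment variable to be set):

```python
# Sketch: Tavily web search via LangChain; needs TAVILY_API_KEY set.
from langchain_community.tools.tavily_search import TavilySearchResults

web_search_tool = TavilySearchResults(max_results=3)
results = web_search_tool.invoke("What is corrective RAG?")  # list of {url, content}
print(results)
```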