This is the 10th video in our RAG From Scratch series, focused on different types of query routing (logical and semantic). Notebook: github.com/lan... Slides: docs.google.co...
Hi Lance,
Thanks for sharing. One thing that would be helpful would be if you could discuss routing when the state needs to be remembered.
What I mean is that you start in a particular state and, based on routing logic, you end up in a new state. The next time the user interacts with the system, you pick up where you left off, and you would then have different routing logic. You are essentially building a state machine where each state has its own routing logic.
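The state machine the commenter describes could be sketched in plain Python like this. The state names, routing predicates, and chain names below are all illustrative assumptions, not anything from the video:

```python
# Hypothetical sketch: per-state routing, where each state carries its own
# routing table and each route transition may move the machine to a new state.

class StatefulRouter:
    """Remembers conversation state; each state has separate routing logic."""

    def __init__(self):
        self.state = "start"
        # Each state maps a predicate over the query to (chain, next_state).
        # Predicates and chain names here are placeholders.
        self.routes = {
            "start": [
                (lambda q: "docs" in q, ("doc_chain", "browsing_docs")),
                (lambda q: True, ("general_chain", "start")),  # default route
            ],
            "browsing_docs": [
                (lambda q: "back" in q, ("general_chain", "start")),
                (lambda q: True, ("doc_followup_chain", "browsing_docs")),
            ],
        }

    def route(self, query: str) -> str:
        for predicate, (chain, next_state) in self.routes[self.state]:
            if predicate(query):
                self.state = next_state  # remember where we ended up
                return chain

router = StatefulRouter()
print(router.route("show me the docs"))    # doc_chain (now in browsing_docs)
print(router.route("more detail please"))  # doc_followup_chain
```

In a LangChain setup, the predicates would be replaced by an LLM or embedding-based classifier, and the selected chain name would pick which runnable to invoke; the point is only that the routing table is keyed by remembered state.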
Hi Lance, I have to say that I am enjoying your series very much. Thank you for breaking these concepts down in such a way that makes it easy to digest.
Lance from Langchain 🙌🏾🙌🏾🙌🏾
Really good series! Thanks Lance!
This is a bit unrelated, but I really like the flow diagrams you always have in your videos, what tool do you use for them?
Excalidraw
thank you!!
Both approaches have some problems.
- LLM-based routing is the most accurate but adds latency, which may or may not be acceptable (depending on the complexity of the existing chain).
- Semantic routing is very fast (still slower than, say, TF-IDF or other simpler NLP methods, but ~5-10 times faster than an LLM call), but it misclassifies the route much more often than the LLM-based approach.
I ended up running classification and the resulting branches as a parallel runnable, then deciding which output to show in a merge step. But this only works if cost/quota is not a problem and there are only a few branches.
I have some ideas on how to address more complex chains, but I'm still experimenting :)
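The parallel approach described above can be sketched with the standard library alone. The classifier and branch functions here are stand-ins (assumptions) for the commenter's LLM classifier and LangChain branches; only the concurrency pattern is the point:

```python
# Hypothetical sketch: run the route classifier and every branch in parallel,
# then keep only the chosen branch's output in a merge step. This trades extra
# cost/quota (all branches always run) for lower end-to-end latency.
from concurrent.futures import ThreadPoolExecutor

def classify(query: str) -> str:
    # Stand-in for an LLM or embedding classifier (assumption).
    return "math" if any(ch.isdigit() for ch in query) else "chat"

def math_branch(query: str) -> str:
    return f"[math] {query}"   # placeholder for a real chain

def chat_branch(query: str) -> str:
    return f"[chat] {query}"   # placeholder for a real chain

def answer(query: str) -> str:
    branches = {"math": math_branch, "chat": chat_branch}
    with ThreadPoolExecutor() as pool:
        # Kick off classification and all branches at the same time.
        route_future = pool.submit(classify, query)
        branch_futures = {name: pool.submit(fn, query)
                          for name, fn in branches.items()}
        # Merge step: discard everything except the chosen branch's result.
        return branch_futures[route_future.result()].result()

print(answer("what is 2 + 2"))  # [math] what is 2 + 2
print(answer("hello there"))    # [chat] hello there
```

As the commenter notes, the cost of this pattern grows linearly with the number of branches, which is why it only suits chains with a handful of routes.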
Can you please share your experiments on GitHub? I am interested in this and could also contribute.
Thank you for the videos, Lance! I've always wondered how to manually enforce the choice of chain. Would you think of routing as a more manual, basic way of doing multi-agent (non-agentic) chains?
How can we deal with the situation where a user throws an unrelated question at this router chain?
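One common answer to this question is to give the router an explicit fallback branch, so that queries matching no known route get a default response instead of being forced into the nearest one. Here is a minimal keyword-overlap sketch; the route names, keyword sets, and threshold are all assumptions for illustration:

```python
# Hypothetical sketch: routing with a fallback branch for unrelated queries.
# Route names and keywords are placeholders.
KNOWN_ROUTES = {
    "python_docs": {"python", "pip", "venv"},
    "js_docs": {"javascript", "npm", "node"},
}

def route_with_fallback(query: str, threshold: int = 1) -> str:
    words = set(query.lower().split())
    scores = {name: len(words & kws) for name, kws in KNOWN_ROUTES.items()}
    best = max(scores, key=scores.get)
    # If nothing overlaps enough, fall back instead of forcing a route.
    return best if scores[best] >= threshold else "fallback"

print(route_with_fallback("how do I install pip packages"))  # python_docs
print(route_with_fallback("what's the weather today"))       # fallback
```

With an LLM-based router, the same idea means adding an "other/none of the above" option to the structured output schema; with semantic routing, it means thresholding the similarity score.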
So the trick is that the LLM has some "reasoning" ability: it can "think" and route in the right direction... right?
Is there a way to include memory? I mean, if we want to maintain a previous conversation or switch between different topics, is there a way to add memory to our LLM routing/semantic routing?
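One simple way to answer this question is to route on the query *plus* a rolling window of recent turns, so that a topic-free follow-up ("can you show an example?") stays on the current topic. This is a hypothetical stdlib sketch; the keyword check stands in for a real classifier, and the chain names are placeholders:

```python
# Hypothetical sketch: memory-aware routing via a rolling conversation window.
from collections import deque

class MemoryRouter:
    def __init__(self, window: int = 3):
        self.history = deque(maxlen=window)  # last few user turns

    def route(self, query: str) -> str:
        # Route on recent turns + the new query, not the query alone.
        context = " ".join(self.history) + " " + query
        self.history.append(query)
        # Stand-in for an LLM/embedding classifier (assumption).
        return "code_chain" if "python" in context.lower() else "general_chain"

r = MemoryRouter()
print(r.route("how do I write a Python decorator?"))  # code_chain
print(r.route("can you show an example?"))            # code_chain (via memory)
```

In a LangChain setup, the same idea would mean feeding the chat history (or a summary of it) into the routing prompt or the embedded routing text.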
Is there a way to use with_structured_output with Gemini Pro?
A non-related question to the video itself - but which software do you use to create the diagrams?
Is it possible to use more than one database simultaneously based on the user query? Kind of like multiple if statements in Python that check whether the query requires the vector DB, the analytics DB, only one of them, or both to answer.
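The "multiple ifs" idea in this question can be sketched by letting the router return a *set* of datastores rather than a single route. The keyword checks and database names below are illustrative assumptions; a real version would use an LLM with a structured output schema that allows multiple selections:

```python
# Hypothetical sketch: a query can select the vector DB, the analytics DB,
# or both. Keywords here are placeholders for a real classifier.
def select_databases(query: str) -> set:
    q = query.lower()
    selected = set()
    if any(w in q for w in ("explain", "docs", "how")):
        selected.add("vector_db")      # unstructured / retrieval questions
    if any(w in q for w in ("count", "average", "trend")):
        selected.add("analytics_db")   # aggregate / numeric questions
    return selected or {"vector_db"}   # assumed default when nothing matches

print(sorted(select_databases("explain the average latency trend")))
# ['analytics_db', 'vector_db']
```

Downstream, you would query every selected store (possibly in parallel) and merge the results before generation.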
Why do we need to route queries?
In LangChain 0.1.16, I see the method with_structured_output marked as NotImplemented in the code, so how are you making it work?
Can we use AzureChatOpenAI instead of ChatOpenAI for function calling with the LLM to create structured_llm?
yes, you can.
@florinfilip6355 it worked!