Hey, great video! Just a clarification question, because I'm not sure if I heard right: do we usually take only the single top context for RAG? I thought we usually take the top k, with k around 5-8? If we're using small chunks, e.g. a couple of sentences, couldn't multiple chunks provide useful additional context, in case the very top one doesn't exactly capture the answer?
Thanks for tuning in! Yes, you are correct: typically you take the top-k retrieved results, not just a single chunk, which gives the LLM more context to work with.
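If it helps, here's a minimal sketch of top-k retrieval using sentence-transformers (mentioned later in the thread). The model name, sample chunks, and k value are just illustrative choices, not a recommendation:

```python
# Minimal top-k retrieval sketch (model name, chunks, and k are illustrative).
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # any sentence-embedding model works

chunks = [
    "RAG retrieves relevant text before generation.",
    "Top-k retrieval returns the k closest chunks.",
    "Chunking splits documents into small pieces.",
]
chunk_vecs = model.encode(chunks, convert_to_tensor=True)

query = "How many chunks does RAG retrieve?"
query_vec = model.encode(query, convert_to_tensor=True)

# Cosine similarity between the query and every chunk, then take the top k.
k = 2
scores = util.cos_sim(query_vec, chunk_vecs)[0]
top_k = scores.topk(k)
context = [chunks[i] for i in top_k.indices]
print(context)  # pass all k chunks to the LLM, not just the single best one
```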
Hey guys, I liked your intro to RAG. I also heard you have a subreddit; you should put a link to it in the video description or somewhere, as I couldn't find it directly.

Anyway, I have a question: how would you optimize retrieval and chunking for working with something like dialogs, to extract their meaning? What direction or advice would make sense for the embeddings? What kind of embedding model would you suggest, and what should I look into on the retrieval side? It sounds easy on the surface, but I've been struggling to get it to retrieve meaningful context: if I go for smaller chunks, at sentence length or at each change of speaker, it usually doesn't retrieve meaningful parts of the conversation. Any advice or reading material would be greatly appreciated. I'm working with LangChain right now and a self-hosted LLM.
Thank you for joining the presentation! The subreddit is www.reddit.com/r/kdbai/... but it's brand new, so there's not much activity yet!

As to your question: recent embedding models can create meaningful vectors even from larger pieces of text, so you could try embedding entire conversations as single chunks. You could also try a method like parent document retrieval or sentence windows (both methods of chunk decoupling), where you retrieve on smaller chunks like sentences and then provide larger texts (the parent documents, or windows around the retrieved sentences) to the LLM for generation; there's a sketch of the idea below.

If you are not getting good retrieval with smaller chunks, try some different embedding models - sentence transformers (huggingface.co/sentence-transformers) could be a good option.
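This isn't a full LangChain pipeline, just a plain-Python sketch of the sentence-window idea on a dialog, assuming sentence-transformers. The model name, window size, and sample turns are made-up illustrations:

```python
# Sketch of sentence-window retrieval on a dialog: retrieve on single
# speaker turns, but give the LLM a window of surrounding turns.
# (Model name, window size, and the sample dialog are illustrative.)
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

# One entry per speaker turn; in practice you'd parse your transcript.
turns = [
    "A: Did you ship the release?",
    "B: Not yet, the vector index build failed.",
    "A: What was the error?",
    "B: Out of memory while embedding the long transcripts.",
    "A: Try smaller batches then.",
]
turn_vecs = model.encode(turns, convert_to_tensor=True)

query = "Why did the index build fail?"
scores = util.cos_sim(model.encode(query, convert_to_tensor=True), turn_vecs)[0]
best = int(scores.argmax())

# Retrieval matched one short turn, but the generation step gets the
# window around it, so the LLM sees the surrounding conversation.
window = 1
context = turns[max(0, best - window): best + window + 1]
print("\n".join(context))
```

Parent document retrieval works the same way, except the retrieved sentence expands to its whole parent document (e.g. the full conversation) rather than a fixed window.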
Thanks, guys, for the session. It was really helpful.