Don't naive RAG do hybrid search instead (Pinecone Weaviate or pgvector + full text search & rerank)

LLMs for Devs

10K views

Published: Dec 25, 2024

Comments

@devlearnllm 5 months ago ⁺⁴
Hey y'all, in case you, like me, didn't get good full-text search results, the CEO of Supabase (Paul Copplestone) sent me this to use instead: supabase.com/docs/guides/database/extensions/pgroonga
@pabloarroyo7952 3 months ago ⁺²
Watching this 2 months later. Great video, thanks for sharing
@devlearnllm 2 months ago
Glad you enjoyed it!
@blackswann9555 1 month ago ⁺¹
3 months later, I'm here and enjoying it
@UnemployMan396-xd7ov 26 days ago
4 months later here, love it. Gonna put this in my graduate thesis
@alienPear 3 months ago ⁺¹
Thanks for sharing, bro! Greetings from Colombia
@devlearnllm 3 months ago
My pleasure!
@JamesRBentley 5 months ago ⁺¹
Nice video sir. I have already been experimenting with the Colab - sincerest thanks
@devlearnllm 5 months ago
Great to hear!
@ironbondar 5 months ago ⁺¹
Very good workshop. Straight to the point
@gregmeldrum 5 months ago ⁺¹
Very informative! A great resource. Thanks for sharing your wealth of knowledge!!
@magnusjensen5867 2 months ago ⁺¹
Nice workshop, thank you for sharing! You mentioned early on that you tried decomposing your queries if they were multi-hop or abstract queries. Would you still suggest that approach, or is there any new research specifically on this matter? Imagine a query in which a user wants to retrieve information from multiple documents and get a comparison or summarization.
@devlearnllm 2 months ago ⁺²
I'm still doing the same for my app, and what I'm hoping to do eventually is to prompt the query expansion step so it's expanding in a coherent way. E.g., the question is about how X affects Y -> find X, find Y
@magnusjensen5867 2 months ago
@devlearnllm Thank you for your response. How exactly would you go about this? Have you played with knowledge graphs (GraphRAG) like Neo4j, etc.?
@Phoenix-gi3gu 4 months ago ⁺¹
For experimenting I would recommend using no database at all. You can simply use cosine similarity (e.g. from torch.nn.functional) or quickly implement it yourself, and you are nearly done. Just use argsort to get the best matches. It's about five lines of code. For easy store/load you can use pickle to serialize/deserialize the object that holds the embeddings. It is fast on CPU too, but of course you can run it on GPU without any major changes.
No services required.
@devlearnllm 4 months ago
good point
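The no-database approach described in the comment above might look roughly like the sketch below. It is only an illustration under stated assumptions: it uses plain Python instead of torch so it runs with no dependencies, the helper names `cosine_similarity` and `top_k` are hypothetical, and the three toy vectors stand in for real model embeddings. Sorting indices by score plays the role of argsort.

```python
import math
import pickle


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here: a zero vector matches nothing
    return dot / (norm_a * norm_b)


def top_k(query, embeddings, k=3):
    """Return (index, score) pairs for the k most similar stored vectors."""
    scores = [cosine_similarity(query, e) for e in embeddings]
    # Indices ordered by descending score: the pure-Python analogue of argsort.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [(i, scores[i]) for i in order[:k]]


# Toy 3-dimensional vectors standing in for real embeddings (made up).
embeddings = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.7, 0.7, 0.0],
]

# Round-trip through pickle, as suggested for easy store/load.
# pickle.dump / pickle.load on a file object would persist it to disk.
blob = pickle.dumps(embeddings)
restored = pickle.loads(blob)

results = top_k([1.0, 0.1, 0.0], restored, k=2)
print(results)
```

With real embeddings, the same logic is usually vectorized with NumPy or torch, which is where the "five lines" estimate comes from. Note that pickle should only be used to load files you created yourself, since unpickling untrusted data can execute arbitrary code.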
@ofrylivney367 5 months ago ⁺²
Nice workshop! I'll definitely try out the hybrid search. Do you reckon it'll work with nomic text embeddings and ollama?
@devlearnllm 5 months ago
Most likely!
@oamarkanji3153 3 months ago
Incredible content. Thank you.
@devlearnllm 2 months ago
Much appreciated!
@SandeeshCroos 1 month ago
Hey, great content! Thanks for sharing your knowledge. However, instead of just using tsvector in PostgreSQL, you can leverage sparse vector search by utilizing the pg_search extension, right?
@devlearnllm 1 month ago ⁺¹
Yup, they're both full-text search. Or use pgroonga
@ThoughtfullySo 5 months ago ⁺¹
You should've tried Qdrant.
@ajkdrag 4 months ago ⁺¹
Hi. I have a video request. Is there a way to contact you?
@devlearnllm 3 months ago
tally.so/r/n9djRQ
@ajkdrag 3 months ago
@devlearnllm Done. Thanks
@vijishmadhavan6093 4 months ago
What happens if we use all 25,000 cases? Will it work?
@devlearnllm 4 months ago
Most likely. Pinecone, Weaviate and pgvector are very performant.
@ArunKumar-bp5lo 5 months ago ⁺¹
great
@artur50 5 months ago
Is it possible to run it with Ollama?
@devlearnllm 5 months ago
Most likely
@zuowang5185 5 months ago
Is OpenAI's embedding v3 model better than BERT?
@devlearnllm 5 months ago
Hard to tell unless experiments are run.
huggingface.co/spaces/mteb/leaderboard
@flor.7797 5 months ago
I just use Google 🙃
