How We're Building AI Search Engines using LLM Embeddings

  • Published: 22 Aug 2024

Comments • 29

  • @engKanani · 2 months ago

    excellent video, much better than tons of other long "bla bla" videos out there, thanks!

  • @mrdatapsycho · 11 months ago

    Short and compact. An excellent video to get an overview of LLM-based search.

  • @bracodescammer · 11 months ago

    Awesome. I learned how to build inverted indices and this here now seems so simple, yet versatile in comparison.

  • @shubhamroy7403 · 11 months ago +5

    Great video man. Can you upload more videos like this explaining a bit more on the code side?

    • @thinknimble · 11 months ago +1

      William here - thanks for watching! And sure thing. I recorded myself building out the backend of this. I'll get that edited and posted soon.

    • @J3R3MI6 · 11 months ago +3

      @thinknimble this is worth a sub 🙏🏽💎

    • @thinknimble · 10 months ago +1

      Finally got the more in-depth video posted!
      ruclips.net/video/OPy4dLHdZng/видео.html

  • @WishyIwish · 11 months ago +1

    Very nice video - cool to get another perspective on RAGs and how to implement them with a very different stack.

  • @jeromeeusebius · 10 months ago

    Thanks for sharing. Great video. This is a useful, self-contained template for a search use case, and I plan to apply it to one of my own. I've watched the video twice now, and the second time around I got a much better understanding. Another interesting part, as you mentioned, is using another LLM call to potentially get an explanation for the output. One question I had, which you addressed towards the end, is how to logically split the document to ensure consistency, i.e., not splitting in the middle of thoughts or ideas. One could even try different schemes, e.g., using another higher-level ML model to evaluate different splitting schemes.
    Thanks once again.
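[Editor's note] The splitting concern raised above can be sketched in a few lines. This is a hypothetical illustration, not the video's actual code: it splits on blank-line paragraph boundaries and packs paragraphs into chunks up to a size limit, so no chunk ever cuts a thought mid-paragraph. The 500-character limit is an arbitrary example value.

```python
def chunk_by_paragraph(text: str, max_chars: int = 500) -> list[str]:
    """Greedily pack whole paragraphs into chunks of at most max_chars."""
    # Paragraphs are delimited by blank lines; drop empty fragments.
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        # Start a new chunk if adding this paragraph would exceed the limit
        # (+2 accounts for the "\n\n" separator re-inserted below).
        if current and len(current) + len(para) + 2 > max_chars:
            chunks.append(current)
            current = para
        else:
            current = f"{current}\n\n{para}" if current else para
    if current:
        chunks.append(current)
    return chunks
```

A single oversized paragraph would still become its own oversized chunk here; a real implementation would need a fallback (e.g. sentence-level splitting) for that case.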

  • @RustemShaimagambetov · 10 months ago +1

    To be honest, this is not search through an LLM; the embeddings are not generated by a large language model. The video just uses a sentence transformer (all-MiniLM-L6-v2).

    • @thinknimble · 10 months ago +1

      That's correct. The goal is not to generate an LLM, but to use an existing LLM to search natural language content.
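[Editor's note] Whichever model produces the embeddings (the video uses all-MiniLM-L6-v2), the retrieval step itself is just nearest-neighbor ranking by cosine similarity. The sketch below shows that ranking logic with tiny made-up 3-dimensional vectors so it stands alone; in practice the vectors would come from the embedding model.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def rank_documents(query_vec: list[float],
                   doc_vecs: dict[str, list[float]]) -> list[str]:
    # Sort document ids by descending similarity to the query embedding.
    return sorted(doc_vecs,
                  key=lambda d: cosine_similarity(query_vec, doc_vecs[d]),
                  reverse=True)

# Toy example: "a" matches the query exactly, "c" is close, "b" is orthogonal.
docs = {"a": [1.0, 0.0, 0.0], "b": [0.0, 1.0, 0.0], "c": [0.9, 0.1, 0.0]}
print(rank_documents([1.0, 0.0, 0.0], docs))  # → ['a', 'c', 'b']
```

At scale this brute-force scan is replaced by an approximate-nearest-neighbor index (e.g. pgvector, which the linked repo's stack suggests), but the similarity measure is the same.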

  • @mvasa2582 · 10 months ago

    Lots of potential here: research papers, legal documents, etc. Nice job explaining. Now, how would you productize this so that a customer would simply need to install it on a target system or drive where their files are stored, and it could automatically (and efficiently) consume those files and run inference on any new additions?

    • @thinknimble · 9 months ago

      Great questions. We'll likely explore these in future videos.

  • @jpops8767 · 10 months ago

    Thanks for the vid!! Have you guys heard of the Bittensor / Opentensor foundation?

    • @thinknimble · 9 months ago +1

      We have not. We'll check it out!

    • @jpops8767 · 8 months ago

      @thinknimble You won't be disappointed!

  • @devd4001 · 11 months ago +1

    Great video, but I am unable to find the code in the given github link, could you please add the python script!

    • @thinknimble · 11 months ago +1

      Thank you for your question! The code on GitHub is an entire project with a frontend ('client' folder) and backend ('server' folder). The key Python code demonstrated in this video is a few folders deep in core models.py: github.com/thinknimble/embeddings-search-demo/blob/main/server/vector_demonstration/core/models.py
      I hope this helps!

    • @ShikharDadhich · 11 months ago

      Thanks a lot ☺ @thinknimble

  • @SiD-hq2fo · 11 months ago

    This may be a weird question, but as a beginner thinking about getting into this: do you have any other platform, like Discord, where you share and interact with users like me? :) Thank you

    • @thinknimble · 11 months ago

      Not weird at all! I know many RUclips channels have Discords. We appreciate your interest; we don't currently have a public Discord, but we'll consider setting one up in the future.

  • @andylee8283 · 11 months ago

    FYI!!!!

  • @khatharrmalkavian3306 · 10 months ago

    This honestly seems like the dumbest application of this technology.

  • @wildfotoz · 11 months ago

    Wow, there are so many things wrong with this video it's not funny. First, Excel documents are structured data, not unstructured. If the data was a bunch of resumes in Word or PDF format, then you'd have unstructured data. Second, your CSV files are not CSV. CSV stands for comma-separated values. You showed HTML files. I feel sorry for your clients.

    • @thinknimble · 11 months ago +4

      Hello! We appreciate your comment. William didn't mention Excel documents, but you raise an interesting and profound question about what "structured" vs. "unstructured" data even is. LLMs are fascinating, because they seem to be revealing a hidden structure in even the most unstructured, natural language.
      The files in the video and available in the open-source codebase are definitely CSVs. So, sorry, you are wrong about that! But as you observed, the 'description' column of each CSV file is HTML. Since HTML is plain text, it can be included inside a CSV.
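[Editor's note] The point above is easy to verify with the standard library: HTML is plain text, so it survives a round trip through a quoted CSV field. The 'description' column name comes from this thread; the row content is made up for illustration.

```python
import csv
import io

# Write one row whose 'description' field is an HTML fragment (with a
# comma, which forces the csv module to quote the field).
buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=["title", "description"])
writer.writeheader()
writer.writerow({
    "title": "Backend Engineer",
    "description": "<p>Build APIs, with <b>Django</b></p>",
})

# Read it back: the HTML fragment round-trips intact, comma and all.
buf.seek(0)
rows = list(csv.DictReader(buf))
print(rows[0]["description"])
```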

    • @agentDueDiligence · 10 months ago +2

      LOL - if you think Excel files are structured data, then you have never really worked with Excel files in the real world 😂
      Nothing could be less structured than your average Excel file 😂

  • @timonweb_com · 11 months ago

    Hey, nice idea about chunks! Thanks a lot for the video!