Just 3 days ago I had an idea to build an AI project to help me summarize 4 books a week before school starts. 2 days ago I started researching libraries and methods to get concise data without hallucinations. I was progressing, but not a single video was up to date or taught what I needed. It started stressing me out. Then I found your channel. Just when I needed it, you uploaded exactly what I searched for. I don't have the words to describe what a change you were for me, and because of this video alone I want to keep tinkering with LLMs. You are a legend who changed a 16 year old's mind about developing with AI and just gained a new Patreon supporter, and I've never paid for Patreon before. I look forward to watching your other videos.
Have a nice learning path
You should still read the books. Otherwise, the degree you earn will be about as worthless as the paper it's printed on. You're also not learning how to do anything other than mimic what others do. Don't settle for being a little man who stands on the shoulders of the greater men who came before you.
You are a legend. Love your Ollama series. Keep up the great work!
Hi Matt. I always upvote your videos. Two possible topics come to mind for future course deep dives: 1. a comparison benchmark (even if qualitative) among different sizes of the same model, different quantizations, and different context window sizes, explaining the hardware resource trade-offs. 2. using various RAG techniques to memorize chat conversations for a sort of "long-term memory" (that's indeed a very general use case). Just ideas. Thanks for it all.
Spot on. Thanks. I like the speed and depth of the videos. This one in particular is very helpful, as it shows everything from embedding, database setup, and preparation of the chunks through to the final execution on the CLI. I would use it exactly like this. Still, to make it available for colleagues as well, there is a need for a front end, which is Open WebUI in my case.
Anyway, I learned so much and have tons of questions that I would not have dared to raise without this input. Whether you use a frontend or the CLI, it helps to follow the technology side of it.
Thanks for boiling this down to the main components and how they can come together to make a solution. It's a great foundation to start with; many of the videos I have seen have focused on all kinds of different tooling, and it's hard to know which aspects are essential.
Thank you for providing a playlist to learn this. Very helpful how you explain things. Always looking forward to your training series. Keep up the good work, Matt!
You are an amazing teacher. Thank you!
Would love to see further deep dive on this including hybrid keyword/semantic search and reranker for large datasets applied with an LLM via Ollama. Thanks for the great tutorial as always!
Brilliant. I just subscribed. Thank You for your video series.
1- You forgot to drink... It is important to keep hydrated.🧐
2- I prefer ready solutions especially those that give complete choices and options (Hybrid RAG with graph knowledge)💥
Thanks for the good content 🌹
Video Suggestion: Table-Augmented Generation (TAG). TAG is a unified and general-purpose paradigm for answering natural language questions over databases.
Great video. Any chance you can add RAG citation video to your collection? It is valuable to have the RAG output cite where in the document the content was obtained from that was used in the response.
Hmm that’s pretty easy. Sure I can add it to a list
It's such a breeze to find your channel; a lot of influencer noise around these topics makes it challenging to find quality material.
I came in trying to answer a question, which you pointed at the end of the video: how do the pre-made RAG solutions compare to each other, and to the DIY one? Of course, building your own comes with the gift of knowing how things work, and eventually understanding and implementing them better, fixing problems when found, etc.
My perspective, if it helps: the topic has many choice points, and it can easily get overwhelming for someone of humble knowledge. What really helps is to know, from someone such as yourself, why to choose this over that, or what balance to look for. For example, why choose this embedding model?
If EPUB is better than PDF for the task, should we try to convert first to get better text content from PDFs? Would it be helpful to rate the text extraction before deciding to feed it to the model (what I mean is that PDFs have very varying degrees of readability)? I'm just thinking out loud at this point :D
Lastly, thank you and looking forward to the next video.
Great video, as usual. Looking forward to the RAG tools video and maybe some integrations, please
I absolutely love this!
Simplicity is elegance - a prime example :-)
I have yet to jump into coding before building a RAG, so for now I'm using Open WebUI and I'm very satisfied with it 🎉
Thanks, really cool. However, I'm not sure why you included magic when it's really not needed. You are fetching most docs from the web, so you can get the MIME type from the request.
After the launch of Llama 3.2 1B & 3B, this video should skyrocket, and so should Ollama.
What is the process to maintain dynamic data, e.g. a customer list with active balances, where the balances change every day? Would I constantly have to delete and upload new data into the database, or is there a simpler method?
That's pretty simple as is. I can't think of a simpler way.
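One hedged sketch of a middle ground: keep a stable id per record and upsert only what changed, instead of wiping and reloading. The field names here are made up for illustration; with Chroma the output would feed `collection.upsert(ids=ids, documents=docs)`, which overwrites existing entries that share an id.

```python
def build_upsert(customers):
    """Key each record by a stable id so re-running the import replaces
    stale balances in place instead of duplicating them."""
    ids = [f"customer-{c['id']}" for c in customers]
    docs = [
        f"{c['name']} has an active balance of {c['balance']:.2f}"
        for c in customers
    ]
    return ids, docs
```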
Thank you so much for this omg
Can you use a RAG with the new Llama 3.2 in order to have it do facial or person recognition?
A continuation aimed at advanced users: a local vs. global vs. native context comparison, using GraphRAG and Triplex - when to use which one.
Hi, what do you mean by "local vs global vs native context comparison"?
Great intro to RAG dev vid.
Did anyone have problems with the Python scripts? I had to correct some, and requirements.txt didn't have all the necessary packages.
Great video Matt, thanks for the awesome content.
I am a novice with AI, and I sometimes have issues where my RAG uses the LLM's internal knowledge to answer a question, even though I provided the context and told it to use that to answer, just like in your example.
Would you have any suggestions on how to avoid that? Maybe it's something really easy I am missing 😅😅 Thanks
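One common suggestion for this is tightening the prompt so the model has an explicit fallback instead of reaching for its own knowledge. A minimal sketch; the wording is an assumption, not the video's exact prompt:

```python
def grounded_prompt(context, question):
    """Build a prompt that forbids answers from outside the retrieved
    context and gives the model an explicit escape hatch."""
    return (
        "Answer using ONLY the context below. If the answer is not in "
        "the context, reply exactly: I don't know.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
```

Smaller models follow this kind of instruction less reliably, so lowering the temperature can also help.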
This was great, but... (There’s always a “but,” isn’t there?)
I’m building a RAG system (using Ollama for embedding and querying) at the moment, and the hard part isn’t the RAG. It’s getting the text in the first place from PDF, MSG (including direct attachments and nested email chains), DOC[X], HTML, etc. Do you have any recommendations for tooling in this arena?
Thank you!
Thank you so much❤
You're welcome 😊
Nice videos, clearly explained in plain English. A couple of questions: why are you using different models to get the RAG and non-RAG responses? And why use Ollama to get the embeddings instead of leveraging the Chroma embedding feature? Thanks for sharing your expertise.
I must have changed one and forgotten the other. No reason.
Please suggest the best open-source model for local embeddings.
There is an error in the Python code: "collectionname" on line 12.
Correction: any(collection.name == collectionname for collection in chromaclient.list_collections())
What should it be? I hit this and just commented it out to get around it.
I am still using Msty and having mixed results. The formatting issue for data is my biggest problem right now.
So I'm testing this out with the older video from April, and I'm running into an issue where the embedded documents match the context, but the answer generated by gemma:2b says "The text provided does not contain any information... ...so I cannot answer this question with the provided context". Does anyone know why this is happening?
Most excellent vid, sir! Can you expand on this by showing how to make RAG perform faster, at 25 tokens per second at least, with several GB or thousands of md files uploaded to it, please?
25 tokens per second seems pretty slow. I get double that on my 3 year old Mac.
next video of the series please
This is excellent. I need to hire a Python coder; can you refer me?
Is there a way for Ollama to do yes or no answers?
Yes. Ask it a yes/no question and tell it to answer that way. Or use structured outputs, as shown in a recent video.
@@technovangelist I'll give it a try. I use LLaVA, and when I ask how many flying saucers it sees in the pictures, I always get random answers.
@@technovangelist it worked. I am getting Yes or No answers.
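For reference, a hedged sketch of the structured-outputs approach mentioned above: recent ollama-python releases accept a JSON schema in the `format` parameter, which constrains the reply to the shape you define. The model name is a placeholder, and calling the function requires a running Ollama server.

```python
import json

# JSON schema that only permits "yes" or "no" as the answer.
yes_no_schema = {
    "type": "object",
    "properties": {"answer": {"type": "string", "enum": ["yes", "no"]}},
    "required": ["answer"],
}

def ask_yes_no(question, model="llama3.2"):
    """Ask a yes/no question with the response constrained by the schema.
    Requires the ollama package and a running Ollama server."""
    import ollama
    resp = ollama.chat(
        model=model,
        messages=[{"role": "user", "content": question}],
        format=yes_no_schema,
    )
    return json.loads(resp["message"]["content"])["answer"]
```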
Timestamp format is off in the description, friend ❤
Dear Matt, could you please make practical videos? I kept watching several videos from you but never got to the point where I get things working. Please mix your videos between lecturing and practical step-by-step guides so we can all benefit.
There is a lot of that on this channel. And there will be more, too.
I agree with @GhassanYousif
Can you please make your videos easier to follow?
What kind of audience are you targeting?
We are new to programming and AI, so a lot of the technical speak doesn't really help us.
If you could show the steps step by step, slower and more clearly, that would really help.
By the way, I read that there's a Llama Stack, would you be doing a simple video on how we can install and use Llama Stack?
The steps you describe are far too complex for me and my experience. 100% of the time, no matter how exactly I try to follow along with these sorts of things, it won't work. It is either slightly out of date, or one step is slightly misunderstood, or whatever, and then comes the inevitable screen full of cryptic gibberish.
So I was wondering if the easy way via Open WebUI, which you mention at the end, is as good. Just add documents, create a custom model, and away we go?
Or is it too easy to be as good?
If you feel better using Open WebUI, great. Ollama is a tool for software developers first, and so understanding how to build a RAG system is one of those core projects everyone should learn.
@@technovangelist I'll give it a try.
When I get bad results, I can't decide if it's because of my chunking or because the data is in French.
But I can't easily find models for embedding French texts.
There are a few French Natural Language Processing (NLP) models on Hugging Face that might work for your needs.
Open WebUi first pls!
I was waiting for the "other shoe to drop" -- "This is a FREE course" and then the scam grift begins. It's always a course with people these days. "I have to make money," he says... You could make a social network and charge a $10 monthly fee to be part of a community or something with more value. Don't go into this as a scammer. I'm older than you and spent many years doing black hat, so I know exactly what I'm talking about...
I don’t want your money for this course. I plan to keep going with this for a while. I may take some sponsorships, but this will always be free. And I’m only mid-50s, so not old at all.
A "thank you" would be a better comment