Open Source RAG running LLMs locally with Ollama

Weaviate • Vector Database

Просмотров 30 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 7 янв 2025

Комментарии •

@JohnPamplin 7 месяцев назад ⁺⁵
Your "All Your Base Are Belong To Us" ending just earned a subscription from me. WHAT YOU SAY!
@divyaraj-rana 5 месяцев назад
Really amazing innovation by Weaviate team! Their workshops speaks about their groundbreaking applications and making it open-source.
@kapkanfps3694 7 месяцев назад ⁺⁶
Might have to give a try, the ending is hilarious nostalgic 😂
@mohz832 7 месяцев назад ⁺⁵
Why the layout of the installed version via pip is not the same as your demo? Also, how can we use PDF files without an API key from Unstructured? I believe this is still a showstopper for most of us.
@iandanforth 7 месяцев назад ⁺¹
Very exciting! Thanks for all the hard work. (and fun Easter eggs)
@AnugrahPrahasta 7 месяцев назад ⁺¹
One of the best RAG opensource I installed.
@Weaviate 7 месяцев назад ⁺¹
facts 💚
@AlangHsu 7 месяцев назад ⁺²
Thank you for the open-source project. It's great.
@wojciechperchuc2734 7 месяцев назад ⁺¹
Love the background music ❤ You can feel the Berlin vibe ;)
@MrAnkitnakra 6 месяцев назад
Very helpful , I am able to set up on local host , however not able to ingest data, trying to upload a PDF
@arpitaingermany 3 месяца назад ⁺¹
I am not able to view the Overview
@jagrat12354 5 месяцев назад ⁺¹
I wonder if it can read data from a sql database directly ?
@rodericksweet6546 7 месяцев назад ⁺¹
This is exactly what I have been looking for. However, I install it and none of the variables seem to populate the application. At lease none are showing.
@JenuelDevTutors 7 месяцев назад ⁺³
is their an API where I can use to upload data? rather than uploading it in the admin ui. and also is their a way to access chat through api as well so that I can use the chat inside any website or apps?
@kyudechama 7 месяцев назад ⁺¹
I would like to know as well!
@jordan-kz3rx 4 месяца назад ⁺¹
It seems that the chat is calling @app.websocket("/ws/generate_stream") which is ran on the server at localhost:8000
@saulyarhi675 6 месяцев назад
This is beautiful. I'm working with 5 classmates (electromechanical and software engineering college) on a proyect, we developed a tiny robot able to chat with patients as a co therapist, using a raspberry pi and a LLM. But the hallucinations are way too dangerous here, so i suggested to my team we start implementing RAG. Generating and "validating" the psychology database is really, really, really time consuming, it's hard, tricky and it takes a long time to have good quality examples, but we are pretty sure it's gonna be 100% worth it.
I just had knowledge of Ollama and i would love to try out Verba in our prototype, so people in need can start getting attention and we can right away start recollecting data from the final model already deployed.
I would love to collaborate with you guys, I'm such an enthusiast of opensource communities and corporations, and I loved the concept you evoke so much.
@Pregidth 6 месяцев назад
This is very cool! Thank you! Can I distribute just the chat interface without the configuration behind?
@stebansb 7 месяцев назад ⁺³
hey, this is awesome. Also love the end of the video, brings back memories!
@philipvollet 7 месяцев назад
ruclips.net/video/Qra1oWdJQPs/видео.htmlsi=outDexl5AGXlNTOW
@christenjacquottet9799 7 месяцев назад ⁺¹
Is it more recommended to break down your markdown blogs into separate files rather than one big file to ingest? I tried with one big file and didn’t get accurate results
@m.c.4458 4 месяца назад
I have been making my own local rag. for my professon. Just using prompt engineering :P I know how hard this is to achieve.
@freddiechipres 7 месяцев назад
Awesome app you guys. Is it possible to add OCR capability?
@kanunssol1246 7 месяцев назад
Does this have user limitation (user accounts) like openwebui and Danswer? Please reply. Thanks
@MrRaja 7 месяцев назад
How do I speed up the Vectorizing of Documents?
@blueedu4958 7 месяцев назад ⁺¹
all your base are belong to us! Brilliant 🙂😍😎😎😎
@Weaviate 7 месяцев назад
Yes they are!
@philipvollet 7 месяцев назад
ruclips.net/video/Qra1oWdJQPs/видео.html
@KOTAGIRISIVAKUMAR 6 месяцев назад
can anyone help me with the alternatives to the verba?
@TechTrek-su7hl 7 месяцев назад
Could you let us know how did you create this animation/gif please?
@tlfmcooper 7 месяцев назад
This looks great. Can I deploy verba to the cloud? Please provide a link to the resource if available
@tusharbhatnagar8143 7 месяцев назад ⁺²
Quick question. Does setting up and using Verba support Windows or WSL? Also, what exactly is the process. Does it simply work like a RAG app off the shelf after setup or we need to have weviate DB running on the side as well.
@Weaviate 7 месяцев назад ⁺³
Weaviate Embedded isn't currently supported on Windows but we're working on it! On other devices, Weaviate Embedded is setup automatically and locally in the background when installing Verba, but you also got other deployment options such as Docker or using a Free Sandbox Cluster Hosted on our Cloud Platform (console.weaviate.cloud/)
@tusharbhatnagar8143 7 месяцев назад ⁺¹
@@Weaviate Got it. Will have to wait it out then to try it on Windows or WSL as those are the primary devices at my org.
@benwatson5211 7 месяцев назад
I saw that people were requesting a windows deployment almost 12 months ago. Are you actively working on this or not? @weaviate
@tusharbhatnagar8143 7 месяцев назад ⁺¹
@@benwatson5211 I don't think they are. They just replied to me yesterday about the status and incompatibility.
@trvsgrant 7 месяцев назад
How is this different than chatrtx?
@DerekDickerson 7 месяцев назад ⁺³
the installer and the env information needs allot of work
@SamiBenSalah 7 месяцев назад
the use of GPU is highly recommended but not so clear. I am using Verba on my WSL on Windows but as it is using only CPU, it is kind of slow. How can I plug my GPU to help?
@m.c.4458 4 месяца назад
kuda/ nvidia - check what you have, mske sure of compatibikity with your python version and package - it is a big job for windows users it took me ages to activate kuda. but not with this program.
@Mr6499 7 месяцев назад ⁺¹
Not free ! you're giving your Base to Weaviate!
@JimMendenhall 7 месяцев назад ⁺¹
Very nice work!
@marilynlucas5128 7 месяцев назад ⁺¹
Good job guys!
@mikestaub 7 месяцев назад
Great job!
@botondvasvari5758 7 месяцев назад ⁺¹
demo is empty
@martin22336 7 месяцев назад ⁺²
Useful with small models like phi3.
@Adante. 7 месяцев назад
Can this be connected to via an api for external apps? eg: Automation of emailing facts about ingested data to someone interested/able to receive email responses
@igorshingelevich7627 7 месяцев назад
Sounds interesting.
@googleyoutubechannel8554 7 месяцев назад
Nice system, great that it works with ollama. I think like everyone who isn't openai, we want rag to work... but it just doesn't. I've come to the conclusion that it basically just 'can't' work, embedding dbs just don't represent information in 'connected enough' way to make an nl 'query' successful in 99% of use cases. And for the 1% where rag does work... keywords also seem to work just as well....
The sooner funded companies like weaviate accept that current rag just doesn't work, the better chance we have of the hard work of creating a system that can work... and basically you're probably going to have to 'train' self contained embeddings against a more general model in lora-like fashion, to have any hope of teasing out the actual relationships 'activations', that will give a natural language query against unstructured data a chance.
@restrollar8548 6 месяцев назад ⁺¹
Like the tech, but the health use case is really clunky and very simple. Medical data is messy and you would obviously be asking patients most of these questions, not an LLM!
@ApeOfGod1 7 месяцев назад ⁺¹
3.1.0 != 3.10.0.
@arpitaingermany 3 месяца назад
But again sharing private info to this company can be dangerous, so uploading documents is a doubt
@handler007 7 месяцев назад
ohhh... buttons. NAH
@AtomicPixels 7 месяцев назад
Don’t use rag. Use efficient decision graph networks that actually work like soul reasoning

Следующие

Автовоспроизведение