Open Source RAG running LLMs locally with Ollama

Поделиться
HTML-код
  • Опубликовано: 20 окт 2024

Комментарии • 55

  • @JohnPamplin
    @JohnPamplin 4 месяца назад +3

    Your "All Your Base Are Belong To Us" ending just earned a subscription from me. WHAT YOU SAY!

  • @kapkanfps3694
    @kapkanfps3694 5 месяцев назад +7

    Might have to give a try, the ending is hilarious nostalgic 😂

  • @divyaraj-rana
    @divyaraj-rana 2 месяца назад

    Really amazing innovation by Weaviate team! Their workshops speaks about their groundbreaking applications and making it open-source.

  • @mohz832
    @mohz832 4 месяца назад +3

    Why the layout of the installed version via pip is not the same as your demo? Also, how can we use PDF files without an API key from Unstructured? I believe this is still a showstopper for most of us.

  • @wojciechperchuc2734
    @wojciechperchuc2734 4 месяца назад +1

    Love the background music ❤ You can feel the Berlin vibe ;)

  • @AnugrahPrahasta
    @AnugrahPrahasta 5 месяцев назад +1

    One of the best RAG opensource I installed.

    • @Weaviate
      @Weaviate  5 месяцев назад +1

      facts 💚

  • @iandanforth
    @iandanforth 5 месяцев назад +1

    Very exciting! Thanks for all the hard work. (and fun Easter eggs)

  • @AlangHsu
    @AlangHsu 5 месяцев назад +2

    Thank you for the open-source project. It's great.

  • @saulyarhi675
    @saulyarhi675 4 месяца назад

    This is beautiful. I'm working with 5 classmates (electromechanical and software engineering college) on a proyect, we developed a tiny robot able to chat with patients as a co therapist, using a raspberry pi and a LLM. But the hallucinations are way too dangerous here, so i suggested to my team we start implementing RAG. Generating and "validating" the psychology database is really, really, really time consuming, it's hard, tricky and it takes a long time to have good quality examples, but we are pretty sure it's gonna be 100% worth it.
    I just had knowledge of Ollama and i would love to try out Verba in our prototype, so people in need can start getting attention and we can right away start recollecting data from the final model already deployed.
    I would love to collaborate with you guys, I'm such an enthusiast of opensource communities and corporations, and I loved the concept you evoke so much.

  • @JenuelDevTutors
    @JenuelDevTutors 4 месяца назад +3

    is their an API where I can use to upload data? rather than uploading it in the admin ui. and also is their a way to access chat through api as well so that I can use the chat inside any website or apps?

    • @kyudechama
      @kyudechama 4 месяца назад +1

      I would like to know as well!

    • @jordan-kz3rx
      @jordan-kz3rx Месяц назад +1

      It seems that the chat is calling @app.websocket("/ws/generate_stream") which is ran on the server at localhost:8000

  • @christenjacquottet9799
    @christenjacquottet9799 5 месяцев назад +1

    Is it more recommended to break down your markdown blogs into separate files rather than one big file to ingest? I tried with one big file and didn’t get accurate results

  • @jagrat12354
    @jagrat12354 2 месяца назад +1

    I wonder if it can read data from a sql database directly ?

  • @blueedu4958
    @blueedu4958 5 месяцев назад +1

    all your base are belong to us! Brilliant 🙂😍😎😎😎

    • @Weaviate
      @Weaviate  5 месяцев назад

      Yes they are!

    • @philipvollet
      @philipvollet 5 месяцев назад

      ruclips.net/video/Qra1oWdJQPs/видео.html

  • @Pregidth
    @Pregidth 4 месяца назад

    This is very cool! Thank you! Can I distribute just the chat interface without the configuration behind?

  • @MrAnkitnakra
    @MrAnkitnakra 3 месяца назад

    Very helpful , I am able to set up on local host , however not able to ingest data, trying to upload a PDF

  • @tusharbhatnagar8143
    @tusharbhatnagar8143 5 месяцев назад +2

    Quick question. Does setting up and using Verba support Windows or WSL? Also, what exactly is the process. Does it simply work like a RAG app off the shelf after setup or we need to have weviate DB running on the side as well.

    • @Weaviate
      @Weaviate  5 месяцев назад +3

      Weaviate Embedded isn't currently supported on Windows but we're working on it! On other devices, Weaviate Embedded is setup automatically and locally in the background when installing Verba, but you also got other deployment options such as Docker or using a Free Sandbox Cluster Hosted on our Cloud Platform (console.weaviate.cloud/)

    • @tusharbhatnagar8143
      @tusharbhatnagar8143 5 месяцев назад +1

      @@Weaviate Got it. Will have to wait it out then to try it on Windows or WSL as those are the primary devices at my org.

    • @benwatson5211
      @benwatson5211 5 месяцев назад

      I saw that people were requesting a windows deployment almost 12 months ago. Are you actively working on this or not? @weaviate

    • @tusharbhatnagar8143
      @tusharbhatnagar8143 5 месяцев назад +1

      @@benwatson5211 I don't think they are. They just replied to me yesterday about the status and incompatibility.

  • @stebansb
    @stebansb 5 месяцев назад +3

    hey, this is awesome. Also love the end of the video, brings back memories!

    • @philipvollet
      @philipvollet 5 месяцев назад

      ruclips.net/video/Qra1oWdJQPs/видео.htmlsi=outDexl5AGXlNTOW

  • @TechTrek-su7hl
    @TechTrek-su7hl 5 месяцев назад

    Could you let us know how did you create this animation/gif please?

  • @rodericksweet6546
    @rodericksweet6546 5 месяцев назад

    This is exactly what I have been looking for. However, I install it and none of the variables seem to populate the application. At lease none are showing.

  • @kanunssol1246
    @kanunssol1246 4 месяца назад

    Does this have user limitation (user accounts) like openwebui and Danswer? Please reply. Thanks

  • @arpitaingermany
    @arpitaingermany Месяц назад

    I am not able to view the Overview

  • @freddiechipres
    @freddiechipres 4 месяца назад

    Awesome app you guys. Is it possible to add OCR capability?

  • @MrRaja
    @MrRaja 4 месяца назад

    How do I speed up the Vectorizing of Documents?

  • @tlfmcooper
    @tlfmcooper 5 месяцев назад

    This looks great. Can I deploy verba to the cloud? Please provide a link to the resource if available

  • @Hits-Sandbox
    @Hits-Sandbox 4 месяца назад

    Wow, Victoria, is a natural in front of the Camera, perfect presentation and body language movement just right for the level of presentation required. It almost looks like she went to school for acting, NLP and Micro Expression presentation for marketing.

  • @m.c.4458
    @m.c.4458 Месяц назад

    I have been making my own local rag. for my professon. Just using prompt engineering :P I know how hard this is to achieve.

  • @SamiBenSalah
    @SamiBenSalah 4 месяца назад

    the use of GPU is highly recommended but not so clear. I am using Verba on my WSL on Windows but as it is using only CPU, it is kind of slow. How can I plug my GPU to help?

    • @m.c.4458
      @m.c.4458 Месяц назад

      kuda/ nvidia - check what you have, mske sure of compatibikity with your python version and package - it is a big job for windows users it took me ages to activate kuda. but not with this program.

  • @KOTAGIRISIVAKUMAR
    @KOTAGIRISIVAKUMAR 4 месяца назад

    can anyone help me with the alternatives to the verba?

  • @JimMendenhall
    @JimMendenhall 5 месяцев назад +1

    Very nice work!

  • @marilynlucas5128
    @marilynlucas5128 5 месяцев назад +1

    Good job guys!

  • @trvsgrant
    @trvsgrant 4 месяца назад

    How is this different than chatrtx?

  • @DerekDickerson
    @DerekDickerson 5 месяцев назад +3

    the installer and the env information needs allot of work

  • @mikestaub
    @mikestaub 4 месяца назад

    Great job!

  • @Mr6499
    @Mr6499 4 месяца назад +1

    Not free ! you're giving your Base to Weaviate!

  • @igorshingelevich7627
    @igorshingelevich7627 4 месяца назад

    Sounds interesting.

  • @botondvasvari5758
    @botondvasvari5758 5 месяцев назад +1

    demo is empty

  • @martin22336
    @martin22336 5 месяцев назад +2

    Useful with small models like phi3.

  • @Adante.
    @Adante. 5 месяцев назад

    Can this be connected to via an api for external apps? eg: Automation of emailing facts about ingested data to someone interested/able to receive email responses

  • @googleyoutubechannel8554
    @googleyoutubechannel8554 5 месяцев назад

    Nice system, great that it works with ollama. I think like everyone who isn't openai, we want rag to work... but it just doesn't. I've come to the conclusion that it basically just 'can't' work, embedding dbs just don't represent information in 'connected enough' way to make an nl 'query' successful in 99% of use cases. And for the 1% where rag does work... keywords also seem to work just as well....
    The sooner funded companies like weaviate accept that current rag just doesn't work, the better chance we have of the hard work of creating a system that can work... and basically you're probably going to have to 'train' self contained embeddings against a more general model in lora-like fashion, to have any hope of teasing out the actual relationships 'activations', that will give a natural language query against unstructured data a chance.

  • @ApeOfGod1
    @ApeOfGod1 5 месяцев назад +1

    3.1.0 != 3.10.0.

  • @arpitaingermany
    @arpitaingermany Месяц назад

    But again sharing private info to this company can be dangerous, so uploading documents is a doubt

  • @restrollar8548
    @restrollar8548 3 месяца назад

    Like the tech, but the health use case is really clunky and very simple. Medical data is messy and you would obviously be asking patients most of these questions, not an LLM!

  • @handler007
    @handler007 4 месяца назад

    ohhh... buttons. NAH

  • @AtomicPixels
    @AtomicPixels 4 месяца назад

    Don’t use rag. Use efficient decision graph networks that actually work like soul reasoning