Massive Update to Local GPT-Now with Vision Models!

Поделиться
HTML-код
  • Опубликовано: 21 сен 2024

Комментарии • 24

  • @kai_s1985
    @kai_s1985 День назад +2

    Thanks again for the great work! I have tested similar approach with the vision model. It is especially good for pdf's with lots of unstructured data like graphs, plots, pictures, text, etc... One limitation for this approach was when I created a chatbot and wanted to get the hyperlink within the documents I couldn't because the url of the hyperlink is not visible in the image, but it was not a problem when I used markdown with the standard text based RAG system.
    Questions:
    - how many pdf's can I upload? Is there any size limit?
    - Does the chatbot has a memory of the current conversation? If so, how are you handling it?

  • @akashnagarkar7560
    @akashnagarkar7560 20 часов назад

    Would love a video about the detailed architecture and code explanation. Thanks.

  • @kenchang3456
    @kenchang3456 2 дня назад

    Indeed, this is an amazing project. I'll check out the code and give try. Thank you very much for sharing, there's a lot to learn from this one.

  • @MeinDeutschkurs
    @MeinDeutschkurs 2 дня назад

    Wooohoooo!!! This is so cool! I need more time, I definitely have to test it!!!!

  • @Elingsanto
    @Elingsanto День назад

    Cool! Is there a context window or any strict limit on the quantity of pages or images that can be uploaded?
    WIll try it out

  • @magmikefpv
    @magmikefpv 2 дня назад +1

    This is amazing ! Thanks will try it out

  • @bwljustus8077
    @bwljustus8077 День назад +1

    ERROR - models.indexer - Error during indexing: Unable to get page count. Any ideas?

  • @ysy69
    @ysy69 День назад

    This is awesome. Very grateful. What is your local setup, GPU?

  • @trevorbaylis7423
    @trevorbaylis7423 16 часов назад

    What would be the complexity level to combining Verbi and Local GPT --Vision? Is this a realistic possibility?

  • @thenextension9160
    @thenextension9160 День назад

    Very nice work

  • @nyliveechay-so3ps
    @nyliveechay-so3ps 2 дня назад

    Pdf document format is specific right, so maybe posssible to compare results just using that formatted content data?
    It's closed, owned, controlled by Adobe correct?
    So why do this?

  • @awesomedata8973
    @awesomedata8973 День назад +1

    Any chance you can input the new Mistral Pixtral model in your software? -- It seems to be the best version of a local model for vision, and it's based on Nemo.

    • @engineerprompt
      @engineerprompt  День назад +1

      Yes, I think it can be added. Will have a look into it.

  • @bwljustus8077
    @bwljustus8077 День назад

    If poppler is missing under Windows, use: choco install poppler

  • @nyliveechay-so3ps
    @nyliveechay-so3ps 2 дня назад

    Great stuff though!! Nice work!

  • @TeamDman
    @TeamDman День назад

    VERY cool!

  • @brianhopson2072
    @brianhopson2072 День назад

    I like the concept of this, but I don't like the original model selection. Can you add other open ai api's like 4o?

    • @engineerprompt
      @engineerprompt  День назад

      Yes, will update the list with more models

  • @RyanSmith-rb1ch
    @RyanSmith-rb1ch День назад

    I think google-generativeai is misspelled as google-generative-ai in the requirements.txt

  • @MagagnaJayzxui
    @MagagnaJayzxui 12 часов назад

    Qwen2.5 VL 72b support?