Mistral-7B with LocalGPT: Chat with YOUR Documents

Prompt Engineering

53K views

Published: Oct 2, 2024
In this video, I will show you how to use the newly released Mistral-7B by Mistral AI as part of LocalGPT. LocalGPT lets you chat with your own documents. We will also go over some of the new updates to the project.
If you like the repo, don't forget to give it a ⭐
💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
#localGPT #mistral #mistral-7B #langchain
CONNECT:
☕ Buy me a Coffee: ko-fi.com/prom...
🔴 Support my work on Patreon: Patreon.com/PromptEngineering
🦾 Discord: / discord
📧 Business Contact: engineerprompt@gmail.com
💼Consulting: calendly.com/e...
LINKS:
LocalGPT Github: github.com/Pro...
LocalGPT Playlist: tinyurl.com/37...
Embedding Models: • Understanding Embeddin...
Text Splitters: • LangChain: How to Prop...

Comments • 57

@anuvratshukla7061 1 year ago ⁺²⁸
Can you make a video on how to use open-source LLMs as a chatbot on tabular data?
@ilyasssaadi7707 4 months ago
Would you mind recommending any videos you found related to your question?
10 months ago ⁺³
Thank you for this valuable training. I want to ask about languages other than English. What do you advise about building a LocalGPT in a non-English language?
@maxamad13 1 year ago ⁺³
Can you run this in LangChain or Flowise?
@alx8439 1 year ago ⁺²
Please make a comparison of your project with the "h2o gpt" project.
@derarfares 1 year ago ⁺¹
Thanks, it is a good video. Is there a suggestion to make the response faster? I tested with an Nvidia GeForce RTX 3050.
@Techonsapevole 1 year ago ⁺¹
Cool. Is it possible to use it in the oobabooga text generation UI?
@engineerprompt 1 year ago
I believe so, yes.
@michaelkirk4173 8 months ago
"I'm using this on mac"
Buddy just buy a computer, this is basically irrelevant to the world when you are using CPU.
@henkhbit5748 1 year ago ⁺¹
Thanks for showing RAG with Mistral. Why do you advise using GPTQ instead of GGUF when you have a GPU?
@engineerprompt 1 year ago ⁺⁵
From my understanding, GPTQ models are optimized specifically for NVIDIA GPUs. GGUF supports both CPU and GPU, but I have seen GPTQ perform better on GPUs (speed-wise).
@gaganpreetsingh-6453 8 months ago
@@engineerprompt I tried the GGUF format, but it is only utilizing my CPU, not my GPU. Why is this happening? I guess it is possible to add GPU layers to it, right?
@wilfredomartel7781 6 months ago
😊
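On the GGUF question above: with llama-cpp-python, a GGUF model stays entirely on the CPU unless some layers are offloaded through the `n_gpu_layers` argument, and offloading only takes effect if the package was built with GPU support. The sketch below is not LocalGPT's own loader. The helper names, the default of 32 layers, and the context and batch sizes are illustrative assumptions. Only `model_path`, `n_ctx`, `n_batch` and `n_gpu_layers` are actual `llama_cpp.Llama` arguments.

```python
from typing import Any, Dict


def llama_cpp_kwargs(model_path: str, device: str, n_gpu_layers: int = 32) -> Dict[str, Any]:
    """Build keyword arguments for llama_cpp.Llama (hypothetical helper).

    The default of 32 offloaded layers is an illustrative value, not a
    recommendation; lower it if the GPU runs out of memory.
    """
    return {
        "model_path": model_path,
        "n_ctx": 4096,    # context window in tokens (assumed value)
        "n_batch": 512,   # prompt tokens processed per batch (assumed value)
        # 0 keeps every layer on the CPU; a positive value offloads that many layers.
        "n_gpu_layers": n_gpu_layers if device in ("cuda", "mps") else 0,
    }


def load_gguf_model(model_path: str, device: str):
    """Load a GGUF model; requires llama-cpp-python built with GPU support."""
    from llama_cpp import Llama  # imported lazily so the helper above works without it

    return Llama(**llama_cpp_kwargs(model_path, device))
```

If the model still runs only on the CPU with a positive `n_gpu_layers`, the installed llama-cpp-python build is the first thing to check.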
@syedluqman 11 months ago ⁺²
Hi - Thanks for uploading. Why do I get this error while running your model?
super().__init__(**kwargs)
File "pydantic/main.py", line 341, in pydantic.main.BaseModel.__init__
pydantic.error_wrappers.ValidationError: 1 validation error for LLMChain
llm
none is not an allowed value (type=type_error.none.not_allowed)
@logicalm4th 10 months ago
Have you managed to fix this? I am getting the same error when running with Mistral. Any help/advice would be appreciated.
@syedluqman 10 months ago
Hi - Yes I have. On Windows, it worked when I changed the 'mps' to 'cpu'. On a MacBook it needed the llama-cpp-python library to be installed. Hope it helps; if not, let me know and I can look into the error.
@user-wy7lb8uo8i 10 months ago
@@logicalm4th I'm struggling with the same issue as well. Did you find a solution?
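The fix reported in this thread (switching 'mps' to 'cpu' on Windows) suggests the model failed to load because the requested device did not exist on that machine. A minimal sketch of picking a device that is actually available, assuming PyTorch may or may not be installed. `pick_device` and `detect_device` are hypothetical helper names, not part of LocalGPT.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Return the preferred device type given what the machine supports."""
    if cuda_available:
        return "cuda"  # NVIDIA GPU
    if mps_available:
        return "mps"   # Apple Silicon GPU (macOS only)
    return "cpu"       # safe fallback on any platform


def detect_device() -> str:
    """Probe PyTorch for available backends; fall back to CPU without it."""
    try:
        import torch
    except ImportError:
        return "cpu"
    mps = getattr(torch.backends, "mps", None)  # missing on older PyTorch builds
    mps_ok = bool(mps is not None and mps.is_available())
    return pick_device(torch.cuda.is_available(), mps_ok)
```

On a Windows machine without an NVIDIA GPU this returns "cpu", which matches the change that worked for the commenter.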
@birb2726 7 months ago
So you just implemented Llama along with a RAG approach to the prompts, right?
@satyajamalla1429 7 months ago
Nice video. How can we test the model with test data? How can we ensure that it is generating data correctly?
@livb4139 1 year ago ⁺¹
Is a 3070 enough to run the model?
@zhaojieyin9771 10 months ago
When I tested the code, it always returned "Split into 0 chunks of text". Does anyone know what causes this?
@gjsxnobody7534 9 months ago ⁺¹
I'm still unclear about what we do with these models once they are fine-tuned on our data. Where do we put this file so it can be used by the public in a chat application, say on WordPress? Customers obviously don't want to log into a terminal; they go to a site, have a chatbot prompt them, and want that chatbot to reply to them personally. Is there software already out there that can accept a fine-tuned LLM? Can you suggest one that doesn't have a subscription, preferably for WP?
@mikefreeman6399 7 months ago
Oobanoogas text generation webui runs locally, just git clone, put models in the models folder, in parameters>characters tab, customize characters, etc etc.. I may have misspelled that, idk.
@gjsxnobody7534 7 months ago
@@mikefreeman6399 But that is still a terminal on your PC; even if you add an API, it's still a terminal-looking thing. I'm specifically asking about an application like a chat app that sits on a WordPress site, where a customer who is looking to buy something can ask a question on that site. Oobabooga is just the terminal to the model. I hope I'm explaining myself. But to clarify further: if you go on any random site to buy something, say toothbrushes, and you need to ask a specific question about their toothbrush on their site, you don't want to go to another page to chat with the Oobabooga interface; you just want a small chatbox on the side with a "live person", or AI in this case, right?
@kingfunny4821 10 months ago
Can you make a video on how to convert this model to an exe?
@wilfredomartel7781 6 months ago
😊😊 How much RAM is needed to run this model?
@engineerprompt 6 months ago
The quantized one will need about 4-6GB (4-bit).
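The 4-6GB figure is consistent with a back-of-the-envelope estimate: the weights alone take roughly (number of parameters × bits per weight ÷ 8) bytes. The sketch below assumes a round 7 billion parameters and counts only the weights. The context cache and runtime overhead come on top and are not modeled here.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 7e9  # assumed round figure for a "7B" model

fp16_gb = weight_memory_gb(N_PARAMS, 16)  # unquantized half precision
int4_gb = weight_memory_gb(N_PARAMS, 4)   # 4-bit quantized

print(f"16-bit weights: {fp16_gb:.1f} GB")
print(f" 4-bit weights: {int4_gb:.1f} GB")
```

Under these assumptions the 4-bit weights come to about 3.5 GB, so a total of 4-6 GB once overhead is included is plausible.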
@capravasranjan2121 11 months ago
Can you make a video on how to train on our personal documents (PDF or text)?
@llamamaguluri 1 year ago
The program is running with the internet. Can we instead run LocalGPT without the internet? Please tell us how to do that.
@umangternate 9 months ago
I tried to run this on a ThinkPad X250 (Core i5, Win11, 8GB RAM)... 😂 It ran like a dead turtle... So, please make a video about this, but instead of running a model locally, let's use the HF API, if possible. But keep the objective unchanged... Chat with multiple PDFs. It will be great for those who cannot afford a high-spec system.
@engineerprompt 9 months ago ⁺²
I agree. You might be able to run the highly quantized version, though. 2024 is going to be awesome for local models.
@echofloripa 1 year ago
Wow, isn't 2-bit quantization too few possible values for the weights?
@engineerprompt 1 year ago
Yes but if you have 7B parameters, the network might still be able to preserve some of the learnings
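To make the question concrete: with b bits, each weight can take at most 2^b distinct values, so 2-bit quantization leaves only four levels. The toy sketch below uses plain uniform quantization over a made-up list of weights. It is an illustration only; real quantized formats typically store per-block scale factors rather than one global range.

```python
from typing import List


def quantize_uniform(weights: List[float], bits: int) -> List[float]:
    """Snap each weight to the nearest of 2**bits evenly spaced levels."""
    levels = 2 ** bits
    lo, hi = min(weights), max(weights)
    if hi == lo:
        return list(weights)  # a constant tensor needs no quantization
    step = (hi - lo) / (levels - 1)
    return [lo + round((w - lo) / step) * step for w in weights]


# Made-up example weights, not taken from any real model.
weights = [-0.9, -0.5, -0.1, 0.1, 0.4, 0.9]
q2 = quantize_uniform(weights, 2)  # at most 4 distinct values
q4 = quantize_uniform(weights, 4)  # at most 16 distinct values

worst_error_2bit = max(abs(w - q) for w, q in zip(weights, q2))
worst_error_4bit = max(abs(w - q) for w, q in zip(weights, q4))
print(q2, worst_error_2bit)
print(q4, worst_error_4bit)
```

The rounding error per weight is bounded by half the spacing between levels, which is why fewer bits mean coarser weights.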
@shivamverma-wm3vv 11 months ago
I get an "index out of range" error. Why is this?
@timtensor6994 10 months ago
Have you thought of having a Colab notebook?
@WilsonCely 1 year ago
Thanks!! Awesome video. Is there a way to do it in Google Colab?
@Nihilvs 1 year ago ⁺⁴
Thank you so much for providing us with the updated code for Mistral! I have tested Mistral vs. Llama-2 chat on long texts about philosophy, and in my case Llama-2 seems to be doing better at understanding them at the moment. Thank you for developing this project!
@philipcoffee 1 year ago ⁺²
Still working on my project, which is similar. My long text comprises four hundred thousand Chinese characters.
@SMFahim-vo5zn 11 months ago
Why is it called "GPT"? Does it use any API key to interact with GPT models? If yes, then why do you need other LLMs with it? If not, then what does it do that makes the other LLMs work like a charm? Like, it just takes a document and extracts answers for unseen questions.
Sorry for my newbie question; I'm exploring this topic for the first day.
@minhthang6789 11 months ago ⁺¹
GPT stands for Generative Pre-trained Transformer; it is not owned by OpenAI.
@ShaileshSarda-m6z 8 months ago
How can the LLM's response time be optimized?
@sourcejosh 1 year ago
Is an RTX 4070 good enough to use a GPU model?
@filemonek123 9 months ago
If I ingest fileA and then I want to create another GPT instance with a different knowledge base, separate from the earlier one, should I just rerun the ingest with replaced files, or do I need to create a separate conda environment?
@engineerprompt 9 months ago ⁺¹
Currently, you will need to delete the "DB" folder and run ingest again. In the constants.py file, you can set the folder name of the DB you want to create/use.
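The reset described in this reply can be scripted. The sketch below is a minimal helper that deletes an existing vector-store folder so the next ingest run starts from an empty index. `reset_vector_store` is a hypothetical name, and the "DB" path is taken from the reply above. Check the folder name configured in constants.py before running it, because the deletion is permanent.

```python
import shutil
from pathlib import Path


def reset_vector_store(persist_dir: str) -> bool:
    """Delete the vector-store folder if it exists.

    Returns True when a folder was removed and False when there was
    nothing to delete. The deletion cannot be undone.
    """
    path = Path(persist_dir)
    if path.is_dir():
        shutil.rmtree(path)
        return True
    return False


# Example (assumed folder name from the reply above); run the ingest step afterwards:
# reset_vector_store("DB")
```

Pointing the DB folder setting at a different name for each document set would keep several indexes side by side without deleting anything.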
@mohsenghafari7652 4 months ago
Does it work with the Persian language? Thanks.
@engineerprompt 4 months ago
You can use something like Aya for Persian:
cohere.com/research/aya
@mohsenghafari7652 4 months ago
@@engineerprompt Thanks.
@llamamaguluri 1 year ago
Hi, is internet access required to run the model?
@oskar4239 9 months ago
Thanks so much!
@debatradas1597 1 year ago
Thanks
@KyrinWu 11 months ago
How do I get an API key for Mistral?
@quentin5035 8 months ago
On their website.
@hikaroto2791 11 months ago
God Bless you
