Hi Shweta Lodha! First... Thanks a lot for the tutorial! :) I have a problem with an error. Can you help me? I wrote the same code and used the same data, but when I try to run it, this error appears: TypeError: __init__() got an unexpected keyword argument 'documents'. I printed the variable and it contains the book text... The error is on the line: vectorIndex = GPTSimpleVectorIndex(documents=docs,llm_predictor=llmPredictor,prompt_helper=prompt_helper). I looked for it on the internet but didn't understand why there's a problem with the arg "documents"... :/
I solved it! :D For anyone who had the same problem I had, here is my function:

def createVectorIndex(path):
    max_input = 1024
    tokens = 256
    chunk_size = 600
    max_chunk_overlap = 20

    #define LLM
    llm_predictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001"))

    #load data
    docs = SimpleDirectoryReader(path).load_data()

    service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, chunk_size_limit=512)
    vectorIndex = GPTSimpleVectorIndex.from_documents(docs, service_context=service_context)
    vectorIndex.save_to_disk('vectorIndex.json')
    return vectorIndex

The fix is that I removed this line:

    prompt_helper = PromptHelper(max_input,tokens,max_chunk_overlap,chunk_size_limit=chunk_size)

Could that be a problem? It worked here without it, and with other texts too... If it is a problem, could anyone explain that part of the code?
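The removed line can be explained with a small plain-Python sketch. The parameter names follow the positional order of the gpt_index-era signature PromptHelper(max_input_size, num_output, max_chunk_overlap, chunk_size_limit=...); the budget arithmetic below is illustrative, not taken from the library:

```python
# Informal walkthrough of the PromptHelper arguments, using the values
# from the function above. Names mirror the old positional signature.

max_input = 1024        # max tokens the model accepts per request
tokens = 256            # tokens reserved for the generated answer
max_chunk_overlap = 20  # tokens shared between consecutive chunks
chunk_size = 600        # upper bound on each text chunk sent for indexing

# Tokens left for context once the answer budget is reserved:
context_budget = max_input - tokens
print(context_budget)  # 768
```

Omitting the helper generally just means the library falls back to its own defaults, which is why the code still worked without it.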
Hi! This is a very helpful tutorial. I have a question: how would you bring this to a website? I mean, creating a chat website where you can ask questions and the bot answers based on your custom data. Is it possible? Thanks in advance :)
That's a great video, thanks for sharing it. I have a question if you don't mind: how different is this method from using the OpenAI APIs directly for chat completion, embedding and completion? Thanks.
@@shweta-lodha Thanks a lot for the prompt reply. One last question: under the hood, the method you explained uses the text-embedding-ada-002 model for embedding and text-davinci-003 for completion, right?
@Shweta, awesome video, and thanks for sharing. I have data in a CSV which has many text columns and many rows. I want to build a chat application based on that data. Can you please let me know how I can implement this?
hey, any tips on how to fine-tune a model based on a very large PDF document without the " " to split prompt/resolution? I thought maybe a script could break it down at every question mark? Or is there some other way?
could this learn from new unlabeled questions provided by the user? Is this a trainable chatbot that could learn new things every time the user asks a new question?
Hi, I did it successfully! Thank you so much! However, is it possible to start a chat like in the last part of the video, save where it stopped, and come back to it at another moment?
Madam, I am from India. After doing tons of research on the ChatGPT API, this is the best! I am working on a Linux platform and have Jupyter Notebook; will it work in that environment?
Hi Shweta, this is a very helpful tutorial. I tried this code, but after asking a question it is not responding... I waited for 15 minutes and still got no response.
Hi, I need your help. I have followed exactly the same steps but am facing the issue below. I ran pip install gpt_index and got the same response as shown in your video, but when I run from gpt_index import SimpleDirectoryReader I get the error Module not found "gpt_index". I tried uninstalling, re-installing, and checking on ChatGPT, but everything failed. Please help!
I got this error: "TypeError: BaseGPTIndex.__init__() got an unexpected keyword argument 'documents'". Don't know if I will get any assistance, what the heck.
I am getting this error while running the code:

Output exceeds the size limit. Open the full output data in a text editor.
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Cell In[17], line 1
----> 1 vectorIndex = createVectorIndex('Chatbot')

Cell In[14], line 16, in createVectorIndex(path)
     13 docs = SimpleDirectoryReader(path).load_data()
     15 #create vector index
---> 16 vectorIndex = GPTSimpleVectorIndex(documents=docs,llmPredictor=llmPredictor,prompt_helper=prompt_helper)
     17 vectorIndex.save_to_disk('vectorIndex.json')
     18 return vectorIndex
How do I restrict the bot to only search for answers in the data provided, or make it return an error if the question asked is outside the available data?
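One common approach, sketched here in plain Python with template wording that is purely illustrative, is to wrap the retrieved context in a restrictive prompt and tell the model to refuse anything outside it:

```python
# Sketch of a restrictive prompt wrapper. The refusal text and helper
# name are illustrative, not from any specific library.

REFUSAL = "I don't know based on the provided documents."

def build_restricted_prompt(context: str, question: str) -> str:
    """Wrap retrieved context in instructions that forbid outside knowledge."""
    return (
        "Answer the question using ONLY the context below. "
        f"If the answer is not in the context, reply exactly: {REFUSAL}\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

prompt = build_restricted_prompt("Our plan costs $10/month.", "What is Earth?")
print(REFUSAL in prompt)  # True
```

Most versions of the index API accept a custom prompt template for the query step, so the same wording can usually be passed there; check the documentation for the version you have installed.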
After gpt-index was replaced by llama-index, I am getting this error during creation of the vector index. Wondering if anyone else is facing the same: INFO:openai:error_code=404 error_message='Resource not found' error_param=None error_type=None message='OpenAI API error received' stream_error=False
It is not mandatory to use a text file. You can use other file types too, provided you are able to read them and convert the text to vectors. There is no need to put it in a directory if it is a single file, but in that case you have to look for a different function.
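For the single-file case, a minimal standard-library sketch shows one way to read the text yourself before handing it to an indexer (the demo file is created on the fly just for illustration):

```python
# Read one file's text directly instead of scanning a directory.
from pathlib import Path
import tempfile
import os

def load_single_file(path: str) -> str:
    """Return the full text of one file."""
    return Path(path).read_text(encoding="utf-8")

# Demo with a temporary file standing in for your knowledge-base file:
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False,
                                 encoding="utf-8") as f:
    f.write("hello knowledge base")
    tmp = f.name

print(load_single_file(tmp))  # hello knowledge base
os.remove(tmp)
```

Reader classes of this era also accepted a list of specific files in many versions, but the exact argument name varies, so reading the file yourself is the safest baseline.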
I am having issues installing gpt_index. I install it using pip install, but when I do a pip list I see gpt-index (note the hyphen). I then cannot import gpt-index. Has anyone faced this problem? Thank you.
For this line: vectorIndex = GPTSimpleVectorIndex(documents=docs,llm_predictor=llmPredictor,prompt_helper=promptHelper), I am getting the following error: __init__() got an unexpected keyword argument 'documents'. Any tips?
@@shweta-lodha You are right. There appears to be some change in the new version of gpt_index. This modified function code worked for me. Thanks for this amazing tutorial, I was looking for something like this for a long time.

def create_index(path):
    max_input = 4096
    tokens = 200
    chunk_size = 600 #for LLM, we need to define chunk size
    max_chunk_overlap = 20

    #define prompt helper
    promptHelper = PromptHelper(max_input, tokens, max_chunk_overlap, chunk_size_limit=chunk_size)

    #define LLM - there could be many models we can use, but in this example, let's go with OpenAI model
    llmPredictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001", max_tokens=tokens))
    service_context = ServiceContext.from_defaults(llm_predictor=llmPredictor, prompt_helper=promptHelper)

    #load data - it will take all the .txt files, if there are more than 1
    docs = SimpleDirectoryReader(path).load_data()

    #create vector index
    vectorIndex = GPTSimpleVectorIndex.from_documents(documents=docs, service_context=service_context)
    vectorIndex.save_to_disk('vectorIndex.json')
    return vectorIndex
This tutorial truly stands out from the rest! After struggling with coding along other RUclips tutorials on GPT-index and langchain, I finally stumbled upon this gem, and behold, my code worked like a charm. Thanks for sharing your expertise and making it easy to follow. You're a lifesaver!
Thank you, Shweta. I was able to get something working based on your code. Please note that lots of libraries are now outdated (gpt_index is now llama_index). My complete code (works 100%!):
from langchain import OpenAI
import sys
import os
from llama_index import SimpleDirectoryReader,GPTListIndex,GPTVectorStoreIndex,LLMPredictor,PromptHelper,ServiceContext
from llama_index import StorageContext, load_index_from_storage
def create_index(path):
    max_input = 4096
    tokens = 200
    chunk_size = 600 #for LLM, we need to define chunk size
    max_chunk_overlap = 20
    prompt_helper = PromptHelper(max_input, tokens, max_chunk_overlap, chunk_size_limit=chunk_size) #define prompt
    llm_predictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001", max_tokens=tokens)) #define LLM
    docs = SimpleDirectoryReader(path).load_data() #load data
    service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, prompt_helper=prompt_helper)
    vectorIndex = GPTVectorStoreIndex.from_documents(
        docs, service_context=service_context
    )
    vectorIndex.storage_context.persist(persist_dir="storage")
    return vectorIndex
def answerMe():
    storage_context = StorageContext.from_defaults(persist_dir="storage")
    index = load_index_from_storage(storage_context)
    query_engine = index.as_query_engine()
    q = "What is the best plan?"
    print(q)
    print('------------')
    response = query_engine.query(q)
    print(response)

#create_index('data')
answerMe()
Thanks Gleb. I have created a few videos on how to fix these broken ones, and a few more are on the way. Please feel free to let me know if anything else is not working. I will try to provide solutions at my earliest convenience. Thanks once again.
Great bro... It was helpful to test it out.
@@shweta-lodha Currently I am getting a RateLimitError with whatever account I try. Is there a way I can bypass this and test it out? I have also tried with the code above, as mentioned by Gleb.
As someone who just learned how to use GPT-4o Mini to write Python code, I find your explanations well done.
Please cover the UI part as well in one of the coming videos. Thank you so much in advance.
Question about execution flow: Hi Shweta - Thanks for a very helpful tutorial. I'm trying to wrap my head around the execution flow. For example, createVectorIndex probably sends the document to OpenAI's servers, and the servers return embeddings that are then stored locally. During answerMe - is the prompt sent back to OpenAI for embeddings? Does answerMe again send document embeddings back to OpenAI's servers? How much is done in the local process (on the PC) vs. how much is done by OpenAI's servers? Thanks.
One way to understand this is by disabling the internet connection once you have embedding stored on your local device and then try to call answerMe. Doing this will clear all your doubts 😊
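To make the split concrete, here is an informal map, as a checkable snippet, of which steps run on the local machine and which call OpenAI's servers. It is based on the flow shown in the video; treat it as a sketch, not a specification:

```python
# Rough map of local vs. remote work in the index/query flow.
# "openai" marks steps that require a network call to OpenAI's API.
FLOW = {
    "read files from disk": "local",
    "compute document embeddings": "openai",
    "store vectors on disk": "local",
    "embed the user's question": "openai",
    "similarity search over stored vectors": "local",
    "compose the final answer (completion)": "openai",
}

remote_steps = [step for step, where in FLOW.items() if where == "openai"]
print(len(remote_steps))  # 3
```

This is also why the disconnect-the-internet experiment is informative: answerMe still needs two remote calls (question embedding and completion), so it fails offline even though the document vectors are local.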
Hi, I am facing this error: RetryError: RetryError[] while calling the answerMe function. Can you please help me out?
Your instructions are really good, thank you so much Shweta Ji
Hi Shweta, Awesome video, great learning. Thanks. Just one concern - when we grant access to ChatGpt to our custom data stored in our machine, then is there a risk that the data can be copied/ used by users of ChatGPT or Open AI or otherwise?
Thank you Swetha. It can’t get any better than your explanation with show and tell 👍
Thanks for your contribution, it's easy to understand for a beginner like me. Could you make a follow-up video showing how to ask questions and get answers continuously, like ChatGPT?
Thank you so much for this! You made something very intimidating to me super easy to understand. I am very grateful for the time and effort you put into this video. Thanks a lot!
I noticed that in the implementation shown in the video, the GPT model is being called every time a user prompt is received, and this can be expensive in a real-world scenario where the application is serving multiple users concurrently. Each request to the GPT model requires a certain number of OpenAI tokens, and this can quickly add up and become expensive.
Indeed! For production scenarios, you would have to use a vector database extensively.
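Until a vector database is in place, even a simple in-process cache avoids paying twice for the same question. This sketch stubs out the model call; in the real app the stub would be the query-engine call:

```python
# Minimal answer cache so repeated questions don't trigger a fresh
# (billable) model call. The model call here is a stand-in.
from functools import lru_cache

calls = {"count": 0}

def expensive_model_call(question: str) -> str:
    """Placeholder for the real OpenAI-backed query."""
    calls["count"] += 1
    return f"answer to: {question}"

@lru_cache(maxsize=256)
def cached_answer(question: str) -> str:
    return expensive_model_call(question)

cached_answer("What is the best plan?")
cached_answer("What is the best plan?")  # second call served from cache
print(calls["count"])  # 1
```

A real deployment would normalize the question text before caching and set a TTL, but the principle is the same: only novel questions should reach the paid API.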
Hi, thank you for the video! I have tried this code to answer questions based on data about a very specific product. It answers well, however, it answers general questions as well (such as "What is Earth?") having no information at all about this in the files provided. How can I make this code answer based on the information that I provided only?
Thanks for watching, Mikhail. Give this video of mine a try, it gives better results - Use Your Locally Stored Files To Get Response From GPT like ChatGPT | Python
Very good explanation. You speak very calmly and make it easy for the audience to follow every step in detail. Keep up the good work.
Thank you. Is there a way to capture user inputs like contact details (email, name etc)? That would be really useful.
Ms. Shweta, your tutorials are superb!! Stands out first from the rest.
Thank you! Cheers!
Hi Shweta - I am planning to follow this tutorial, it looks amazing. I was wondering if you know whether the local data we use is kept local only, or if any of the data gets made public or goes back to OpenAI?
Thanks Maggy, glad you find it useful. The data will go to OpenAI's servers.
What about having the data in a file with real-time embeddings vs. storing the embeddings in a database, for a chatbot that provides information about an application?
Outstanding preparation and presentation. Thanks so much!
Glad you enjoyed it!
Hi Shweta, that was a great tutorial! However, I have a question. Just like you used the custom data from the local disk here, how can we use data from AWS/Elasticsearch? I have a huge database (about 20 million records) which our employees access by querying Elasticsearch, and if I wish to create a custom chatbot trained on that data, how would one achieve that?
Thank you so much Shweta! You got me so passionate on the topic. After completing your tutorial how do I move this into a chatbot that I can bring to my app? I’m stuck
You need a web UI.
@@shweta-lodha Can you drop a tutorial on that? That would be very helpful.
Great video, you are inspiring me to start learning how to code. I am doing this as a project to see if I can follow along with you. Thank you.
Best of luck!
Hi Shweta, does this also accept structured data like CSV or Excel files for working with the data?
Hi Shweta, I tried this code on my end, but vectorIndex.save_to_disk('vectorIndex.json') is giving me an error, so I tried index.storage_context.persist('vectorIndex') instead. This is not creating a vectorIndex.json file on my system; it is creating a folder named vectorIndex.json, inside which it has 4 JSON files: docstore, graph_store, index_store and vector_store. Can you please tell me where I am going wrong?
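That folder layout is expected with the newer persist API: the index is saved as a directory of JSON files rather than one vectorIndex.json. A small sketch to sanity-check such a folder (file names as observed in this era of llama_index; verify against your own output):

```python
# Check that a directory looks like a persisted llama_index storage folder.
import os
import tempfile

EXPECTED = {"docstore.json", "index_store.json",
            "vector_store.json", "graph_store.json"}

def looks_like_persist_dir(path: str) -> bool:
    """True if `path` contains the JSON files the persist API writes."""
    return os.path.isdir(path) and EXPECTED.issubset(set(os.listdir(path)))

# Demo with a fake persist folder:
with tempfile.TemporaryDirectory() as d:
    for name in EXPECTED:
        open(os.path.join(d, name), "w").close()
    print(looks_like_persist_dir(d))  # True
```

Loading it back goes through StorageContext.from_defaults(persist_dir=...) plus load_index_from_storage, as in the updated code earlier in this thread.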
@shweta-lodha, wonderful video and article, thank you. Please keep up the good work. One question: is there a way to implement the same solution with a web-based frontend - Angular, React, etc.?
Thank you so much Naveen, glad you liked it. I was thinking to try this out but couldn't find API endpoints. Please let me know, if you come across any such documentation.
For me, this code was throwing an error. If you update the code by first importing ServiceContext from gpt_index (along with the other stuff you were already importing) and then add the following lines:

#load data
docs = SimpleDirectoryReader(path).load_data()
service_context = ServiceContext.from_defaults(llm_predictor=llmPredictor, prompt_helper=PromptHelper)

#create vector index
vectorIndex = GPTSimpleVectorIndex.from_documents(documents=docs, service_context=service_context)

it should solve your issue.
P.S.: I used this code from another one of your videos :))
Please refer to my How-To-Fix video. All this happened because of breaking changes in the API 😊
A tutor that is looking for the result of a func without calling the func :))))))!!!!!
Haha 😃
ImportError                               Traceback (most recent call last)
Cell In[29], line 1
----> 1 from gpt_index import SimpleDirectoryReader, GPTListIndex, GPTSimpleVectorIndex, LLMPredictor, PromptHelper
      2 from langchain import OpenAI
      3 import sys

What should I do for this error???
Please refer to my how-to-fix video, entitled "breaking changes". Things are broken because of API enhancements.
I have not found such a detailed explanation of an OpenAI-based chatbot anywhere else, thank you for sharing. Can you please make a video about how I can feed my NLP-based ML model to create a chatbot based on ChatGPT?
Yes, soon
This looks like a brilliant tutorial, thank you. Please excuse the silly question, but where are you editing your code? I opened IDLE, but it must be some other editor/console? Thank you again.
I am using Visual Studio Code (VS Code). You can install it, as it is very easy to use. Please make sure to install the Python and Jupyter extensions from the Extensions panel.
Amazingly helpful, thank you!
Very good tutorial. One question: if I have logs which contain sensitive data like IPs etc., and I feed those logs to the LLM, will my data move out of my system to the LLM provider's server? If so, is there a security threat? I have a presentation to give on using LLMs at my company, and this question is sure to come up; any feedback would be appreciated.
Thanks. Yes, it will go to the model provider's server, and it could be a security threat. If privacy is your concern, then my suggestion would be to use Azure OpenAI.
Hello Shweta Lodha, I have a couple of questions. Firstly, do you have any ideas on how we can reduce the cost of tokens? It would be really helpful if you could make a video explaining any potential solutions. Additionally, as a learner, I'm facing issues with expired APIs. Do you have any suggestions on how I can resolve this problem? Thank you in advance for your help.
I have already created a video on the cost factor, please check my playlist. I didn't understand - what do you mean by expired API?
@Shweta this is brilliant! As usual top notch, like all of your other videos; thank you so much for taking the time to do this. One question here: can you please explain the differences in the parameters if the same had to be achieved through Azure OpenAI, with openai.api_base, openai.api_type, openai.api_version, deployment_name - I have been trying to meddle with this but no luck so far..
Sure, I'll cover this in my Azure OpenAI series, which I started recently :)
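For reference while that series is in progress, these are the extra settings Azure OpenAI typically needs with the pre-1.0 openai SDK. The resource name, deployment name, and API version below are placeholders; take the real values from your Azure portal:

```python
# Config sketch for Azure OpenAI with the pre-1.0 openai Python SDK.
# All <angle-bracket> values and the version string are placeholders.
import os
import openai

openai.api_type = "azure"
openai.api_base = "https://<your-resource>.openai.azure.com/"  # endpoint URL
openai.api_version = "2023-05-15"                              # example version
openai.api_key = os.getenv("AZURE_OPENAI_API_KEY")

# Unlike the public API, Azure calls address a *deployment* you created
# in the portal, not a raw model name - e.g. pass your deployment name
# (such as engine="<my-davinci-deployment>") in completion calls instead
# of model="text-davinci-003".
```

The main conceptual difference is that last point: model selection happens when you create the deployment in Azure, and your code references the deployment name.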
Hi Shweta, you did a great job. Can you please help me with how to set up the environment in which you are coding?
Sure. Drop me an email and we will connect
What is the token for? Is the number of questions you can ask the bot limited? Is there a way to make it unlimited?
This is really useful, Shweta! Thank you so much for making such an awesome content.
Glad you think so!
There is no longer a GPTKeywordVectorIndex. There is GPTSimpleKeywordTableIndex() - are they the same?
Thanks a lot, great video. I just want to know: if all my data is stored in SharePoint, how can I feed it into the same mechanism you used? Thanks again for this great tutorial.
Thank you for this detailed tutorial. You asked 2 questions in it; can you please share how many OpenAI credits were used in this complete operation? That would be really helpful.
It should not be that much. I'm uncertain about the exact number, as I sent multiple requests around the same time frame :(
I have implemented the code and it works perfectly, thanks for that. I have a question: it only provides answers from the context. Is gpt_index all about context? Can it not give general information, like who Trump is? If there is a way, please help me with it in this code.
This video is about how to get answers based on context. If you want general info, then you can achieve that simply by using OpenAI directly.
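A tiny routing sketch illustrates the split: document-related questions go to the index, everything else straight to the model. The keyword rule here is deliberately naive and purely illustrative:

```python
# Route questions either to the context-bound index or to a general
# model call. The keyword heuristic is a placeholder; real routers use
# embeddings or an LLM classifier.

def route(question: str, doc_keywords) -> str:
    """Return 'index' for document questions, 'general' otherwise."""
    q = question.lower()
    return "index" if any(k in q for k in doc_keywords) else "general"

print(route("What is the best plan?", {"plan", "pricing"}))  # index
print(route("Who is Trump?", {"plan", "pricing"}))           # general
```

In the "index" branch you would call the query engine from the tutorial; in the "general" branch, a plain OpenAI completion call with the question as the prompt.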
Hi, thanks for the great video. I'm a beginner. How do I proceed now that gpt_index has been replaced by llama_index? I'm getting the error "No module named 'gpt_index'
Yes, there are changes. Please refer my How-To-Fix video, it has the fix.
Hello, this tutorial is really amazing, but I was trying to replicate the work and got an error when executing the function createVectorIndex, saying that __init__() in GPTSimpleVectorIndex got an unexpected keyword argument 'documents'. Did anyone get the same issue?
Please check my How-To-Fix video
Hi Shweta, very useful tutorial, and I followed it exactly, but I'm always getting the response None for any question, even though I kept the document in the same place you mentioned. Could you please help me out with what the reason could be?
Please give the complete/absolute path and try it. If it still doesn't work, then the issue is not with the input file.
Hello, and thanks for the video. Very useful. Is it possible to use a Sphinx-generated website (essentially a handful of HTML pages) as the resource documents instead of a plain txt file?
Hi Mam, does this still work today? Does gpt_index no longer have these functions?
Perhaps not, as a lot of things have changed recently in terms of the SDK and API. Many functions have been renamed and moved around.
@@shweta-lodha Thank you for the reply. Would you have any recent video on the same topic? Or would you know if someone has covered this topic in a simple way, like you did here?
Any plans for coming up with part 2 for this with custom data that needs to be continuously indexed in the background?
Will plan soon. Thanks Prudhvi for the pointer :)
Very good video thank you. I must have blinked and it was over! Which specific function uses langchain?
LLMPredictor
Thank you for this wonderful video🎉. I have a question from when I was trying it: I ran the code to create the vector index, but I cannot find it on the OS disk. There is no error😂
If you didn't provide a complete path, then it must be in your current directory - the one from where you are running your script.
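A quick way to remove the ambiguity is to build an absolute path before persisting. Standard library only; the folder name is illustrative:

```python
# Anchor the persist directory to the current working directory so you
# always know where the index landed.
import os

persist_dir = os.path.join(os.getcwd(), "storage")
print(os.path.isabs(persist_dir))  # True
# Pass persist_dir to storage_context.persist(persist_dir=persist_dir)
# and to StorageContext.from_defaults(persist_dir=persist_dir).
```

Printing persist_dir once before saving also tells you exactly which folder to look in.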
Hi Shweta,
Your tutorial is really helpful, as is your knowledge of OpenAI. I have also emailed you; please reply.
I'll try to respond at the earliest.
Very nicely presented, it's a marvel.
Thanks!
Hi Shweta, this tutorial is amazing! I have one question after running the bot, on my OpenAI usage I am getting text-davinci requests as well as text-embedding-ada-002-v2. Any thoughts on why I am getting the davinci requests?
I didn't understand your question completely. Your embeddings would be using the text-embedding-ada-002-v2 model. Isn't that what you want?
@@shweta-lodha Thank you for your reply. My usage looks like this does this make more sense?
10:55 PM Local time: Apr 26, 2023, 8:55 AM
text-davinci, 2 requests
3,805 prompt + 75 completion = 3,880 tokens
10:55 PM Local time: Apr 26, 2023, 8:55 AM
text-embedding-ada-002-v2, 1 request
8 prompt + 0 completion = 8 tokens
The bot is also able to answer questions about topics that I have not fed it, so I think it is accessing sources other than what I provided.
Hi Shweta, amazing job, hope you can help me: when running vectorIndex = create_index('Knowledge'), I get an error message: "Output exceeds the size limit. Open the full output data in a text editor". Maybe you know why?
Hopefully the 'Knowledge' directory exists in the same path from which you are running your script.
Thank you, Shweta, for this easy and well-formed tutorial! Is there a way to connect this bot to a WordPress site? I have created the custom bot explained in this tutorial using a dataset from a knowledge-base site. Now I'd like to connect that chatbot to the site itself so that users can chat there directly, in addition to seeing articles in the standard way.
Thanks and glad you liked it. Did you try plug-in?
Hello, Shweta Lodha! Great job. I really love your videos. And I have a quick question: Do I have to have a paid GPT plan in order for the code to work?
Not necessary. It would work with your free account too, assuming you have the required credits.
@@shweta-lodha Thank you so much, Shweta. But I will get a subscription today anyhow; I don't want to run out of credits. With love from Tijuana. You are great!
Hi Shweta Lodha! First, thanks a lot for the tutorial! :)
I have a problem with an error. Can you help me?
I wrote the same code and used the same data, but when I try to run it, this error appears:
TypeError: __init__() got an unexpected keyword argument 'documents'
I printed the variable and it contains the book text. The error is on this line:
vectorIndex = GPTSimpleVectorIndex(documents=docs,llm_predictor=llmPredictor,prompt_helper=prompt_helper)
I looked for it on the internet but I didn't understand why there's a problem with the 'documents' argument...
:/
I solved it! :D
For anyone who had the same problem I had, here is my function:
def createVectorIndex(path):
    max_input = 1024
    tokens = 256
    chunk_size = 600
    max_chunk_overlap = 20
    #define LLM
    llm_predictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001"))
    #load data
    docs = SimpleDirectoryReader(path).load_data()
    service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, chunk_size_limit=512)
    vectorIndex = GPTSimpleVectorIndex.from_documents(docs, service_context=service_context)
    vectorIndex.save_to_disk('vectorIndex.json')
    return vectorIndex
The catch is that I removed this line:
prompt_helper = PromptHelper(max_input, tokens, max_chunk_overlap, chunk_size_limit=chunk_size)
Could that be a problem? It worked here without it, and with other texts too.
If it is a problem, could anyone explain that part of the code?
Please have a look at the documentation in case something has changed. At the time of this video, it was all good.
You rock! I just created a video on how to fix this :)
I appreciate your work. Just wondering, does this method work for extracting specific numbers? For example, I want to extract prices for multiple products.
Yes, I tried this and in most of the cases, I got accurate output.
@@shweta-lodha Thank you so much for the reply.
I'm also getting this: "Output exceeds the size limit. Open the full output data in a text editor"
This is because you have restricted the output length in Jupyter. You can open it in Notepad to see the full error. Not a big deal 👍🏻
HI! This is a very helpful tutorial. I have a question:
How would you bring this to a website? I mean, creating a chat website where you can ask questions and the bot answers based on your custom data. Is it possible?
Thanks in advance :)
Glad you find it useful. You can either create a website, or create a widget and plug it into your existing website.
That's a great video thanks for sharing this. I have a question if you don't mind. How different is this method rather than using Open AI APIs for chat completion, embedding and completion? Thanks.
Here you can save the vectored index locally and re-read it.
@@shweta-lodha Thanks a lot for the prompt reply. One last question: under the hood, the method you explained uses the text-embedding-ada-002 model for embedding and text-davinci-003 for completion, right?
Yes, you're right :)
@@shweta-lodha thanks for the response and clarification 😁
@Shweta, awesome video, and thanks for sharing. I have data in a CSV with many text columns and many rows, and I want to build a chat application on top of it. Can you please let me know how I can implement this?
You can read the CSV into memory, chunk it, and you're good to go.
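A minimal, hedged sketch of that idea using only the standard library (the sample data, separator, and chunk size are placeholder assumptions; in practice each chunk would be wrapped in a Document and fed to the indexer):

```python
import csv
import io

def rows_to_chunks(rows, max_chars=200):
    """Join CSV rows into text chunks of roughly max_chars characters."""
    chunks, current = [], ""
    for row in rows:
        line = " | ".join(row)  # flatten one row into a line of text
        if current and len(current) + len(line) > max_chars:
            chunks.append(current)
            current = ""
        current = (current + "\n" + line) if current else line
    if current:
        chunks.append(current)
    return chunks

# Placeholder data standing in for a real products CSV file
sample = "product,price\nWidget,9.99\nGadget,19.99\n"
rows = list(csv.reader(io.StringIO(sample)))
print(rows_to_chunks(rows, max_chars=40))
```

Each resulting chunk is plain text, so the rest of the pipeline (embedding, indexing, querying) stays the same as with a .txt file.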
Hey, any tips on how to fine-tune a model based on a very large PDF document without the "\n" separator to split prompt/completion? I thought maybe have a script break it down at every question mark? Or is there some other way?
I'm going to publish a video on this today. Stay tuned!
Thanks a lot. Can we use llama_index instead of gpt_index?
Yes, you have to, as gpt_index is deprecated. You can refer to my GPT_Index breaking-changes video for that.
Could this learn from new, unlabeled questions provided by the user? Is this a trainable chatbot that could learn something new every time the user asks a new question?
No
Thank you very much. I really hope you will have more useful videos like that.
Thank you, I will
Hi Shweta, please tell us which extensions you used in VS Code for this video.
Jupyter
Amazing tutorial, thank you!!
Hi, I did it with success, really thank you! However, is it possible to start a chat like in the last part of the video, save where it stopped, and come back to it at another moment?
Glad you find it useful. You need to save your chat history and refer to it whenever you start your conversation next time.
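One simple way to do that, sketched with the standard library only (the file name and message format are assumptions; on the next run you would prepend the loaded history to your prompt):

```python
import json
from pathlib import Path

HISTORY_FILE = Path("chat_history.json")  # placeholder file name

def save_history(history):
    """Persist the conversation as a list of {"role", "content"} dicts."""
    HISTORY_FILE.write_text(json.dumps(history, indent=2))

def load_history():
    """Reload the saved conversation, or start fresh if none exists."""
    if HISTORY_FILE.exists():
        return json.loads(HISTORY_FILE.read_text())
    return []

history = load_history()
history.append({"role": "user", "content": "What is the best plan?"})
save_history(history)

print(load_history()[-1]["content"])  # "What is the best plan?"
```

When the user comes back, load_history() restores the earlier turns so they can be included as context in the next query.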
awesome !! very inspiring, thanks a lot for your work !
Thank you for this great video!
Ma'am, I am from India. After doing tons of research on the ChatGPT API, this is the best! I am working on a Linux platform and have Jupyter Notebook; will it work in that environment?
Yes, it should.
Exactly what I need! thank you
Hi @shweta, can this bot integrate with any database?
Hi Shweta, this is a very helpful tutorial.
I tried this code, but after asking a question it does not respond. I waited for 15 minutes and still got no response.
Are you using VS Code? If so, please check your terminal, your .py file, or the command box.
wonderful. you made it look so simple.
Thank you! Cheers! If you can't make things simple, it means you yourself didn't understand :)
Thank you for the tutorial, it was great! How would I deploy this app to the internet onto my own custom domain?
You need to create a web app for that 😊
Hi, I need your help. I have followed the exact same steps but am facing the issue below.
I ran pip install gpt_index and got the same response as shown in your video, but when I run from gpt_index import SimpleDirectoryReader, I get the error: No module named 'gpt_index'.
I tried uninstalling, re-installing, and checking with ChatGPT, but nothing worked.
Please help!
Please check my most recent How-To-Fix video. It contains the solution
I got this error: "TypeError: BaseGPTIndex.__init__() got an unexpected keyword argument 'documents'". Hoping to get some assistance.
Please refer my How-To-Fix video
Thank you. Very useful.
I am getting this error while running the code:

Output exceeds the size limit. Open the full output data in a text editor
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Cell In[17], line 1
----> 1 vectorIndex = createVectorIndex('Chatbot')
Cell In[14], line 16, in createVectorIndex(path)
     13 docs = SimpleDirectoryReader(path).load_data()
     15 #create vector index
---> 16 vectorIndex = GPTSimpleVectorIndex(documents=docs,llmPredictor=llmPredictor,prompt_helper=prompt_helper)
     17 vectorIndex.save_to_disk('vectorIndex.json')
     18 return vectorIndex
The APIs have changed a bit since I published this video. Please refer to the updated documentation; I am sure it will be easy to fix.
This was awesome, thank you so much!
Glad you enjoyed it!
Getting this error on the PromptHelper() line:
chunk_overlap_ratio must be a float between 0 and 1
Yes, due to recent API changes you need to pass a value between 0 and 1 here.
Is my own data imported to openai system in this case? I don't want to breach the company's data confidentiality code.
Yes
I appreciate your video very much!
Glad it was helpful! Stay tuned for more...
How do I restrict the bot to only answer based on the data provided, or make it return an error if the question is outside the available data?
For this, you need to tweak your prompt.
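One common way to do that is to constrain the prompt text itself. A minimal sketch using pure string formatting (the exact wording is an assumption, not the tutorial's prompt; you would pass something like this as a custom prompt template to the query engine):

```python
# Hedged example of a restrictive prompt template.
TEMPLATE = (
    "Answer the question using ONLY the context below. "
    "If the answer is not in the context, reply exactly: I don't know.\n\n"
    "Context:\n{context}\n\nQuestion: {question}\nAnswer:"
)

def build_prompt(context, question):
    """Fill the template with retrieved context and the user's question."""
    return TEMPLATE.format(context=context, question=question)

print(build_prompt("The best plan is Plan B.", "What is the best plan?"))
```

With an instruction like this, questions outside the supplied data should come back as "I don't know" instead of an answer drawn from the model's general knowledge.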
Since gpt-index was replaced by llama-index, I am getting this error during creation of the vector index. Wondering if anyone else is facing the same:
INFO:openai:error_code=404 error_message='Resource not found' error_param=None error_type=None message='OpenAI API error received' stream_error=False
Please check out my How-To-Fix videos. Perhaps they can help you 😊
Hi, I just want to ask: will the data I feed in be exposed to the public, or to OpenAI? Is this safe for business? Thank you.
It would be exposed to OpenAI. If you are concerned about the security aspect, I would recommend you check out Azure OpenAI.
Does it have to be a txt file? What if I have a CSV data table? Also, does it have to be in a directory?
Do you think I should have the script convert the CSV to a text file first?
It is not mandatory to use a text file. You can use other file types too, provided you are able to read them and convert the text to vectors. There's no need to put a single file in a directory, but in that case you have to look for a different function.
Hi thank you for a great video. Is there a way we can combine the code and add gradio ui ?
Yes, you can.
How do we load multiple data files, and will the script and OpenAI remember the prior conversation?
For memory, you need to change this implementation a bit
ValueError: One of documents or index_struct must be provided. What do I need to do about this error?
It looks like it is not able to generate the JSON properly. Please validate your docs.
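A quick way to check whether the persisted index file is at least well-formed JSON (stdlib only; the file name 'vectorIndex.json' comes from the tutorial's save_to_disk call):

```python
import json

def is_valid_json_file(path):
    """Return True if the file exists and contains parseable JSON."""
    try:
        with open(path) as f:
            json.load(f)
        return True
    except (OSError, json.JSONDecodeError):
        return False

print(is_valid_json_file("vectorIndex.json"))
```

If this returns False, the index was never written correctly, which points back at the document-loading step rather than the query step.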
Hey, so I'm wondering how I can create an interface once I have made my language model.
You can go with Flask
Thanks for the amazing video. Can it read a PDF file instead of txt?
Yes, you can! But there is a different class/API/function for doing so.
Hi, it seems I can't find VectorSimpleIndex. Has it been replaced by VectorStoreIndex?
Please check my latest video: How to Fix[GPT-Index]: Fixing GPT-Index Related Broken Pieces
@@shweta-lodha Because it's llama_index now, right?
Absolutely!
I am a beginner.
When using ChatGPT, do I have to get the API key from it, or can I get it from anywhere?
Please get it from Openai.com
I am having issues installing gpt_index. I installed it using pip install, but when I do a pip list I see gpt-index (note the hyphen), and I then cannot import gpt-index. Has anyone faced this problem? Thank you.
gpt_index doesn't exist anymore; it has been renamed. Please check my How-To-Fix video on the breaking changes of gpt_index.
@@shweta-lodha Thanks Shweta. Just saw the other video. Much appreciated.
Shweta, please note that llama_index does not have GPTSimpleVectorIndex;
it has been changed to GPTVectorStoreIndex.
Can you explain how you set up your IDE here?
I am using VS Code and installed extensions for python and Jupyter
Can you help in making a similar bot that provides accurate, step-by-step solutions to accounting questions?
100% accuracy can’t be guaranteed, this is AI 😊
Is it possible to create a simple page with a search box that sends the prompts?
Yes, it can be done
Are there any other vector index options? I'm not getting the expected results.
There are a few vector databases you can use, like Pinecone, Redis, etc.
For this line:
vectorIndex = GPTSimpleVectorIndex(documents=docs,llm_predictor=llmPredictor,prompt_helper=promptHelper)
I am getting the following error:
__init__() got an unexpected keyword argument 'documents'. Any tips?
Please have a look at the documentation in case something has changed. At the time of this video, it was all good.
@@shweta-lodha
You are right, there appears to be a change in the new version of gpt_index. This modified function worked for me. Thanks for this amazing tutorial; I was looking for something like this for a long time.
def create_index(path):
    max_input = 4096
    tokens = 200
    chunk_size = 600 #for LLM, we need to define chunk size
    max_chunk_overlap = 20
    #define prompt
    promptHelper = PromptHelper(max_input,tokens,max_chunk_overlap,chunk_size_limit=chunk_size)
    #define LLM - there could be many models we can use, but in this example, let's go with an OpenAI model
    llmPredictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001",max_tokens=tokens))
    service_context = ServiceContext.from_defaults(llm_predictor=llmPredictor, prompt_helper=promptHelper)
    #load data - it will take all the .txt files, if there is more than one
    docs = SimpleDirectoryReader(path).load_data()
    #create vector index
    vectorIndex = GPTSimpleVectorIndex.from_documents(documents=docs,service_context=service_context)
    vectorIndex.save_to_disk('vectorIndex.json')
    return vectorIndex
Thanks, Darth. A lot of people are getting this error, hence I created a new video explaining this change. Cheers!