Create ChatBot Based On The Data Feed By You - GPT-Index | OpenAI | Python

Поделиться
HTML-код
  • Опубликовано: 23 ноя 2024

Комментарии • 311

  • @njorogekamau3820
    @njorogekamau3820 Год назад +38

    This tutorial truly stands out from the rest! After struggling with coding along other RUclips tutorials on GPT-index and langchain, I finally stumbled upon this gem, and behold, my code worked like a charm. Thanks for sharing your expertise and making it easy to follow. You're a lifesaver!

  • @glebp7020
    @glebp7020 Год назад +13

    Thank you, Shweta. I was able to get something working based on your code. Please note that lots of libraries are now outdated (gpt_index is now llama_index). My complete code (works 100%!):
    from langchain import OpenAI
    import sys
    import os
    from llama_index import SimpleDirectoryReader,GPTListIndex,GPTVectorStoreIndex,LLMPredictor,PromptHelper,ServiceContext
    from llama_index import StorageContext, load_index_from_storage
    def create_index(path):
    max_input = 4096
    tokens = 200
    chunk_size = 600 #for LLM, we need to define chunk size
    max_chunk_overlap = 20
    prompt_helper = PromptHelper(max_input,tokens,max_chunk_overlap,chunk_size_limit=chunk_size) #define prompt
    llm_predictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001",max_tokens=tokens)) #define LLM
    docs = SimpleDirectoryReader(path).load_data() #load data
    service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, prompt_helper=prompt_helper)
    vectorIndex = GPTVectorStoreIndex.from_documents(
    docs, service_context=service_context
    )
    vectorIndex.storage_context.persist(persist_dir="storage")
    return vectorIndex
    def answerMe():
    storage_context = StorageContext.from_defaults(persist_dir="storage")
    index = load_index_from_storage(storage_context)
    query_engine = index.as_query_engine()
    q = "What is the best plan?"
    print(q)
    print('------------')
    response = query_engine.query(q)
    print(response)
    #create_index('data')
    answerMe()

    • @shweta-lodha
      @shweta-lodha  Год назад

      Thanks Gleb. I have created few videos on how to fix these broken ones and few more are on the way. Please feel free to let me know, if anything else is not working. I will try to provide solutions at my earliest convenience. Thanks once again.

    • @stalinamirtharaj1353
      @stalinamirtharaj1353 Год назад +2

      Great bro... It was helpful to test it out.

    • @sumankalyanghosh645
      @sumankalyanghosh645 Год назад +1

      @@shweta-lodha Currently I am getting RateLimitError with whatever account I try. Is there a way I can bypass this and test out

    • @sumankalyanghosh645
      @sumankalyanghosh645 Год назад

      With the above code I have tried as well mentioned by Gleb.

  • @rayfellers
    @rayfellers 3 месяца назад

    As someone just learned how to use GPT-4o Mini to write python code, your explanations are well done.

  • @deepaksartape9502
    @deepaksartape9502 Год назад +4

    Please cover UI part as well in one of the coming video. Thank you so much in advance.

  • @ddsharma2327
    @ddsharma2327 Год назад +3

    Question about execution flow: Hi Sweta - Thanks for a very helpful tutorial. Trying to wrap my head around the execution flow. For example, createvectorIndex probably sends the document to openAI servers, the servers return embeddings that are then stored locally. During answerMe - is the prompt sent back to openAI for embeddings? Does answerMe again send document embeddings back to openAI servers? How much is done in the local process (on PC) vs how much is done by openAI servers? Thanks.

    • @shweta-lodha
      @shweta-lodha  Год назад

      One way to understand this is by disabling the internet connection once you have embedding stored on your local device and then try to call answerMe. Doing this will clear all your doubts 😊

  • @SmritiChaudhary-k5b
    @SmritiChaudhary-k5b Год назад +3

    Hii,
    I am facing this error
    RetryError: RetryError[] while calling answerMe function, can you please help me out .

  • @mangesh29506
    @mangesh29506 Год назад

    Your instructions are really good, thank you so much Shweta Ji

  • @LEARNINGS-Amit
    @LEARNINGS-Amit Год назад +3

    Hi Shweta, Awesome video, great learning. Thanks. Just one concern - when we grant access to ChatGpt to our custom data stored in our machine, then is there a risk that the data can be copied/ used by users of ChatGPT or Open AI or otherwise?

  • @DandaFundas
    @DandaFundas Год назад +2

    Thank you Swetha. It can’t get any better than your explanation with show and tell 👍

  • @congtycophanxaydungo.m.d1735
    @congtycophanxaydungo.m.d1735 Год назад +1

    thanks for your contribution, it's easy to understand for a beginner like me. You can continue to make the next video to guide the question and answer continuously like chatgpt.

  • @Spaz42
    @Spaz42 Год назад +5

    Thank you so much for this! You made something very intimidating to me super easy to understand. I am very grateful for the time and effort your put into this video. Thanks a lot!

  • @LearnWithSibin
    @LearnWithSibin Год назад +5

    I noticed that in the implementation shown in the video, the GPT model is being called every time a user prompt is received, and this can be expensive in a real-world scenario where the application is serving multiple users concurrently. Each request to the GPT model requires a certain number of OpenAI tokens, and this can quickly add up and become expensive.

    • @shweta-lodha
      @shweta-lodha  Год назад +4

      Indeed! For production scenarios, you have to extensively used vector database

  • @mikhailtrufanov9356
    @mikhailtrufanov9356 Год назад +4

    Hi, thank you for the video! I have tried this code to answer questions based on data about a very specific product. It answers well, however, it answers general questions as well (such as "What is Earth?") having no information at all about this in the files provided. How can I make this code answer based on the information that I provided only?

    • @shweta-lodha
      @shweta-lodha  Год назад +2

      Thanks for watching Mikhail. Give a try to my this video, it gives better results - Use Your Locally Stored Files To Get Response From GPT like ChatGPT | Python

  • @usmanahmed1073
    @usmanahmed1073 Год назад

    Very good explanation. You speak very calm and make audience to follow all step in detail. Keep good work

  • @fascinatingstories101
    @fascinatingstories101 5 месяцев назад +1

    Thank you. Is there a way to capture user inputs like contact details (email, name etc)? That would be really useful.

  • @MoturuPraveenBhargav
    @MoturuPraveenBhargav Год назад

    Ms. Shweta, your tutorials are superb!! Stands out first from the rest.

  • @maggynewnham
    @maggynewnham Год назад +4

    Hi Shweta - I am planning to follow this tutorial, it looks amazing, I was wondering if you know if the local data we used is kept and is local only or if any of the data gets make public or goes back to OpenAI?

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      Thanks Maggy, glad you find it useful. Data will go to OpenAI servers

  • @M-ABDULLAH-AZIZ
    @M-ABDULLAH-AZIZ Год назад +1

    having data in a file and real time embeddings vs embeddings in a db for chatbot for an application (provides information about an application)?

  • @jeffersonhope2668
    @jeffersonhope2668 Год назад +2

    Outstanding preparation and presentation. Thanks so much!

  • @harshilpatel323
    @harshilpatel323 Год назад +1

    Hi Shweta, that was a great tutorial! However, I have a question. Just like you used the custom data from the local disk here, How can we use the data from aws/elasticsearch?. I have a huge database (About 20 million records) which our employees access via querying on elasticsearch, and If I wish to create a custom chatbot trained on that data, how would one achieve that?

  • @giadavolpin3564
    @giadavolpin3564 Год назад +1

    Thank you so much Shweta! You got me so passionate on the topic. After completing your tutorial how do I move this into a chatbot that I can bring to my app? I’m stuck

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      You need web UI

    • @fakeshirin4642
      @fakeshirin4642 Год назад +1

      @@shweta-lodha can you drop a tutorial on that because that would be very helpful

  • @atony229
    @atony229 Год назад

    Great Video, you are inspiring me to start learning how to code. I am doing this as a project to see if I can follow along withyou. Thank you.

  • @rajparmar6420
    @rajparmar6420 8 месяцев назад

    hi shweta, does this take structured data like csv or excel for manipulate any data?

  • @SanjanaMagar-d5q
    @SanjanaMagar-d5q Год назад

    hi Shweta, i tried this code on my end but in { vectorIndex.save_to_disk('vectorIndex.json') } is giving me an error so i tried { index.storage_context.persist('vectorIndex') } this is not creating a vectorIndex.json file on my system its creating a folder name vectorIndex.json inside which i has 4 json file docsstore, graph_store, index_store and vector_store.. can you pls tell me where i am going wrong

  • @naveennoelj
    @naveennoelj Год назад +2

    @shweta-lodha, wonderful video and article. Thank You. Please keep up the good work. One qs: is there a way to implement the same solution using a web based solution - angular, react etc.

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      Thank you so much Naveen, glad you liked it. I was thinking to try this out but couldn't find API endpoints. Please let me know, if you come across any such documentation.

  • @mehdirizvi3305
    @mehdirizvi3305 Год назад +1

    For me, this code was throwing an error. If you update the code by first importing ServiceContext from gpt_index (along with the other stuff you were already importing) and then update the code by adding the following lines,
    '-----#load data
    docs = SimpleDirectoryReader(path).load_data()
    service_context=ServiceContext.from_defaults(llm_predictor=llmPredictor,prompt_helper=PromptHelper)

    #create vector index
    vectorIndex=GPTSimpleVectorIndex.from_documents(documents=docs,service_context=service_context) ---
    it should solve your issue.
    P.S: I used this code from another one of your videos :))

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      Please refer my How-To-Fix video. All this happened because of breaking changes in API 😊

  • @rezahamzeh3736
    @rezahamzeh3736 Год назад

    A tutor that is looking for the result of a func without calling the func :))))))!!!!!

  • @shivampatel4179
    @shivampatel4179 Год назад +1

    ImportError Traceback (most recent call last)
    Cell In[29], line 1
    ----> 1 from gpt_index import SimpleDirectoryReader, GPTListIndex, GPTSimpleVectorIndex, LLMPredictor, PromptHelper
    2 from langchain import OpenAI
    3 import sys
    What should I do for this error???

    • @shweta-lodha
      @shweta-lodha  Год назад

      Please refer my how to fix video entitled as breaking changes. Things are broken because of API enhancements

  • @satishkumar-ir9wy
    @satishkumar-ir9wy Год назад

    I have not found such a detailed explanation of Open AI based Chat bot, Thankyou for Sharing.
    Can you please make a video about how i can feed my NLP based ML model to create a Chat bot based on ChatGPT.

  • @pauldad1
    @pauldad1 Год назад +2

    This looks like a brilliant tutorial thank you. Please excuse the silly question, but where are you editing your code, I opened IDLE but it must be some other editor/console? Thank you again.

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      I am using Visual Studio Code (VS Code). You can install it as it very easy to use. Please make sure to install Python and Jupyter extension from Extensions panel.

    • @pauldad1
      @pauldad1 Год назад

      Amazingly helpful, thank you!

  • @sqlsql4304
    @sqlsql4304 Год назад +1

    Very good tutorial, one question if have some logs which contains sensitive data like ip etc, if I feed the logs to the llm will my data move out of my system to the llm model provider's server. If so is there a security threat. I have a presentation to make on using llm to my company and for sure this question will rise, any feedback will be appreciated

    • @shweta-lodha
      @shweta-lodha  Год назад

      Thanks. Yes, it will go to model provider’s server and it could be a security threat. If privacy is your concern, then my suggestion would be to use Azure OpenAI

  • @danish1794
    @danish1794 Год назад +1

    Hello
    Shweta Lodha, I have a couple of questions. Firstly, do you have any ideas on how we can reduce the cost of tokens? It would be really helpful if you could make a video explaining any potential solutions. Additionally, as a learner, I'm facing issues with expired APIs. Do you have any suggestions on how I can resolve this problem? Thank you in advance for your help.

    • @shweta-lodha
      @shweta-lodha  Год назад +3

      I have already created a video on cost factor, please check my playlist. I didn’t understand- what do you mean by expired API

  • @SGSaisreenath
    @SGSaisreenath Год назад +2

    @Shweta this is brilliant! as usual top notch like any of your other videos, thank you so much for taking time doing this. One question here, can you please provide the differences in the parameter if the same had to be achieved through Azure Open AI with openai.api_base, openai.api_type, openai.api_version, deployment_name - I have been trying to meddle with this but no luck so far..

    • @shweta-lodha
      @shweta-lodha  Год назад

      Sure, I'll cover this in my Azure OpenAI series, which I started recently :)

  • @AliRaza-u3u1m
    @AliRaza-u3u1m Год назад +1

    Hi Shweta, You did a great job. Can you please help me how to set that environment on which your are doing code.

    • @shweta-lodha
      @shweta-lodha  Год назад

      Sure. Drop me an email and we will connect

  • @darkmode4467
    @darkmode4467 2 месяца назад

    What is the token for? Is it limited to ask the bot? Is there a way it can be unlimited?

  • @faisalsaddique3323
    @faisalsaddique3323 Год назад +1

    This is really useful, Shweta! Thank you so much for making such an awesome content.

  • @spicytuna08
    @spicytuna08 Год назад

    there is no longer GPTKeywordVectorIndex. There is GPTSImpleKeywordTableIndex()
    are they the same?

  • @mneama4383
    @mneama4383 Год назад

    Thanks a lot great video , just want to know if all my data is stored in sharepoint how can i take them all into the same mechanizm u did thanks again for this great tutorial

  • @Priyanka-js8zl
    @Priyanka-js8zl Год назад

    Thankyou for this detailed tutorial, so you asked 2 question from it, can you please share how much openai credits does it used in this complete operation?
    that would be really helpful

    • @shweta-lodha
      @shweta-lodha  Год назад

      It should not be that much. I'm uncertain about the exact number as I sent multiple requests around same time frame :(

  • @shahbazzaman4143
    @shahbazzaman4143 Год назад +1

    I have implemented code, works perfectly, Thanks for that, I have question, it is just providing answer from context, Is gpt_index is all about context ? can not get general information like, who is trump or any other information? if there is way please help me with in this code

    • @shweta-lodha
      @shweta-lodha  Год назад

      This video is about how to get answers based on context. If you want general info, then you can simply achieve that using OpenAI

  • @sriramyapriya4110
    @sriramyapriya4110 Год назад +1

    Hi, thanks for the great video. I'm a beginner. How do I proceed now that gpt_index has been replaced by llama_index? I'm getting the error "No module named 'gpt_index'

    • @shweta-lodha
      @shweta-lodha  Год назад

      Yes, there are changes. Please refer my How-To-Fix video, it has the fix.

  • @narjessabbassi124
    @narjessabbassi124 Год назад +2

    Hello, this tutorial is really amazing but i was trying to replicate the same work but got an error when executing the function createVectorIndex saying that the function __init__() in GPTSimpleVectorIndex got an unexpected keyword argument 'documents' did anyone get the same issue ?

  • @gauravrathi5028
    @gauravrathi5028 Год назад +1

    Hi Shweta, Very useful tutorial and I followed the same but m always getting the response None for any question, However I kept the document at same place where you have mentioned. Could you please help me out, what could be the reason for the same.

    • @shweta-lodha
      @shweta-lodha  Год назад

      Please give complete/absolute path and try it. If still it doesn’t work, then issue is not with the input file

  • @oreos_overdose
    @oreos_overdose Год назад

    Hello and thanks for the video. Very useful. Is it possible to use a sphinx generated website (essentially a handful of html pages) as the resource documents instead of a plan txt file?

  • @syedrizwanali5212
    @syedrizwanali5212 2 месяца назад +1

    Hi Mam,
    Does this work today also?
    Gpt_index does not have any function?

    • @shweta-lodha
      @shweta-lodha  2 месяца назад

      Perhaps not as lot many things have changed recently, in terms of SDK and API. Lot many functions are renamed and moved here and there.

    • @syedrizwanali5212
      @syedrizwanali5212 2 месяца назад

      @@shweta-lodha Thank you for the reply. Would you have any latest video on same topic? Or would you know if someone have created on this topic in simpler way like you had done here.

  • @prudhvi70
    @prudhvi70 Год назад +3

    Any plans for coming up with part 2 for this with custom data that needs to be continuously indexed in the background?

    • @shweta-lodha
      @shweta-lodha  Год назад +2

      Will plan soon. Thanks Prudhvi for the pointer :)

  • @smudgepost
    @smudgepost Год назад

    Very good video thank you. I must have blinked and it was over! Which specific function uses langchain?

  • @liyuanzhu6111
    @liyuanzhu6111 Год назад +2

    Thank you for this wonderful video🎉. I have a question when I was trying it. I run the codes to create vector index,but I cannot find it in the OSDisk. There is no error😂

    • @shweta-lodha
      @shweta-lodha  Год назад

      If you didn't provide complete path, then it must be in your current directory. Current directory is the one, from where you are running your script.

  • @smart-sg5cs
    @smart-sg5cs Год назад +1

    hi shweta
    your tutorial is really helpful
    and your knowledge on open Ai. i have also emailed you please reply

    • @shweta-lodha
      @shweta-lodha  Год назад

      I'll try to respond at the earliest.

  • @bhagatsingh..
    @bhagatsingh.. Год назад

    Very nicely presented its a marvel

  • @EllenSimpson-w9n
    @EllenSimpson-w9n Год назад +1

    Hi Shweta, this tutorial is amazing! I have one question after running the bot, on my OpenAI usage I am getting text-davinci requests as well as text-embedding-ada-002-v2. Any thoughts on why I am getting the davinci requests?

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      I didn’t understand your question completely. Your embeddings would be using text-embedding-Ada-002-v2 model. Don’t you want to use this?

    • @EllenSimpson-w9n
      @EllenSimpson-w9n Год назад

      @@shweta-lodha Thank you for your reply. My usage looks like this does this make more sense?
      10:55 PM Local time: Apr 26, 2023, 8:55 AM
      text-davinci, 2 requests
      3,805 prompt + 75 completion = 3,880 tokens
      10:55 PM Local time: Apr 26, 2023, 8:55 AM
      text-embedding-ada-002-v2, 1 request
      8 prompt + 0 completion = 8 tokens

    • @EllenSimpson-w9n
      @EllenSimpson-w9n Год назад

      The bot is also able to answer questions about topics that I have not fed it. I think that it is accessing other sources than what I have fed it.

  • @Gapxpag
    @Gapxpag Год назад +1

    Hi Shweta, amazing job, hope you can help me: when running vectorIndex = create_index('Knowledge'), I get an error message: Output exceeds the size limit. Open the full output data in a text editor. maybe you know why?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Hope 'Knowledge' directory exists in the same path, from where you are running your script.

  • @IgorBožić-o8n
    @IgorBožić-o8n Год назад

    Thank you Swetha for this easy and well-formed tutorial! Is there a way to connect this bot to a WordPress site? I have created a custom bot explained in this tutorial which uses a dataset from a knowledgebase site. Now I'd like to connect that chatbot to that site so that users can directly communicate there, in addition, to seeing articles in a standard way.

    • @shweta-lodha
      @shweta-lodha  Год назад

      Thanks and glad you liked it. Did you try plug-in?

  • @ralphanthony5198
    @ralphanthony5198 Год назад +1

    Hello, Shweta Lodha! Great job. I really love your videos. And I have a quick question: Do I have to have a paid GPT plan in order for the code to work?

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      No necessary. It would work with your free account too, assuming you have required credits.

    • @ralphanthony5198
      @ralphanthony5198 Год назад +1

      @@shweta-lodha Thank you so much., Shweta. But I will get me a subscription today anyhow. I don't want to ran out of credits. With Love from Tijuana. You are great!

  • @joserodolfobeluzo3100
    @joserodolfobeluzo3100 Год назад +1

    HI Shweta Lodha! First... Thanks a lot for the tutorial! :)
    I have a problem with an error. Can You help me?
    I wrote the same code and use the same data. But when I try to run the code, this error appear:
    TypeError: __init__() got an unexpected keyword argument 'documents'.
    I Printed the variable and it contains the book text... The error is on the line:
    vectorIndex = GPTSimpleVectorIndex(documents=docs,llm_predictor=llmPredictor,prompt_helper=prompt_helper)
    I looked for it on internet but I didn't understand why there's a problem with the arg "documents"...
    :/

    • @joserodolfobeluzo3100
      @joserodolfobeluzo3100 Год назад

      I solved! :D
      For anyone that had the some problem I had, here is my function:
      def createVectorIndex(path):
      max_input = 1024
      tokens = 256
      chunk_size = 600
      max_chunk_overlap = 20

      #define LLM
      llm_predictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001"))
      #load data
      docs = SimpleDirectoryReader(path).load_data()
      service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, chunk_size_limit=512)
      vectorIndex = GPTSimpleVectorIndex.from_documents(docs, service_context=service_context)
      vectorIndex.save_to_disk('vectorIndex.json')
      return vectorIndex
      The problem is: I removed this line:
      prompt_helper = PromptHelper(max_input,tokens,max_chunk_overlap,chunk_size_limit=chunk_size)
      Should it be a problem?
      It worked here without it and with other texts...
      If it's a problem, could anyone explain the part of the code?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Please have a look at documentation, in case something has changed. At the time of this video, it was all good.

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      You rock! I just created a video on how to fix this :)

  • @youwang9156
    @youwang9156 Год назад +1

    appreciate for your work, just wonder if this method works for extracting specific number ? like i want to extract some price for mutilple products

    • @shweta-lodha
      @shweta-lodha  Год назад

      Yes, I tried this and in most of the cases, I got accurate output.

    • @youwang9156
      @youwang9156 Год назад

      @@shweta-lodha thank u so much for reply

  • @nikunjpatel6522
    @nikunjpatel6522 Год назад

    also getting this as well
    Output exceeds the size limit. Open the full output data in a text editor

    • @shweta-lodha
      @shweta-lodha  Год назад

      This is because you have restricted the output length for Jupyter. You can open it in notepad to see full error. Not a big deal 👍🏻

  • @alexdomla
    @alexdomla Год назад +1

    HI! This is a very helpful tutorial. I have a question:
    How would you bring this to a website? I mean, creating a chat website where you can ask questions and the bot answers based on your custom data. Is it possible?
    Thanks in advance :)

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      Glad you find it useful. Either you can create a website or you can create a widget and plug it onto your website

  • @adityakadam2256
    @adityakadam2256 Год назад +2

    That's a great video thanks for sharing this. I have a question if you don't mind. How different is this method rather than using Open AI APIs for chat completion, embedding and completion? Thanks.

    • @shweta-lodha
      @shweta-lodha  Год назад

      Here you can save the vectored index locally and re-read it.

    • @adityakadam2256
      @adityakadam2256 Год назад

      @@shweta-lodha Thanks a lot for a prompt reply. One last question. The method you explained under the hood it uses text-ada-002 model for embedding and text-davinci-003 for completion, right?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Yes, you're right :)

    • @adityakadam2256
      @adityakadam2256 Год назад

      @@shweta-lodha thanks for the response and clarification 😁

  • @karthikeyand6678
    @karthikeyand6678 Год назад

    @Shweta, Awesome video and Thanks for sharing.. I have data in CSV which has many text columns and many rows. I want to build chat application based on the data. Can you please let me know how can i implement this .

    • @shweta-lodha
      @shweta-lodha  Год назад

      You can read CSV in memory, chunk it and you’re good to go

  • @leoheise9967
    @leoheise9967 Год назад

    hey, any tips on how to fine tune a model based on a very large pdf document without the "
    " to split prompt/resolution? I thought maybe have a script break down in every question mark? Or is there some other way?

    • @shweta-lodha
      @shweta-lodha  Год назад

      I'm going to publish a video on this today. Stay tuned!

  • @mayurbhor2231
    @mayurbhor2231 Год назад +1

    Thanks a lot . Can we use llama_index instead of gpt_index ?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Yes, you have to use as gpt_index is deprecated. You can refer my GPT_Index breaking changes video for that.

  • @SPonharshitaP
    @SPonharshitaP Год назад

    could this learn from new unlabeled questions provided by the user ? Is this a trainable chatbot which could learn new things every time user asks a new question

  • @tuyenhuan-i3y
    @tuyenhuan-i3y Год назад

    Thank you very much. I really hope you will have more useful videos like that.

  • @RitvikGautam-p9z
    @RitvikGautam-p9z Год назад

    Hi shweta please tell which extensions you have used in VS code for this video

  • @lifedesignguru
    @lifedesignguru Год назад

    Amazing tutorial, thank you!!

  • @iulihardt9368
    @iulihardt9368 Год назад

    Hi, I done it with sucess! really thank you! However its possible to start one chat as the last part of the video and save where it stop and come back in other moment?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Glad you find it useful. You need to save your chat history and refer it whenever you're starting your conversation next time.

  • @tchalaful
    @tchalaful Год назад

    awesome !! very inspiring, thanks a lot for your work !

  • @dealersagent
    @dealersagent Год назад

    Thank you for this great video!

  • @educationdelightenglish3819
    @educationdelightenglish3819 Год назад

    madam, I am from India, after doing tons of research on ChatGPT API ,this is the best! I am working on Linux platform and have Jupyter notebook, will it work in that env?

  • @michaelxue5335
    @michaelxue5335 Год назад

    Exactly what I need! thank you

  • @thinanadl4939
    @thinanadl4939 Год назад

    Hi @shweta, can this bot integrate with any database..?

  • @shitaldhakne7989
    @shitaldhakne7989 Год назад

    Hi shweta, this is very helpful tutorial.
    I tried this code but after asking question, it is not responding...I waited for 15 minutes...still no response.

    • @shweta-lodha
      @shweta-lodha  Год назад

      Are you using VS Code? If so, please check your terminal, your py file or command box

  • @prem280182
    @prem280182 Год назад

    wonderful. you made it look so simple.

    • @shweta-lodha
      @shweta-lodha  Год назад

      Thank you! Cheers! If you can't make things simple, it means you yourself didn't understand :)

  • @Mohammed-lo7xr
    @Mohammed-lo7xr Год назад

    Thank you for the tutorial, it was great! How would I deploy this app to the internet onto my own custom domain?

    • @shweta-lodha
      @shweta-lodha  Год назад

      You need to create web app for that 😊

  • @Shailukool
    @Shailukool Год назад

    Hii need your help. I have followed exact same steps but facing below issue.
    I have given pip install gpt_index and got response same as shown in your video then when I am giving from gpt_index import SinpleDirectoryReader code then I am getting error Module not found “gpt_index”
    I tried uninstalling, re-installing and checking on chatgpt but failed for everything.
    Plz plz plz plz help

    • @shweta-lodha
      @shweta-lodha  Год назад

      Please check my most recent How-To-Fix video. It contains the solution

  • @aaron6822
    @aaron6822 Год назад

    i got this error "TypeError: BaseGPTIndex.__init__() got an unexpected keyword argument 'documents'". dont know if i will get any assistance what the heck.

  • @noronhasanjay
    @noronhasanjay Год назад

    Thank you. Very useful.

  • @nikunjpatel6522
    @nikunjpatel6522 Год назад

    i am getting this error while running code:
    Output exceeds the size limit. Open the full output data in a text editor---------------------------------------------------------------------------
    TypeError Traceback (most recent call last)
    Cell In[17], line 1
    ----> 1 vectorIndex = createVectorIndex('Chatbot')
    Cell In[14], line 16, in createVectorIndex(path)
    13 docs = SimpleDirectoryReader(path).load_data()
    15 #create vector index
    ---> 16 vectorIndex = GPTSimpleVectorIndex(documents=docs,llmPredictor=llmPredictor,prompt_helper=prompt_helper)
    17 vectorIndex.save_to_disk('vectorIndex.json')
    18 return vectorIndex

    • @shweta-lodha
      @shweta-lodha  Год назад

      APIs have changed a bit, since I published this video. Please refer the updated documentation, I am sure, it would be easy to fix.

  • @mcsquared361
    @mcsquared361 Год назад

    This was awesome, thank you so much!

  • @spicytuna08
    @spicytuna08 Год назад

    getting this error on line PromptHelper():
    chunk_overlap_ratio must be a float between - and 1

    • @shweta-lodha
      @shweta-lodha  Год назад

      Yes, you need to change this between 0 and 1 due to recent API changes

  • @zhiyingwang1234
    @zhiyingwang1234 Год назад

    Is my own data imported to openai system in this case? I don't want to breach the company's data confidentiality code.

  • @rickp.5778
    @rickp.5778 Год назад

    I appreciate your video very much!

    • @shweta-lodha
      @shweta-lodha  Год назад

      Glad it was helpful! Stay tuned for more...

  • @partheshtiwari1808
    @partheshtiwari1808 Год назад

    How to restrict the bot to only search for answers based on data provided or how to make it generate an error if the question asked is outside the available data

    • @shweta-lodha
      @shweta-lodha  Год назад

      For this, you need to tweak your prompt.

  • @sumanghosh6326
    @sumanghosh6326 Год назад

    Post, llama-index replaced by gpt-index, I am getting the error during creation of vector index. Wondering if anyone else was also facing the same.
    INFO:openai:error_code=404 error_message='Resource not found' error_param=None error_type=None message='OpenAI API error received' stream_error=False

    • @shweta-lodha
      @shweta-lodha  Год назад

      Please check out my How-To-Fix videos. Perhaps they can help you 😊

  • @reymardp
    @reymardp Год назад

    Hi, just want to ask if the data that I will feed will be exposed to public or to openAI? Is this safe for business? Thank you.

    • @shweta-lodha
      @shweta-lodha  Год назад

      IT would be exposed to OpenAI. If you are concerned about security part, I would recommend you to check Azure OpenAI.

  • @hammidfire
    @hammidfire Год назад

    Does it have to be a txt file? What if I have a csv data table? Also does it have to be in a directory?

    • @hammidfire
      @hammidfire Год назад

      Do you think I should have the script convert the csv to a text file first?

    • @shweta-lodha
      @shweta-lodha  Год назад

      It is not mandatory to take text file. You can use other file types too provided you are able to read them and convert the text to vector. No need to put in directory, if it is single file. But in that case, you have to look for different function.

  • @radbirden6111
    @radbirden6111 Год назад

    Hi thank you for a great video. Is there a way we can combine the code and add gradio ui ?

  • @jmisc
    @jmisc Год назад

    How do we load multiple data, and will the script and OpenAI remember the prior conversation?

    • @shweta-lodha
      @shweta-lodha  Год назад

      For memory, you need to change this implementation a bit

  • @SergeyBazyliuk
    @SergeyBazyliuk Год назад

    ValueError: One of documents or index_struct must be provided. what i need to do with this error?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Looking like it is not able to generate JSON properly. Please validate your docs

  • @kareemmatouk
    @kareemmatouk Год назад

    Hey so im wondering How I can create a interface once I have made my language model.

  • @Cantoneseaudiobook
    @Cantoneseaudiobook Год назад

    thanks for the amazin video, can it read podf file instead of txt?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Yes you can! But there would be different class/API/function to do so.

  • @Bludgeoned
    @Bludgeoned Год назад

    Hi, it seems I can’t find VectorSimpleIndex, has it been replaced by VectorStoreIndex?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Please check my latest video: How to Fix[GPT-Index]: Fixing GPT-Index Related Broken Pieces

    • @Bludgeoned
      @Bludgeoned Год назад

      @@shweta-lodha because it's llama_index right?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Absolutely!

  • @ibrahimjaballa2706
    @ibrahimjaballa2706 Год назад

    I am a beginner
    When using gpt chat, do I have to get the API from it, or can I get it from anywhere?

  • @noronhasanjay
    @noronhasanjay Год назад

    I am having issues installing gpt_index. I install it using pip install but when I do a pip list I see gpt-index (note the hyphen). I then cannot import gpt-index. Has anyone faced this problem? Thank you.

    • @shweta-lodha
      @shweta-lodha  Год назад +1

      GptIndex doesn’t exist anymore, it has been renamed. Please check my how to fix video on breaking changes of gptIndex

    • @noronhasanjay
      @noronhasanjay Год назад

      @@shweta-lodha Thanks Shweta. Just saw the other video. Much appreciated.

    • @noronhasanjay
      @noronhasanjay Год назад

      Shweta, please note that llama_index does not have GPTSimpleVectorIndex

    • @shweta-lodha
      @shweta-lodha  Год назад

      It has been changed to GPTVectorStoreIndex.

  • @farhanjunaid9180
    @farhanjunaid9180 Год назад

    Can you explain how you setup your IDE here?

    • @shweta-lodha
      @shweta-lodha  Год назад

      I am using VS Code and installed extensions for python and Jupyter

  • @harshvindersingh5330
    @harshvindersingh5330 Год назад

    Can u help in making similar bot for providing solution to accounts questions in step by step manner accurately!!!

    • @shweta-lodha
      @shweta-lodha  Год назад

      100% accuracy can’t be guaranteed, this is AI 😊

  • @algarve-video
    @algarve-video Год назад

    is possible to creat a simple page with a search page that request for the prompts?

  • @cmaly6167
    @cmaly6167 Год назад

    Are there any other vector index options, I'm not getting the expected results.

    • @shweta-lodha
      @shweta-lodha  Год назад

      There are few vector databases you can use like pinecone, redis, etc

  • @_Darth_Panda_
    @_Darth_Panda_ Год назад

    for this line
    vectorIndex = GPTSimpleVectorIndex(documents=docs,llm_predictor=llmPredictor,prompt_helper=promptHelper)
    I am getting the following error
    __init__() got an unexpected keyword argument 'documents'. Any tips?

    • @shweta-lodha
      @shweta-lodha  Год назад

      Please have a look at documentation, in case something has changed. At the time of this video, it was all good.

    • @_Darth_Panda_
      @_Darth_Panda_ Год назад +1

      @@shweta-lodha
      You are right. There appears to be some change in new version of gpt_index. This modified function code worked for me. Thanks for this amazing tutorial. was looking for something like this for a long time.
      def create_index(path):
      max_input = 4096
      tokens = 200
      chunk_size = 600 #for LLM, we need to define chunk size
      max_chunk_overlap = 20

      #define prompt
      promptHelper = PromptHelper(max_input,tokens,max_chunk_overlap,chunk_size_limit=chunk_size)

      #define LLM - there could be many models we can use, but in this example, let’s go with OpenAI model
      llmPredictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="text-ada-001",max_tokens=tokens))
      service_context = ServiceContext.from_defaults(llm_predictor=llmPredictor, prompt_helper=promptHelper)
      #load data - it will take all the .txtx files, if there are more than 1
      docs = SimpleDirectoryReader(path).load_data()
      #create vector index
      vectorIndex = GPTSimpleVectorIndex.from_documents(documents=docs,service_context=service_context)
      vectorIndex.save_to_disk('vectorIndex.json')
      return vectorIndex

    • @shweta-lodha
      @shweta-lodha  Год назад

      Thanks Darth. Lot many people are getting this error. Hence I created a new video explaining this change. Cheers!