How to Build a Custom Knowledge ChatGPT Clone in 5 Minutes

Поделиться
HTML-код
  • Опубликовано: 1 дек 2024

Комментарии • 304

  • @LiamOttley
    @LiamOttley  Год назад +4

    Leave your questions below! 😎
    📚 My Free Skool Community: bit.ly/3uRIRB3
    🤝 Work With Me: www.morningside.ai/
    📈 My AI Agency Accelerator: bit.ly/3wxLubP

    • @Oooluuuu
      @Oooluuuu Год назад

      Could you please show us how to deploy this to a web app, so it can be used in a web app?

    • @tombarimohamedamine3331
      @tombarimohamedamine3331 Год назад

      Is whole thing free?

  • @rushikshah2824
    @rushikshah2824 Год назад +64

    God bless the algorithm for showing this channel to me!

  • @adboost_AI
    @adboost_AI Год назад +39

    The AI beast dropping knowledge bombs again! Awesome video Liam, punchy, engaging and dripping with actionable content 👏🏼 Cutting edge stuff.

  • @ahmaddada-dq7zn
    @ahmaddada-dq7zn Год назад +5

    Nice one Liam, always enjoying your contents.

    • @LiamOttley
      @LiamOttley  Год назад

      Much appreciated mate, glad I could help!

  • @Noboy504gaming
    @Noboy504gaming 8 месяцев назад

    Great info!

  • @Mich6961
    @Mich6961 8 месяцев назад

    omg thank you! You've helped me not have to categorise my grocery shopping list into fruit, meat etc. manually.

  • @martinmadlmayr9947
    @martinmadlmayr9947 Год назад +3

    Sorry, I am a novice in the area of AI, but I have a typical management question:
    What would it need to being able to use my own knowledge bot WITHOUT feeding the critical information to meta, Google or openai? Or: how can I ensure that my data is safe?
    By the way: great content- much appreciated

    • @workinprogress2077
      @workinprogress2077 Год назад +2

      I am looking for an asnwer for this too. I think the answer is you need to run the chatbot locally/on a server you control. There are many different ways to do this, including using Portainer + Docker (this is what I was told by a exeriences coder)

    • @bl8596
      @bl8596 Год назад +1

      Yeah I would like a video specifically on this

    • @kavian4249
      @kavian4249 Год назад

      Did you find any solutions?

    • @kavian4249
      @kavian4249 Год назад

      @@workinprogress2077 Did you find any solutions?

    • @kavian4249
      @kavian4249 Год назад

      @@bl8596 Did you find any solutions?

  • @Warclimb64
    @Warclimb64 Год назад +1

    Thanks for making this!

  • @AdamPaulTalks
    @AdamPaulTalks Год назад +2

    Yes an update to this with GPT 3.5 turbo (current model) would be incredible.

  • @TheHeartShow
    @TheHeartShow Год назад +3

    Great vid as always!

  • @sohamagarwal00
    @sohamagarwal00 Год назад +3

    really cool stuff! Musch more efficient than traditional methods of custom training the model or making custom responses. Thanks a lot!

  • @ExploitInsight
    @ExploitInsight Год назад +1

    you deserve tones of subscribers

  • @caseyriley1014
    @caseyriley1014 Год назад

    WOW! Imagine using this to look for a specific "tag", "object", or whatever from the developer docs of a coding language...
    You could probably use this, to get an ai to write obscure languages by telling it the structure to use

  • @Faisal1504
    @Faisal1504 Год назад

    Very Interesting

  • @birkopheim-3263
    @birkopheim-3263 Год назад +3

    Anyone having trouble with the code, llama index updated the name "of "GPTSimpleVectorIndex" to "GPTVectorStoreIndex". Just replace them and it should work if that is the error you are getting

  • @samwilliams3929
    @samwilliams3929 Год назад +3

    Cracking video ! Short, sharp and focused ! Information is spot on and really helpful. Thanks! Looking forward to seeing more good content.

  • @oryxchannel
    @oryxchannel Год назад

    I edited this because I want to emphasize the importance of search (and how poor RUclips search is). I had a vein in the center of my forehead trying to do this on my own about three weeks ago....and I'm only seeing Liams video now. Get your search alerts and notifications down during this revolution, and you just may be lucky enough to find the Liams of this world giving you a play-by-play breakdown of exactly what you want in your AI build.

    • @LiamOttley
      @LiamOttley  Год назад

      Very kind words mate glad I could help ❤️🙏🏼

  • @webdancer
    @webdancer Год назад +3

    Liam, thanks for sharing this information. This is quality stuff.

    • @LiamOttley
      @LiamOttley  Год назад

      Glad you enjoyed it 🙏🏼

    • @EnlistedBootCamp
      @EnlistedBootCamp Год назад

      @@LiamOttley wish i knew 5% of how to get any idea started, ugh, thanks you are golden

  • @codershorts
    @codershorts Год назад +4

    Liam never disappoints :)

  • @olamilekanajao6377
    @olamilekanajao6377 Год назад

    Well done Liam

  • @paulpaturle6957
    @paulpaturle6957 Год назад +4

    Super interesting ! Thank you, Liam ! I was wondering how can we see the model used, and if we can control the temperature ?

  • @mamdouhalmheid9685
    @mamdouhalmheid9685 Год назад +2

    I really like your content, thank you!

  • @adil.acoustic
    @adil.acoustic Год назад +2

    Amazing Liam bro..

  • @nadinejammet7683
    @nadinejammet7683 Год назад

    Thank you, i can already see how to use it in education.

  • @StudioTatsu
    @StudioTatsu Год назад

    Thank you, I've been looking for something like this for weeks

  • @ColtonCampbell
    @ColtonCampbell Год назад +2

    FYI, "GPTSimpleVectorIndex" changed to "GPTVectorStoreIndex"

  • @lonniesims868
    @lonniesims868 Год назад +3

    very informative video! Could we get a video on langchain soon? 👀

    • @LiamOttley
      @LiamOttley  Год назад +1

      Quite a big beast to tackle, hard to not make it too technical for most of my viewers :/

    • @lonniesims868
      @lonniesims868 Год назад

      @@LiamOttley understandable, you have been one of the best teachers when it comes to AI and how to leverage it! If your able to do a video on it in the future it would definitely help a lot. Until then I’ll be waiting for your next video!

  • @TheRealPlayer00
    @TheRealPlayer00 Год назад +2

    Good guy Liam!

  • @udaynj
    @udaynj Год назад +1

    Put speed for the video at 0.75 - Liam speaks really fast!

  • @Limesh
    @Limesh Год назад

    Superb Video, I loved it. ❤❤❤

  • @ashishrathore7783
    @ashishrathore7783 Год назад +3

    The code isn't working anymore, got stuck with an error on GPTVectorStoreIndex. The libraries you have used have been modified.

  • @owen_silk
    @owen_silk Год назад +1

    keep making great videos

  • @paul-thys
    @paul-thys Год назад +1

    It seems anyone will be able to do this soon. The value will be in the data. Can you use AI to gather the data to train in on?

  • @ramp2011
    @ramp2011 Год назад +1

    thank you for the video. What is the difference between using LlamaIndex vs Langchain? thank you

  • @jornreuvers1598
    @jornreuvers1598 Год назад

    Awesome video! Going to mess around with all this as my first steps into AI programming... well sorta first steps anyway!

  • @tomtomatron8625
    @tomtomatron8625 Год назад

    Great pacing and demo, thank you for the tutorial.

    • @muradbaghirli
      @muradbaghirli Год назад

      Hi, do I have to pay for open_ai key?

  • @mohdjibly6184
    @mohdjibly6184 Год назад +1

    Awesome video...thanks bro

  • @romancandlefight1144
    @romancandlefight1144 Год назад

    Great video
    Respect for sharing your files 🙏

  • @yakkalabour
    @yakkalabour Год назад

    Really cool video

  • @bilalmsd07
    @bilalmsd07 Год назад +3

    Great video as always. Keep up the good work. Wish you very best of luck for your channel. I hope it will rock in the near future.

  • @AlbanBytyqi
    @AlbanBytyqi Год назад

    Thank you. It is a vit above my head

  • @patrick.cheung
    @patrick.cheung Год назад

    Great Video. Thanks for sharing. 🎉

  • @michielsmissaert
    @michielsmissaert Год назад

    Wow impressive video, thsnk you so much!

  • @SimonStJohn
    @SimonStJohn Год назад +1

    Hey Liam! Awesome thanks...can you do a follow-up on an addon to index a website like a blog? And have the output to include a link to the article used for the answer so users can click through to read more?

  • @ishaanme91
    @ishaanme91 Год назад

    Extremely cool! Looking forward to more awesome content.

  • @Umuragewanjye
    @Umuragewanjye Год назад

    Thanks. for sharing the skills

  • @smann43231816
    @smann43231816 Год назад

    Thankyou, great video

  • @greg_thomson
    @greg_thomson Год назад

    Amazing tutorial! subscribed

  • @roberthuff3122
    @roberthuff3122 Год назад

    Fantastic! Thank you.

  • @vishnuvardhanvaka
    @vishnuvardhanvaka Год назад

    How much would be the cost of the api usage , please tell me I need that !

  • @z1mt0n1x2
    @z1mt0n1x2 Год назад

    oh..... now imagine throwing in all the D&D PDF's into one simple bot :D

  • @michaelabdoofficial
    @michaelabdoofficial Год назад +1

    Fucking brilliant man. Keep up the mad hustle.

  • @krisdavant5813
    @krisdavant5813 Год назад

    This is very very good

  • @chevvvv
    @chevvvv Год назад +3

    I would like to see a Javascript version of this

    • @LiamOttley
      @LiamOttley  Год назад +1

      Not sure if there are javascript equivalents for libraries like Llamaindex

  • @TheRealPlayer00
    @TheRealPlayer00 Год назад +3

    I can't use the custom code for some reason. it says OpenAI is not defined here: llm_predictor = LLMPredictor(llm=OpenAI(temperature=0.1, model_name="text-davinci-002")).
    I tried from openai import OpenAI but it is not working.
    Any suggestions anyone?

    • @JebliMohamed
      @JebliMohamed Год назад +3

      You need to add : from langchain import OpenAI

    • @TheRealPlayer00
      @TheRealPlayer00 Год назад

      @@JebliMohamed good man

    • @LiamOttley
      @LiamOttley  Год назад

      Thanks Jebli

    • @millerco2000
      @millerco2000 Год назад +1

      I am still getting this error.
      NameError Traceback (most recent call last)
      Cell In[37], line 6
      2 from llama_index import LLMPredictor, GPTSimpleVectorIndex, PromptHelper
      5 # define LLM
      ----> 6 llm_predictor = LLMPredictor(llm=OpenAI(temperature=0.1, model_name="text-davinci-002"))
      7 from langchain import OpenAI
      8 from openai import OpenAI
      NameError: name 'OpenAI' is not defined

    • @TheRealPlayer00
      @TheRealPlayer00 Год назад

      @@millerco2000 from langchain import OpenAI this fixed it for me

  • @문세영-e9g
    @문세영-e9g Год назад +1

    Thanks, but I can't see any code that 'prompt' variable is used for openai's api.
    Can u explain how can the chatbot remember previous chat?

    • @문세영-e9g
      @문세영-e9g Год назад +1

      I figured out it.
      Need to modyify as query(prompt) intead of query(user_input)

    • @LiamOttley
      @LiamOttley  Год назад

      🙏🏼

  • @会飞的猪-s8d
    @会飞的猪-s8d Год назад

    How do I break the word limit for an answer,Sometimes the answer feels half, not quite ,How can I modify it thank you

  • @joebanks4997
    @joebanks4997 Год назад

    Very good. How do we train the bot to be context oriented? If I only want the bot to have knowledge of radio controlled cars for example. At the moment I can ask this bot about what's in its index, but I can also ask it about washing machines and it will answer.

  • @mohamednihal8215
    @mohamednihal8215 Год назад +1

    Wow! Can we use any open source llm model instead of using openai api key?

  • @ISK_VAGR
    @ISK_VAGR Год назад

    Crazy Good Amigo

  • @tonaltti
    @tonaltti Год назад

    How to personalize the bot? To have a custom knowledge base but also tell the conversation style?

  • @april11729_
    @april11729_ Год назад

    wow!!!! than you so much !!!

  • @victorquinones9111
    @victorquinones9111 Год назад +1

    Question: the information that we index, is this shared with an external server loosing its confidentiality, or it just remain in the users computer?

  • @OnChainEpic
    @OnChainEpic Год назад +2

    Hey Liam, sorry not sure what Jupiter is, is that what your running the code in? Assuming we can run this locally? Also how would you integrate this into something you built, by referencing this new model or what within Open AI? Would like some more details on the code your using and integration???

    • @LiamOttley
      @LiamOttley  Год назад

      This is all Python code running in Jupyter notebook. Super easy install with the anaconda launcher. Deploying apps is a bit trickier so you’d probably want to play around with things on your own as I am then hire a developer once you’re happy with it to create a product out of it

  • @Sleeperknot
    @Sleeperknot Год назад

    In this example, where does the transformer model sit? At facebook servers or locally?

  • @SeanieinLombok
    @SeanieinLombok Год назад +1

    can we teach it to iteratively improve its own code?

  • @lucasalvarezlacasa2098
    @lucasalvarezlacasa2098 Год назад +1

    Great video!. I have some questions:
    1) When we create an index, I understand that what's going on is that somehow based on the question we know which part of the files inside the index should be used to reply to it, and this is context information given to GPT as part of the prompt. Is that the case?
    2) Is there a limit in how bit this index can be?

  • @DanielBowne
    @DanielBowne Год назад +1

    Great info. How did you capture page numbers from pdf and auto add to the output?

    • @LiamOttley
      @LiamOttley  Год назад

      Which part do you mean? Drop the timestamp

  • @squiddymute
    @squiddymute 10 месяцев назад

    can you do any of this stuff without open ai api ?

  • @aprilrobertson7450
    @aprilrobertson7450 Год назад

    What do you recommend to build a construction engineering contractor estimating cooperation

  • @jgilmourtechsmog
    @jgilmourtechsmog Год назад

    great stuff, looking to tie this out as a slack bot to answer questions from employees for various business facing items contained in our KB

    • @umairx25
      @umairx25 Год назад

      Hello, if I use the Google docs loader, will the file be updated every time I update the Google doc?

  • @darkknightgaming9016
    @darkknightgaming9016 Год назад +2

    Great video! I was just wondering if it is possible to make it less expensive, because when I use big data bases it uses a lot of tokens.

    • @LiamOttley
      @LiamOttley  Год назад +1

      GPT 3.5 Turbo is extremely cheap, hopefully they add support for it soon instead of davinci-003

  • @smudgepost
    @smudgepost Год назад

    Very good. Need a nice front end and link to a vector db like Pinecone

  • @ahmedyehia4846
    @ahmedyehia4846 Год назад

    amazing one

  • @tomrauch1
    @tomrauch1 Год назад

    Hi, great video thanks! But I am running into an error loading the notebook - ok to post it here for resolution?

  • @varunverma5049
    @varunverma5049 Год назад

    Thanks for the session!
    how do you get the info logs for token usage?

  • @denzilstudios7072
    @denzilstudios7072 Год назад

    Good stuff here!, now friends don't let friends keep their api keys in public repositories! Remove it from the code and be safe.

    • @LiamOttley
      @LiamOttley  Год назад

      I deleted the api mate it’s all good beginners would find it hard it I was creating env variables etc

    • @denzilstudios7072
      @denzilstudios7072 Год назад

      @@LiamOttley good job

  • @nemesis851_
    @nemesis851_ 6 месяцев назад

    Does this training of the knowledge base, cut into the token limit?

  • @denniskampien987
    @denniskampien987 Год назад

    Cool, but does it keep the context of all the document?

  • @bianka1994
    @bianka1994 11 месяцев назад

    hi!
    great video!
    I tried to use the code provided in the notebook, but I got an error saying the openai API key is incorrect. maybe it's expired?

  • @wamballa5948
    @wamballa5948 Год назад

    how come you use Jupyter and not google collab?

  • @thebigbigdaddy
    @thebigbigdaddy Год назад

    How is it private when we are loading Open AI's davinci Library to process custom data? Is private info not getting into public space? Great tutorial though!

    • @awesomeguptaji
      @awesomeguptaji Год назад

      you should not share something personal and things which should not be on the internet

  • @cityofhustlers
    @cityofhustlers Год назад +1

    Truth is your tutorials are nice and life changing but the problem is it's not beginners friendly you don't show us how you get started everything should be from scratch so that people can follow up fully understand what you are doing

  • @spzen98
    @spzen98 Год назад +1

    Hi, I have a problem with the OpenAI API rate limit when using large set of data. This is when loading the GPTSimpleVectorIndex. For small data sets it's okay. Can u advise?

  • @antfiv007
    @antfiv007 Год назад +1

    Dear Liam, great stuff. Already subscribed and looking forward to learn from you. A question: how to increase the length of the output ? I am using it to document some code and it stops before completing the task entirely. many thanks

    • @taylormun
      @taylormun Год назад +1

      often the token limit is too low

  • @bgtubber
    @bgtubber Год назад

    Very nice! Does this thing run locally? Especially if there is no internet connection available.

  • @fabioarobne
    @fabioarobne Год назад

    What's the difference between that and fine tuning?

  • @dsalvat1
    @dsalvat1 Год назад

    Interesting, can it be done with a local trained model, like Llama or Vicuna 7b to keep it offline?

  • @Monalexander799
    @Monalexander799 Год назад

    Can you use this offline locally?

  • @jorgerios4091
    @jorgerios4091 Год назад +1

    Can you create one chatbot using either Langchain or Haystack and the recent star Alpaca?

    • @projectbit2248
      @projectbit2248 Год назад

      I second the above request langchain + Alpaca would be really interesting.

  • @chikosan99
    @chikosan99 Год назад

    Thanks a lot!, Great Content!
    There is a limit of data that you can index ?
    there is an option with openai api to work with chatGPT4 with custom indexes?

  • @Someone-mn1sx
    @Someone-mn1sx Год назад +1

    My question is will doing this bypass the content filter of ChatGPT? Could I host GPT and use llama-index or something to do that? No, it's not for sexy time. It won't talk about a lot of things like stock trading or cybersecurity because it flags it as bad content and gives an excuse instead of responding.

    • @LiamOttley
      @LiamOttley  Год назад

      Good question, I'd assume because it's just using your API key and davinci-text-003 or whatever you set it as then it would still hit the filter

  • @sirdrakey
    @sirdrakey Год назад

    Very cool 😎 following!

  • @yashsrivastava677
    @yashsrivastava677 Год назад

    Code doesn;t work anymore because from llama_index import GPTSimpleVectorIndex throwing an error now.

  • @bimwerx
    @bimwerx Год назад

    Great content! How would you get around the response character limit using this example?

  • @novimnovember
    @novimnovember Год назад

    Hi, great video! Totally new to python here. If i want to host this in a website, how do i do that? Thanks.

  • @kiranshiveshwar3108
    @kiranshiveshwar3108 Год назад +1

    If I have a PDF of 300 Pages, will it still work as I saw another video using Lanchain and Pinecore (to store vector data for 300 pages)

    • @LiamOttley
      @LiamOttley  Год назад

      Good question, I haven't seen anything on limits for these kinds of indexes so worth testing. Ask different questions about info on sample pages?

  • @henrymetzger9951
    @henrymetzger9951 Год назад

    Can i train llama to write my books? Build it to help me write it prompt based after making my choices from multiple selection and move forward. Like based ona large overview? Maybe have it write the structure in the beginning?

  • @noorameera26
    @noorameera26 Год назад +1

    Hi Liam! I'm interested to build a chatbot for an internal website, however, I worry that this might caused information leakage. What's your opinion on this?

    • @LiamOttley
      @LiamOttley  Год назад +1

      I’d say OpenAI is taking privacy pretty seriously. I wouldn’t be worried personally, people have built huge apps using their APIs already.

  • @a999haa
    @a999haa Год назад

    Hey Liam! Great video mate 🙌
    Can I ask if this can generate responses in a particular json format if needed after indexing any document?
    Thanks again for the video!