ChatGPT for YOUR OWN PDF files with LangChain

Prompt Engineering

Просмотров 270 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 4 окт 2024

Комментарии • 427

@engineerprompt Год назад ⁺³
Want to connect?
💼Consulting: calendly.com/engineerprompt/consulting-call
🦾 Discord: discord.com/invite/t4eYQRUcXB
☕ Buy me a Coffee: ko-fi.com/promptengineering
|🔴 Join Patreon: Patreon.com/PromptEngineering
▶ Subscribe: www.youtube.com/@engineerprompt?sub_confirmation=1
@oryxchannel Год назад ⁺²⁶
OMG someone took the time to talk about usage costs. No one has yet herded up usage case scenarios in relation to cost from major AI vendors. Thanks for your consideration in this area.
@nickstaresinic9933 Год назад ⁺²²
Well, done. You filled in several important holes in my understanding of how to code something like this for my domain.
@engineerprompt Год назад ⁺²
Glad it helped!
@besarthysniu1230 Год назад ⁺²¹
Very clear, thorough, well paced and learner-centered. What an amazing educator!
@ricksegalCanada Год назад ⁺²⁰
Excellent video. In three minutes, I learned more about how AI works in general than 100s of other videos. Well done, sir.
@sicfxmusic 9 месяцев назад
Let me see your watch history 🤣🤣
@tchrapko Год назад ⁺¹²
At this point it doesn't get any easier than that! I was able to drop in a technical document that makes my eyes bleed when I read it and just start asking questions of it instead. Great job! If someone would bundle this up into a nice little application and let me aim it at directories full of documents I think they could make a boatload of money.
@blockchainbrudda3051 Год назад
What do you mean 'aim it at directories' ?
@tchrapko Год назад ⁺¹
@@blockchainbrudda3051 aka "folders"
Like D:/Technical Documents/
I can't wait for the day when SharePoint has AI assistance built in so a company can ask natural language questions of their business content and get back Chat-GPT style answers with links to the source material. It'll be a revolution for content management and productivity.
@tommasterplus Год назад
Chatpdf
@adi2soni Год назад
Working on It
@engineerprompt Год назад ⁺⁶
I created an updated video to work with multiple PDF files: Checkout here: ruclips.net/video/s5LhRdh5fu4/видео.html
@raymond_luxury_yacht Год назад
no dude its still giving me an error. please can you have another look?
Notebook loading error
There was an error loading this notebook. Ensure that the file is accessible and try again.
Request had invalid authentication credentials. Expected OAuth 2 access token, login cookie or other valid authentication credential. See developers.google.com/identity/sign-in/web/devconsole-project.
@cvetelingeorgiev1527 Год назад
How to increase output size. It works great, but output text is too short and I don't see an obvious way to increase it.
@engineerprompt Год назад
@@cvetelingeorgiev1527 You can pass on temperature to the OpenAI object, play around with it. That will change the behavior.
@elmiraghorbani7437 Год назад ⁺¹
still, the file isn't accessible
@dinugherman8785 Год назад
The notebook is not available. 😢
@martynas-al Год назад
A very clear explanation. Before this video, I was confused about the purpose of embeddings and how the actual answers are produced and the video explained it very well.
@calabisan Год назад ⁺⁴
Great work! Thanks! Works out of the box. Shorter and clearer impossible 🙂
@engineerprompt Год назад
Great to hear!
@AIEinstein Год назад ⁺²
AWESOME Video! This kind of apps are really good :)) the workflow gets improved too much
@bingolio Год назад ⁺²⁶
Great example, please cover how to do same using FreeGpt, Dolly or other Opensource models
@dongnguyenanh7282 Год назад ⁺¹
hello, how do you get the location of the pdf files on the drive?
@yashdes1 Год назад ⁺¹
Langchain works with essentially any model with an api
@arthur...barros Год назад ⁺²
Excellent educator. loved the well paced video. Thanks for sharing your knowledge and findings
@engineerprompt Год назад
Thank you!
@nottyverseOfficial Год назад ⁺²
ChatPDF is also good. I have used it.. and its free for 120 pages, 3 PDFs/day, and 50 questions/day... one can pay $5 per month to get very good upgrades
@helter2K10 Год назад ⁺²
Nice work - very clearly explained and you addressed the code fragments really well - look forward to more vids!!
@engineerprompt Год назад
Thank you :)
@hiutuanting4643 Год назад ⁺²
Is it possible to feed an entire GitHub project to GPT and ask it to explain or give ideas on how to modify the code?
@jrs999999 Год назад ⁺⁴
Really interesting and helpful! Thanks for taking the time to put this video together.
@engineerprompt Год назад
Glad it was helpful!
@YugKhatri-ht8kd Год назад ⁺²
what is the approx cost of API, if I use a University Subject's textbook with 1000 pages? I mean cost of embedding the pdf data to model and also the search cost for questions. Can you tell the cost in the form of API pricing or tokens?
@Alice_Fumo Год назад ⁺²
Wow, I stared at that opening graph for like 10 minutes being in awe, realizing the implications and uses, marveling at the elegance. This is insanely similar to an approach I thought of to extract new information during conversation, but this is more elegant.
I should start making graphs of my approaches, since they do tend to get pretty complex and sometimes I lose track of what I'm doing or trying to do.
@ianabrahams5434 Год назад ⁺¹
Thanks for a very instructive video and learned quite a bit from your step by step guide. Much appreciate the effort you put in & you have inspired me to keep expanding my knowledge in this area. Thank you.
@engineerprompt Год назад
Glad you found it useful.
@SedhuujGorem 8 месяцев назад ⁺³⁰
The Best tool for this is ruclips.net/video/bcK7LldB3dk/видео.html
I like some of the transitions, but sometimes they're a bit too much and are seemingly random. Since we use these persistent elements that transition across pages to indicate some kind of relationship between the previous and the next states, some of your transitions confuse me because I can't immediately see what the relationship is.
For example 1:23 of the selectable tiles (which weren't selected) transition into being two switches... does that mean anything? are they related in some way? I see this as random and a bad use of the design language. However, at 3:14 I like the transition from switches to the ticks on a paper, that makes sense to me. Epic presentation tho
@jejejejeq 8 месяцев назад
video unavailable :/
@TylerKlug Год назад ⁺³
Fantastic video. I'm sure someone has made a follow-up somewhere, but can you help me understand how to wrap everything into my own UI where I can pass a parameter through to the search query so it can effectively act as a chatbot?
@port7421 Год назад ⁺¹
It was a very helpful guide. Thanks! Great that I was able to test it quickly thanks to your notebook link.
@dongnguyenanh7282 Год назад
hello, how do you get the location of the pdf files on the drive?
@iamjustahair1315 Год назад ⁺¹
@@dongnguyenanh7282 This is the default that u should use /content/gdrive/My Drive/data/2023_GPT4All_Technical_Report.pdf.
i would suggest to make a folder named 'data' and place your pdf file in it. It worked for me
@port7421 Год назад
@@dongnguyenanh7282 Hi, I uploaded my own file to my Google Drive. You must allow access to the drive while signed in to your Google account. For me it looks like this:
reader = PdfReader('/content/gdrive/My Drive/my.pdf')
@andresmontoya4870 Год назад ⁺¹
Mindblowing! Very clear and your explanation is excellent! Thanks ;)
@login2video 10 месяцев назад
Very nice... explained at the right pace.... keep up the good work... it would be more helpful if a repo is maintained...
@engineerprompt 10 месяцев назад
Thank you, that’s a great idea
@ultimategolfarchives4746 Год назад ⁺¹
Alright, let's be real here. I have no idea who you are, what IDE you're using, or how AI works (I still think it stands for "Artificial Iguanas"). but I can confidently say that your video is fantastic!
🔥Great job! 🔥
@engineerprompt Год назад
Thank you, comments like this keeps me going :-)
@ultimategolfarchives4746 Год назад
@@engineerprompt Seriously, you know your subject and you take the time to explain the concept behind it. Thanks for your content 🙏🙏
@andre-le-bone-aparte Год назад ⁺²
Just found your channel, Excellent Content! - Another sub for you sir!
@engineerprompt Год назад
Thank you!
@gybeturkey107 Год назад
Very well laid out and all answered. Thank you.
@ludwigvanbeethoven61 Год назад ⁺²
Thanks, can we also use it with non pay-for-each-token models like ChatGPT3.5 or ChatGPT4? (Might be a stupid question; but i did not find an answer to this so far)
@captanblue Год назад
I'd like to know as well
@aliminaoui6448 Год назад ⁺⁹
Hello, thanks for this amazing content !
I tried it with multiple PDF and CHATGPT get confused when I ask him generic questions that are similar on multiple documents (for example : "what are the skills of Jhon DOE ? " when I uploaded multiple PDF resume, it send me back the skills of everyone in the vector database)
How do you manage multiple PDF ?
@engineerprompt Год назад
I have another video on dealing with multiplle PDF files. Have a look at that. You can set it to give you the top k responses. Will be making a video on it soon.
@victorgianordoli5403 Год назад
@@engineerprompt Your explanation is very didactic. Your code is very clear. I look forward to your new video on chatting with multiple PDFs. Congratulations.
@engineerprompt Год назад
@@victorgianordoli5403 Thank you, you probably want to check out this here: ruclips.net/video/s5LhRdh5fu4/видео.html
@lynnqi6451 Год назад
Your explanation is very clear! Love it! Thank you very much!
@engineerprompt Год назад
Glad you found it useful. Appreciate the kind words.
@ziga1998 Год назад ⁺¹
I have a question.. So what If I want to have like a knowledge of chatGPT model which I specify, plus the added information from the PDF file? How is this achievable?
@peterthegreat7125 Год назад ⁺²
Super useful, this is what I have been looking for, ❤ love it!
@dongnguyenanh7282 Год назад
hello, how do you get the location of the pdf files on the drive?
@peterthegreat7125 Год назад
@@dongnguyenanh7282 "/content/gdrive/My Drive/" is the root dir of your gdrive, you can append you file path in gdrive after this root dir. you can treat it as a real folder and use 'ls' to find out where your file is.
@MohitKumar-gp6nr Год назад ⁺¹
I have some JSON files which I want to use for chatbot data source. How to store the JSON information in Croma DB using embedding and then retrieve it based on the user query. I googled a lot but did not find any answers.
@GimbaGoyo Год назад ⁺¹
Nice, I don't have the basic coding skills and I feel that's a must. I will like to challenge you though to create an App that can compare two or more than two documents and to discover if there are issues of copy and paste or plagiarism between the documents without running a search across the whole internet. Is this doable?
@cstan2381 Год назад ⁺¹
Thanks! Is there a cost associated when you call OpenAIEmbeddings(). Can I run a local LLM model to answer the query?
@engineerprompt Год назад
Thanks this out: ruclips.net/video/MlyoObdIHyo/видео.html
@yousufleads Год назад ⁺¹
I assume there is no one-click .exe file (yet) or a clear GUI?
@kicheko4980 11 месяцев назад
You sir I am buying you a coffee
@CER786 Год назад ⁺¹
It was amazing learning for me. I built my application successfully. Can we take user input using a window? Can we use pdf in Arabic or Urdu?
@engineerprompt Год назад
You can build GUI application on top of it. Check this out:
ruclips.net/video/RIWbalZ7sTo/видео.html
I haven't used it for any other language but I think it can be done.
@adityahpatel Год назад
the 3 questions you are asking are very simple. running this on a company's annual report 10-K. There are many questions e.g. what is the capital expenditure for 2022. The answers exist in the PDF yet it says 'i don't know'.
@engineerprompt Год назад
You will have to do some prompt engineering on top of the simple examples I have shown here.
@8888-u6n Год назад ⁺³
Thanks for this video it's really helpful. Could you make a video on how to do embedding with gpt4all and langchain on colab , it would be cool to be able to run your own models and have your own extra data sets
@JavArButt Год назад ⁺¹
Very nice content - thank you for that introduction
@harishusic5284 Год назад ⁺¹
Thanks! This was super helpful and I was able to query my own PDF's but I can't figure out where and how to specify the LLM I want to use GPT-4. Can you please let me know?
@engineerprompt Год назад
Watch the latest video on the channel. I have provided detailed explanation there.
@adytech5788 Год назад ⁺¹
Hello, how do you think i can handle the same process with lot of files of my own company database, i have few Gigabytes of files that i would need to scan & chunks to create my own database, then connect with GPT4all to interact with question regarding my company, give some tasks etc...
thx for the head up
@DanieleCorradetti-hn9nm Год назад ⁺²
Amazing tutorial, but is there a way to have multiple pdf all stored in the same place once for all and then go there for the query as we are doing in this tutorial? From a practical point it would have much more sense...
@engineerprompt Год назад ⁺⁴
There is more interest that I anticipated :-) I am going to be making more videos on the topic with practical use cases (multiple files, different file formats etc.). Keep an eye out for those!
@jasonpearson1555 Год назад ⁺¹
Godspeed sir
@user-wr4yl7tx3w Год назад ⁺⁴
is it possible to replace openAI with alternatives like Alpaca or Vicuna, given the cost?
@SuproMVP Год назад ⁺⁷
Tried searching a lot. Every example uses OpenAPI. No one has used LLama, Alpaca or Vicuna.
@synthclub Год назад ⁺¹
No.. body has the compute hardware that openai has or will have..
@miguelcabaero5843 3 месяца назад
Hello in the case that i had a diagram, graph, chart, or any kind of graphic organizer in the pdf, is it possible for that too to be inputed? Thank you so much btw for the excellent video.
@indianmonk8746 Год назад
OSM, I really liked your to the point video, Thank you
@chinmaybhalerao5062 Год назад ⁺¹
Excellent video!
@VastIllumination Год назад
I love you. thank you for making this so easy!
@snaky1310 Год назад ⁺¹
That was a great video, thanks!
But in the end, how do you then output the ChatGPT message outside of Langchain into your apps?
@BalaramakrishnaKamma 10 дней назад
If I ask questions about graphs, tables, or images present in the PDF, will it provide an answer?
@saeedbello 10 месяцев назад
Well explained. Thank you for sharing your knowledge with us. I want to ask if it is possible to get response of a query from the vector database and ad well as the outside the vector database
@codea1273 Год назад ⁺¹
So when does this use the API the most? During the embedding or during the query? If its during the embedding, can I pickle the results so I can query the same stuff faster and more cheaply in the future?
@engineerprompt Год назад ⁺¹
At both stages, and yes, you can get embeddings of your documents and store them locally and then do the api call for the query.
@KOREAyoungwoo Год назад
I am waiting for multiple file read, thanks a lot!
@sportscardvideos Год назад
What's the best video for someone with little to no python experience but wants to use langchain
@not-a-weasel Год назад ⁺¹
Thanks for sharing!
@danielmoore4311 Год назад
I have been looking for something like this for almost 2 months, and watched at least a dozen youtube videos. This is the first video/code that acutally works! Question... suggestions on how to connect this to streamlit or another webbased query platform?
@engineerprompt Год назад
Thank you! You want to check out this video: ruclips.net/video/RIWbalZ7sTo/видео.html
@cretindofinoi Год назад ⁺¹
Hi, thank you for the video. I need your help. I want to use this solution. However, i would to base gpt answers on one hundreds pdf files. Each pdf file is a book about 200 pages. I do not see in this video how we can rely on several pdf.
@engineerprompt Год назад
Check this out for multiple files. Will be making more detailed videos on the topic soon ruclips.net/video/s5LhRdh5fu4/видео.html
@GooberStudios Год назад
great simplified video explanation. In the part where you choose the text-ada model. Can you replace that with the model id of an openai fine-tuned model we created? This way we can use the fine-tuned model to speak with the pdf?
@engineerprompt Год назад
Yes, you should be able to do that easily.
@GooberStudios Год назад
@@engineerprompt so basically if i wanted, i can say have a fine tuned model that speaks like Thor read my pdf knowledge base and answer in the way of Thor. is this correct?
@MAButh Год назад
Nice video! I assume that DeepL uses a similar approach to translate PDFs. I used it but encountered some problems. For example, if a sentence does not end on one page, it can cause problems and return nonsense. This may have been the reason for our "Overlap"? So, I rewrote some 250-page-long documents to eliminate any overlapping sentences from page to page. (From now on, I will compare translating a text to making queries, since both require a comparable amount of "work" for GPT.) This helped a lot, but not always.
In my opinion, the reason for the occasional issues is that it is difficult to predict the number of tokens required for each page. If the text, like in my case, is complex scientific or technical content, GPT will need more tokens for the same number of characters than it would for a fairy tale, for example. Therefore, with a technical or scientific document, you may run out of tokens very quickly if the content is complex. Whether it's translating or making queries, I believe this problem will arise.
Perhaps we need to wait for GPT to upgrade the maximum number of tokens by 2-3 times from now until it can handle any kind of text. Currently, you could reduce the format of your pages to ensure that each page has less (con)text.
@kashanasim7903 8 месяцев назад
The model used by default is text-davinci-003 and it is now deprecated so what should we do now ?
Any latest code for the above project ?
@kevennguyen3507 11 месяцев назад
How can I combine the RetrievalQAWithSourcesChain from your other tutorial into these codes. Basically, I want to provide the references which will return the page number or numbers, within the PDF document, that the answer is found. Please help.
@maxpiau4004 Год назад
Thanks, this was my this afternoon to do.
@PODIK Год назад ⁺¹
When I try to query it gives me an error "This model's maximum context length is 4097 tokens, however you requested 4372 tokens (4116 in your prompt; 256 for the completion). Please reduce your prompt; or completion length."
This may be because I can't seem to get a gpt model that supports large proms. Honestly, it won't let me specify any model at all. Although I put it in the right place according to the timecode (8:59). When trying to specify the model, it gives the error "File "", line 1
chain = load_qa_chain(OpenAI(gpt-4-32k-0314), chain_type="stuff")
^
SyntaxError: invalid decimal literal"
@engineerprompt Год назад
If you have access to the 32k tokens model then change it like this.
chain = load_qa_chain(OpenAI(model_name='gpt-4-32k-0314'), chain_type="stuff")
This will work. However, if you are using the default model, then as the error message is showing you are providing more tokens than what the model supports. In this case you want to reduce the number of documents that are being return by the similarity search results. Pass a value of k (by defult its set to 4). I will recommend starting with 3 and if it still doesn't resolve the issue, go even lower.
docsearch.similarity_search(query, k=4)
Hope this helps.
@taznainfathima Год назад ⁺¹
How do u load multiple pdfs in LangChain ?
@jejejejeq 8 месяцев назад
The cost question was incorrect tho. It says they got GPU's for 800$ and failed trainings for about 500$ using the OpenAI API, then they say the full training could be done with 100$ renting a gpu. :/
@ДаниилКиселев-с6о Год назад
Thank you!
@prazyraj1735 5 месяцев назад
I have this use-case where there are different types of documents. I can parse documents using document loaders using langchain. But, there are images also in these documents. I want to store them as metadata and if answer generated from a context chunk it show the image also. Please help.
@GladisPL Год назад ⁺¹
Is exact openai model configured implicitly? I'm wondering how to know which model we use based on the pricing section listed on openai page. You use embeddings. Shoud it be that one then - Embeddings - Ada? Would be nice to see video about calculating prices based on various factors (so that we can plan costs acording to the requirements).
@engineerprompt Год назад ⁺¹
that's a good point, will add those details in another video for sure. You can pass the model to OpenAI function (there is a model parameter). Thanks for the suggestion.
@devsensei9 Год назад ⁺¹
It uses text davinci
@shinycaroline3722 Год назад
I am passing the entire document and able to retrieve all the details I need in a single prompt. But response time goes higher. Vice versa if I go with multiple prompts response time is less but since I need to pass the input document everytime usage of token goes high. I am building an application in drf and I don't need any user interface for this. Just need to hit the openAI once to get relevant results from the document and send as json response. Any solutions?
@M-ABDULLAH-AZIZ Год назад
having data in a file and real time embeddings vs embeddings in a db for chatbot for an application (provides information about an application)?
@TZTang-o4f Год назад
Nice work! If i want to process multiple. Can we do this by adding more inputs?
@DavidG2P Год назад
How does this compare to simply asking BingBot in the Edge Browser's sidebar about a currently displayed PDF document?
@rolandowise Год назад
Thanks so much, this was very helpful! You mentioned doing a version that can take in multiple files within a folder, what are the changes required? Will the embeddings retain a correlation to the rest of their respective file (e.g. if i ask who are the authors of a particular quote somewhere in the middle of a paper, how will it know that it relates to the names right at the beginning if there are multiple different papers embedded?)
@MVergaraQ Год назад
Man I love your tutorials! Do you have any advice on converting scanned pdfs to text for this same application? what are tools you'd recommend?
@billk6512 Год назад
Thank you!. Fantastic stuff.
@EdwardSantoro 5 месяцев назад
I need an App that can read multiple files to answer questions in another uploaded file. Any suggestions?
@sm4849 Год назад
Brilliant tutorial mate
@wernershintaku6104 Год назад
Very good and clear.
@JamesBrooksco Год назад ⁺¹
Could we do this with a folder of txt documents. I’m thinking of querying a Zettlekasten created in an app like Obsidian
@engineerprompt Год назад ⁺¹
Yes, there is a loader for text files in langchain
@italoaguiar Год назад
Excellent!! 🎉
@dealersagent Год назад
Very good video. Thank you
@cybersamurai99 Год назад
Magnificent!!
@Woldekidan Год назад
I have now an idea how chatGPT is trained with large data and be able to retrieve a response for your query within seconds. Thank you!
@MajorBuzzKill Год назад
I used a research paper as input pdf and i want it to create a 1500 word summary but it cuts off at 200 something words, also where you specified i cant input any models. ( 8:59 )
@PhilipOwusu 11 месяцев назад
Can images in a PDF be interpreted and described using a similar method as text?
@h-s7218 5 месяцев назад
how can I save the vector database in a physical one, not in memory ?
@lokash Год назад
Thank you. Very interesting
@thecutestcat897 Год назад
Thanks, this helps me a lot!
@tebitellechea 6 месяцев назад
Thanks for the very well detailed tutorial. I'm working with a large pdf (10mb, 580 pages) and I have this error message when running docsearch = FAISS.from_texts(texts, embeddings): RateLimitError: Error code: 429 - {'error': {'message': 'You exceeded your current quota, please check your plan and billing details.
@engineerprompt 6 месяцев назад ⁺¹
That means you don't have enough money in your openai account. You need to look at your billing page
@clemenswager4000 Год назад
Can you make an update video on the project? it kinda blew up and I am really interested in how it is going 😊
@engineerprompt Год назад
Yes, updates coming soon!
@xevenau Год назад
thank you! Is there a way to adjust the token size of the output? i would like to add more context to the output. also on minute 9 you mention changing ai model. how exactly do i do that?
@kaini8635 Год назад ⁺¹
thanks for the video, just wonder how do you do extraction if the pdf page contains mixed text and image/chart
@engineerprompt Год назад ⁺²
The file I tested has table and images but this will ignore them. It can only do text based info retrieval.
@jonnythrive Год назад ⁺²
Thanks for the videos! They've helped understand a lot about GPT stuff. But how do I change the language model?
@iamjustahair1315 Год назад
i guess you can change the api key
@AmitKailashchandraGupta 11 месяцев назад
Hi Prompt Engineering,
can we implement the same logic with our custom model, ( without taking any help from OpenAI)?
waiting to here from your side....
@engineerprompt 11 месяцев назад ⁺¹
Yes
@asepmulyana9085 Год назад
Thanks for your video! How can I change the PDF file using URL instead of google drive?
@maamardli Год назад
Great tutorial! thank you very much!
@albertocambronero1326 Год назад
what is the token limit on this? can it read 1000 pages PDFs and answer questions accuaretly?
@smart-sg5cs Год назад
hey the way u explain seems extremely simple to implement can we use PDF gpt for commercial use
@engineerprompt Год назад
Yes, there are actual products out there using the EXACT same approach :)
@alxx736 Год назад
Hi! When i ask things not related to the documents,alwats returns informations . Information not inside my context

Следующие

Автовоспроизведение

Talk to YOUR DATA without OpenAI APIs: LangChain