How to Build an AI Document Chatbot in 10 Minutes

Dave Ebbelaar

Просмотров 323 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 25 май 2023
🚀 Kick-start your freelance career in data: www.datalumina.io/data-freela...
Easily Build LLMs Apps - In this video, we are going to explore Flowise, an open-source UI visual tool to build your customized LLM flow using LangchainJS, written in Node Typescript/Javascript.
🔗 Links
flowiseai.com/
github.com/FlowiseAI/Flowise
github.com/daveebbelaar/langc...
www.docker.com/
docs.npmjs.com/downloading-an...
👋🏻 About Me
Hey there, my name is @daveebbelaar and I work as a freelance data scientist and run a company called Datalumina. You've stumbled upon my RUclips channel, where I give away all my secrets when it comes to working with data. I'm not here to sell you any data course - everything you need is right here on RUclips. Making videos is my passion, and I've been doing it for 18 years.
While I don't sell any data courses, I do offer a coaching program for data professionals looking to start their own freelance business. If that sounds like you, head over to www.datalumina.io/ to learn more about working with me and kick-starting your freelance career.
Наука

Комментарии • 250

@daveebbelaar 10 месяцев назад ⁺²
👉🏻Join my FREE community if you want to learn more about AI: bit.ly/data-alchemy
@Taskade 9 месяцев назад
Awesome stuff, Dave! Loved how you delved into Flowise and LangchainJS. Hoping to see some comparisons on AI Agents, especially long-running and autonomous ones in Langchain. Keep it up! 🚀
@SmartWizzard 6 месяцев назад
Can I integrate this into Flutter APP as a chat FAQ?
@Taskade 6 месяцев назад
Feel free to reach out any time! @@SmartWizzard
@srh1034 4 месяца назад
Any solution to build a CPU-based AI Document Chatbot which utilizes CPU and does not need API ?
@Hisma01 Год назад ⁺⁶
great content. Reminds me of the same concept as node-red, but specifically built for AI apps. I think this can go a lot further than just a prototyping tool, seeing as how far node-red has come from where it started. Really neat.
@user-jg4ci4mf8w Год назад ⁺¹
This is the best review about Flowise on RUclips!
@f1l4nn1m 11 месяцев назад ⁺²⁶
Hi Dave, thanks for sharing. Flowise is pretty cool as a rapid-prototyping tool, you can visually chain complex objects in just a matter of minutes. I found some cons though, which I think the devs should address.
- Debugging is tough. Each API-based component should have a test button (does the key work? is the deployment name correct?). Those aren't things you want to guess from pieces of text in the chat box.
- Coding. It's nice to have a visual interface, but at the end of the day this is just chaining function calls. It would be nice to have the Python code corresponding to a specific saved flow. So that one can customize it.
I wonder if they are interested in getting precise feedback from users to improve and prioritize features. I'd be really kin to help them build a great product.
@ZachHyman_SIA 10 месяцев назад ⁺¹
Agreed. "Error: Request failed with status code 401" in the chatbox dialog isn't particularly revealing...
@Esehe 10 месяцев назад
Thanks for the videos Dave! Keep them comming. Also noticed you doubled your subscribers since you made the video, congrats. Your conversational agent snitched :)
@camilocampos5900 Год назад
You are a great friend, I really like the information and knowledge you give us.
@gabrieljauregui654 Год назад
I was looking for this in your channel yesterday. Todays youtube algorithm rewarded me with this suggestion haha. Thanks for this valuable content. Greetings from Argentina! New suscriber =)
@daveebbelaar Год назад ⁺¹
That's awesome haha - thanks!
@WylieWasp Год назад ⁺¹
Hi Dave thank you so much for another awesome video I have a use case for this is just come at the right time I have some online documentation which I need to convert into a chatbot!
@daveebbelaar Год назад ⁺¹
Perfect 🙌🏻
@vadimzubakhin1589 7 месяцев назад
This is so impressive. Thank you.
@m.ehtizan Год назад ⁺¹
Hey Dave, I just watched your video and I must say that it was really informative, very useful and well-made. I loved your videos. I was wondering if I could help you edit your videos and also make highly engaging shorts for you?
@WeirdoRocks Год назад
Great video :) Keep up the great work.
@daveebbelaar Год назад
Thanks 🙏🏻
@abagatelle 10 месяцев назад ⁺⁷
Really good video. For beginners a video on how to use Git etc would help just to get grounded in the concepts of how to use VS code, download repositories and so forth would really help. Thanks for all your efforts, much appreciated.
@gr8tbigtreehugger 9 месяцев назад
Very cool! Many thanks!
@MattGadi 3 месяца назад ⁺²
Hi, many thanks for the video. How did you prepare the txt input file? Do you have a video or link to understand the contend and how to prepare it?
Thanks
@vishalnangare31 Год назад
Great information, here from Nash's channel 🙌🙏
@daveebbelaar Год назад
Thanks man!! 🙏🏻
@dsgeyer Год назад ⁺¹
Love it. Thanks for making these videos. This seems great to slap some components together then go in and have a lot of the code roughed in so it could be worked on from there. Is that an accurate assumption?
@daveebbelaar Год назад
Excatly
@khushpatel7060 Год назад ⁺¹
Hello Dave, I am asking here as this is your latest video, I have seen your videos regarding Langchain, I am curious how to optimize inference while using LLMs in Langchain using TensorRT or Onnx Runtime... As in Industries, It's very obvious, to save the time as well as computation cost... In TensorRT while using open source models, we have techniques like quantization and a few more more for that... So In Langchain is there any way to do this?
@hgwvandam Год назад
Good video Dave, thanks!
@daveebbelaar Год назад
Thanks Hans!
@nilsbuer 4 месяца назад
Very cool video. Content and video quality
@kennethochiabuto8516 11 месяцев назад ⁺³
Great video thanks. As a no coder myself I have been looking at the best ways to utilise chatbots for my clients. flowise still requires a number of steps and some low code know how, but now we are seeing chatbot solutions requiring no technical ability ie no code become available, Orimon, chatbase to name but a few. Maybe not a flexible as flowise in terms of functionality. Your thoughts?? BTW Subscribed.
@overrideguilarte 11 месяцев назад ⁺²
Hello, how are you?, very good video. I ask you a question, how did you create the embedded chat, what did you use to generate it?
@TF-qt3jh Год назад ⁺²⁸
❤ I love all of your videos, and especially this one. As a no coder entrepreneur, I really appreciate this kind of step by step tutorials. Many RUclipsrs are making similar ones, but yours exceeds. Only two points. To your surprise, one of the hardest part of this kind of tutorials is, setting up the apps. The process is no brainer for coders and data scientist, but not for no coders. Two. I always wanted to see a tutorial covering end to end process, meaning designing GUI to (e.g., Figma) to connecting to algo (e.g., Flowise) so that I can build a completely functioning app or MVP. I’d appreciate if you could do this type of video. 🎉
@abagatelle 10 месяцев назад ⁺¹
Yes! setting it all up for we beginners is tricky. It takes a while to get familiar with Git etc
@GraceyGarret 10 месяцев назад ⁺¹
Yes, I agree. I also looking on it
@jozonas 5 месяцев назад
+1 For many videos I can't even go pass the point of the setup because they go so fast over it thinking we know about it, but we don't. Guess what, the people who get that, are not here trying to watch the video tutorial. Come on guys.
@nickrobinson7096 9 месяцев назад
How does it deal with the structure of a PDF? For example, if I have a Table with CAS numbers in the format 1234-56-7. Usually, I can use Regex to extract these patterns however what if the text is contained in a table column and broken by a carriage return or just generally by the structure of the document?
e.g.
123-
45
-6
In a single column.
It seems that the text file generated will not keep this text together. This is a frequent issue I run into when extracting data.
@ichbingenug3565 10 месяцев назад ⁺³
As a non-coder, I am flabbergasted. This looks so easy! Thanks for sharing this. How can I turn this into a function Q&A widget on my webpage? 😳
@sanderschat 3 месяца назад
ask ChatGPT
@inwardfarts3263 Год назад ⁺²⁵
Can you make a video talking about the cost involved with building AI apps with langchain and OpenAI? Are there strategies that can reduce costs? How many tokens does it take to parse company data and return insights? Great content. Best tutorials in this space that I've found. Thank you and keep up the excellent work!
@gkennedy_aiforsocialbenefit 11 месяцев назад
Great question and fully agree Dave is amazing
@MrGoodeats 11 месяцев назад ⁺¹
i mean those tools were made with using that AI tech as the cornerstone for. a business. so i imagine it is somewhat affordable
@pathmonkofficial 10 месяцев назад
Great content!
@Syedhussainipage 8 месяцев назад ⁺¹
Thank you for this amazing video. Would you be kind to build an app that would refer to a given website and reply based on that site's data?
@liorzioni 10 месяцев назад ⁺¹
hi tnx for the guide , i tried to bulid the doucument chatbot , i uplade txt file , when i asked "what this document about " he answer "i dont know "why its happened?
@ivanperez4878 Год назад ⁺³
I'm also on the Pinecone waiting list. Do we have any other options other than Pinecone? Thanks
@jamminrebel3614 2 месяца назад
This 👏 is 👏 freaking 👏 awesome. 👏
love you.
@Earthvssuna 11 месяцев назад ⁺¹
Is it safe in terms of the companies data or will another model use the data to train? For exmaple over the open ai API?
@nvwls 7 месяцев назад
For anyone setting this up now and experiencing a "TypeError" when they send messages in the chatbot- removing the document module and adding it again worked for me.
@marcuskirsch4128 3 месяца назад ⁺⁵
You know your stuff. But I am struggling to follow this to replicate its a lot of detail.
@ryanbrown8387 10 месяцев назад ⁺¹
Are you able to change the language that the chatbot speaks in on flowise?
@Jake_McAllister 10 месяцев назад ⁺⁴⁷
Hi Dave, my problem is most companies won’t be happy uploading internal documents to a external server. Do you think you could make a video where a chat bot is created but the company can keep there internal documents on their end 🔥 🙏
@OpenMachines_ 10 месяцев назад ⁺²
I already made that before chatgpt even existed
@Jake_McAllister 10 месяцев назад ⁺²
@@OpenMachines_ what’s it called?
@jeremiahwolf783 9 месяцев назад ⁺²
Yes please, this!!! Jake and I have the same problem.
@houndogism 8 месяцев назад ⁺²
Hey, there are ways to do this using the stack explained in this video. For example, you could use the Confluence API document loader to query documents stored in your Confluence site
@blender_wiki 8 месяцев назад ⁺²
Hire an app architect if you don't know how to solve this basic problem.
@cyber1619 8 месяцев назад ⁺²
I'm getting the error TypeError: Cannot read properties of undefined (reading 'startsWith'), anyone knows a fix.
@OtherTNSEE Год назад ⁺²
Issue I'm having with these document loaders and vector stores is they get soooo slowwwww once you start loading actual data into them. At around 500 MB of pdfs and vectors, these vector stores take so long to query the whole app breaks down. Not to mention cost issues if you go down the openai api / pinecone route.
@roberto048 2 месяца назад
Hi Dave, thanks for this amazing video. I have a few question, is it possible to teach to our chat bot to ask some specific questions to make a 2 way conversation? I want to use this chatbot as a sales rep in our website, talking with our clients. Also is it possible to teach the chatbot to not saying some specific things. For example I am afraid it recommend our competitors to our clients :)) I guess this chatbot is linked to Chatgpt and uses the same dataset, so it can use its own dataset to say something and offering our clients to the competitors
@user-eb8jw2mz4f 11 месяцев назад
Hi, Can you please advise how to use the Folder Path in "Folder with Files" document loader. I've deployed the Flowise app on GCP as docker image. Thanks
@iGuide_net 3 месяца назад ⁺¹
How much do you have to pay per search to the GPT provider? One metric i saw had chatgpt-4 at 30 cents per search.
@asafnb 11 месяцев назад
Thanks for the great content! If I have big pdf files isn't it will get pricy?
@stoianandreimircea1509 Год назад
Hey there. Love the content. One comment if I may. Maybe show us in the begining a glimpse of 10 sec how the end looks like. then continue with the video😊
@daveebbelaar Год назад
Thanks for the tip!
@JohnnyLeuthard Месяц назад
Are you uploading your data to the cloud or into the docket container?
@dekumus Год назад
Thank you!
@tecion 11 месяцев назад ⁺¹
Great video! I couldn't find the video you mentioned at 6:07, would be nice if you could link to it.
Месяц назад
What would you recommend using: Stack-AI, Flowise or Langflow?
@Che_Se Год назад
once ive made my bot, how can i get the bot online so its not just accessible from my laptop using the embed function to connect it to my site but then it only works on my laptop
@samus4799 10 месяцев назад
I would like to train a large language model to evaluate scientific publications, specifically regarding research, based on its adherence to the scientific method.
@lancerkind 4 месяца назад ⁺¹
Copy/pasting the "connect credential"s is different now than how the video shows. In the Flowise UI, you can no longer copy/paste the credentials. You need to click on the Connect Credential field and select "create new" which opens a popup window. In that window, put in your username and then finally you can copy/paste the API key.
@Gra55H0pper 11 месяцев назад
Like your videos! Subscribed! Just wish you'd ditch the background music.
@TechResearch05 10 месяцев назад
I am getting an error while using flowise - ErrorWithoutStackTrace: PineconeClient: Error getting project name: SyntaxError: Unexpected token A in JSON at position 0. Tried to cross check but not sure what is going wrong. Could yo please help?
@peroforrr7663 Год назад ⁺¹
Hello, 🎉 great content! I would like to request a video on a solution for chatting on PDFs using an open-source model from Hugging Face that utilizes GPU and ensures data privacy.
@SicilianDream 8 месяцев назад ⁺¹
Hi, you are very clear, but you use and refer to so many things I have never heard of. Where can I learn what I need to understand your tutorial?
Lars, Norway 😊
@safetime100 3 месяца назад
Thanks ❤
@almahdibakkali8007 11 месяцев назад
Can you add custom prompts?
What happens when you ask unrelated questions ?
@oluwaseunakinropo6318 10 месяцев назад
Hii Great video! Please I have an issue using flowise to chain with my notion database ( I embedded Google sheet in my notion) but my bot can’t answer questions about info in my google sheet, please what am I doing wrong ?
@henrytabimagiraldo7116 Год назад ⁺¹
Thanks, this is amaizing!! do you know if there is an alternative to Pinecone since it has the waitlist in place?
@ivanperez4878 Год назад
I'm also on the Pinecone waiting list. Do we have any other options other than Pinecone? Thanks
@user-vl3mr4yg6v Год назад ⁺¹
Thanks for the video, it's very useful. Is it possible to integrate a voice assistant that receives a question as input and answers via voice, using the information in the pdfs? It would be very useful. What do you think about it?
@daveebbelaar Год назад
Yep, that's possible. Look into the speech services from Google or Microsoft.
@johanvander6193 Год назад ⁺¹
Hi Dave, question. A lot of documents I want to upload have images meaningful to the subject. Is there already a way for AI to take these into account when chatting to a document?
@aisimp 11 месяцев назад ⁺²
I think that ability is coming with a new iteration of GPT4, right now this functionality is in testing
@mrdman094081 8 месяцев назад ⁺¹
After watching several times it looks like I might need to be a developer or software writer to complete this. Are any others in this thread totally new to A.I. and have made a complete chat bot ? Does The material loaded become public ?
@trackerprince6773 11 месяцев назад ⁺¹
Awesome video. After watching I'm trying to go a step further and use pinecone with ConversationalRetrievalQA with langchan Js directly. I'm getting a response for the query, however for some reason the buffer only always stores the latest messages. The history of messages is never stored so I cant build on prev messages. Do you have any example for successful implementation?
@aisimp 11 месяцев назад
I build with flowise and I believe it was aware of the whole conversation. Maybe something to do with the setup
@laythqureshi5917 11 месяцев назад
Do Flowise have any intellectual property over your data if you make a chatbot with them and train the model yourself?
@girrajjangid4681 Год назад
Impressive ♥
@daveebbelaar Год назад
Thanks!
@poshsims4016 Год назад ⁺¹
THANK YOU!!
@daveebbelaar Год назад
You're welcome ;)
@betenu1 Год назад
I clicked on the docker folder and ran 'npx flowise start' in the terminal and it worked fine. Already made a chatbot
@givingswag897 10 месяцев назад
What are the costs to use it?
@erosr45 10 месяцев назад ⁺¹
This is the hardest tutorial I've ever seen. It's not a step by step.
@thejordankeyes 4 месяца назад
This is pretty cool. How could we use this to chat with whole databases full of records instead of just a single file at a time?
@xugefu 3 месяца назад
Thanks!
@daveebbelaar 3 месяца назад
Appreciate that!! 🙏🏻
@str2254 Год назад
This is way too good wtf
@daveebbelaar Год назад
🙏🏻
@techitint.9100 Год назад ⁺⁷
Hi Dave, thanks so much for this very good video. One question. just wondering about privacy and if information written in our documents are passing to OpenAI while using its APIs. Good part of data are also going to a third party on cloud service such as Pinecone. How can we ensure the privacy of our personal data, while using Flowise or similar services that make use of external API?
@AdrianGiurgiu-jt3tl Год назад
Curious too
@stephensuprios1896 8 месяцев назад
Nice and extremely useful video Dave… how can I use a website as the source?
This allows for updates to immediately be available and we don’t have multiple locations of our data items.
@stephensuprios1896 8 месяцев назад
I have particular websites Not wanting the whole web. It will use these particular websites as it’s data source for conversion with memory. Help? And thanks again.
@AbdelrahmanEmad-mm2xs 7 месяцев назад
thanks for this great tutorial, am frontend developer so how can i build SAAS on this?
@Armin_Hemati 20 дней назад
tnx for info.... can u help me understand the differencebetween there two Artificial Intelligence & artificial intelligence solution !!??
cuz i want to choose a university bachelor's degree???
and what do you think i should learn?? ML /// deep learning/// python //data analyst //data science...... ???? TNX
@somethingmystical567 11 месяцев назад
it keeps saying this for me: * error decoding 'ports': No port specified: : how do i fix that?
@DanielWamara 4 дня назад
Can one link this to a local LLM like Ollama to reduce the costs?
@kobvel 2 месяца назад
What about the data privacy? Where does your document go?
@jeroenvanharten6575 8 месяцев назад
I tried your exact workflow on windows and on mac, but never been able to get it going. On mac all i got was 'Error: Error: Error: ENOENT: no such file or directory, open'
@ew3995 Год назад
i wonder if you could get flowwise to show its generated python code
@MultiMam12345 Год назад
Its actual 16 minutes. For starters probably 45 min. Still amazing.
@yacahumax1431 10 месяцев назад
Any idea how to make this work with spanish? I uploaded a Q & A text file but the bot only answers in english
@Insipidityy 10 месяцев назад
What VSCode theme are you using? Looks great!
@daveebbelaar 10 месяцев назад ⁺¹
Atom One Dark 🔥
@Love-lf9ny 10 месяцев назад
Is docker free, and will the server be on while my computer is turnt off?
@dozieweon 8 месяцев назад
Nob question: How is this different from building it on Botpress?
@gabriel-gr 4 месяца назад
Very cool, thanks!
I tried to do the same with some (large-ish) PDF's but I'm getting `InsufficientQuotaError: 429` from OpenAI ;-(
@diegosandoval7462 Год назад ⁺¹
Great work! Do you know whether there is a limit to how big the PDF file can be? ie. could it be a pdf book? thanks for your help
@markryan2475 Год назад
I tried a PDF book that is over 2M, about 300 pages, and it worked fine.
@diegosandoval7462 Год назад
@@markryan2475 wow, thanks. i'm trying to understand how that affects costs. the bigger the pdf, the more the OpenAI api would charge me? or does the OpenAi only charge by the size of the response? 'appreciate the insight
@daveebbelaar Год назад ⁺¹
The PDFs are split up into chunks of for example, 1000 tokens. So it does not affect your OpenAI API costs. This video gives a more in-depth explanation of how this all works under the hood: ruclips.net/video/NYSWn1ipbgg/видео.html
@diegosandoval7462 Год назад
@@daveebbelaar /thanks so much
@diegosandoval7462 11 месяцев назад
@@daveebbelaar Thanks a lot!
@ricochetism Год назад
any way to get around the Azure OpenAI key requirement
@phuongnguyen5443 8 месяцев назад
I get the following error message when I try to chat to bot: Error: Error: Error: ENOENT: no such file or directory, open ''
Is there any idea why?
@ahmadsaddin792 Год назад
i am facing this error:
failed to register layer: error creating overlay mount to /var/lib/docker/overlay2/9555cbe6b3c5967b87e6f4087726e8ee62a5892e9779da373312c1e63e4a8b3d/merged: too many levels of symbolic links
can you please help me
@senorchavo5572 2 месяца назад
I need troubleshooting assistance. I csn get everything connected...independently openai will respond. When i link pinecone...it will receive the vectors from the pdf when connected...but he dumb ass assistant when asked about the document says hmmm i dont know! 😅 is ir the splitter overlap...its a larger pdf...i do legal work so its case law in trying to load and ask questions
@mohsenghafari7652 3 месяца назад
hi. please help me. how to create custom model from many pdfs in Persian language? tank you.
@AIEntusiast_ 2 месяца назад
multiple pdf, docs at same time ?
@naturallydope247 Год назад
Thanks for showing this. Can you videos on how to deploy using other things besides Azure?
@daveebbelaar Год назад
Maybe in future videos, but I am an Azure guy ;)
@toppu10animes 11 месяцев назад
I'm getting this error everytime: ReferenceError: Blob is not defined
@10points56 Год назад
can we add system message in query
@kittentheorangetabby9676 10 месяцев назад
TypeError: Cannot read properties of undefined (reading 'startsWith')
@explodingstardust Год назад
Can show it with chromadb as its local and free for most people? Thanks for great work.
@cycla Год назад
This is like selling the tools to make the shovels in a gold rush
@DS-ig1bf 11 месяцев назад
Great content. question! I tried it and it seems answers are restricted to the content of the doc and can’t go out of context. Is there any way around that so it can respond like the normal ChatGPT and also refer to the doc?
@aisimp 11 месяцев назад
So I made a chatbot that used Langchain for doc retrieval and regular gpt3.5 for chatcompletion, this how you preserve the functionality and have doc information as well
@willpeeler8547 9 месяцев назад
I keep getting the following error: anyone know how to solve?
PineconeError: Failed getting project name. Error: Forbidden - Invalid API key
@davedumas0 10 месяцев назад
i have an idea for a video series for you
"gpt-4-code interpreter vs gpt-4 with combo of plugins"

Следующие

Автовоспроизведение

How I'd Learn AI in 2024 (if I could start over)