Chat with an image | LangChain custom tools tutorial | Python Streamlit | Computer vision

Computer vision engineer

Просмотров 11 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 7 июл 2024
Code: github.com/computervisioneng/...
0:00 Intro
0:54 Start
2:29 Project overview
8:13 Main process
12:38 Auxiliary functions
27:06 LangChain custom tools
35:58 Create agent
51:32 Demo
52:40 Let's have some fun!
54:22 Outro
#computervision #python #webapp #computervisiontutorial #langchain #streamlit

Комментарии • 51

@ComputerVisionEngineer 10 месяцев назад ⁺¹
Did you enjoy this video? Try my premium courses! 😃🙌😊
● End-To-End Computer Vision: Build and Deploy a Video Summarization API bit.ly/3tyQX0M
● Hands-On Computer Vision in the Cloud: Building an AWS-based Real Time Number Plate Recognition System bit.ly/3RXrE1Y
● Machine Learning Entrepreneur: How to start your entrepreneurial journey as a freelancer and content creator bit.ly/4bFLeaC
All my premium courses are available to the Computer Vision Experts in my Patreon. 😉
www.patreon.com/ComputerVisionEngineer
@python360 Год назад ⁺²
Very cool project, video deserves many many views! Subscribed.👍
@ComputerVisionEngineer Год назад
Thank you! Glad you enjoyed it! 😃💪
@WelcomeToMyLife888 Год назад
awesome, your hard work is much appreciated!
@ComputerVisionEngineer Год назад
😃 Thank you! Glad you enjoyed the video! 🙌
@ingluissantana Год назад ⁺²
Looks like a great project!! Gracias!!
@ComputerVisionEngineer Год назад ⁺¹
Yeah it is a great project to get more familiar with LangChain! 😃🦾 De nada!
@user-wt3fh6kq5e 11 месяцев назад
it's great,make the knowledge easier and more interesting,thank you very much.
@ComputerVisionEngineer 11 месяцев назад
You are welcome! Glad you enjoyed it. 🙌
@ishujain9343 10 месяцев назад
Awesome work Sir 💯
@ComputerVisionEngineer 10 месяцев назад
Thank you! 😃
@sreekartammana Год назад ⁺¹
Waooo, this looks awesome💥
@ComputerVisionEngineer Год назад ⁺¹
😃 It is a very cool project!! Glad you enjoyed it, Sreekar! 🙌
@AlessandroTogni Год назад ⁺¹
It would be awesome to add a functionality that allows users to edit the uploaded image via prompt. "Cut the [detected object]", "Change the lightning from day to night", etc
@ComputerVisionEngineer Год назад ⁺²
Yeah, it would be awesome to add additional image processing functionalities! I will continue improving this project in future tutorials. 💪💪
@hiteshram7421 Год назад ⁺¹
Great Project😃😃
@ComputerVisionEngineer Год назад ⁺²
Yeah I enjoyed it a lot! 😃💪
@tancheeken 11 месяцев назад
Hi, thank you for the awesome video. I do have a question. I understand that the concept of building an agent with Langchain. In your example, the function returns the caption of an image. The caption is a man on a horse with a dog. If I were to query the color of the dog, it would not be able to re-process the image and focus solely on the dog, right? If so, what's the purpose of using Langchain agent and LLM? Wouldn't it be better to image-to-caption and store the caption to normal database? Or am I missing something here?
@ComputerVisionEngineer 11 месяцев назад ⁺¹
Hey, this tutorial is an example regarding how to use LangChain in a computer vision project. I agree we could continue working on this project and add more features to it. 💪🙌
@RudraGoswami-wz8hv 3 месяца назад
How to fine tune the model on custom data? As it is specific to the training dataset associated with Open AI
@rishabhinc2936 Год назад ⁺¹
I have a question ?
What if we dont create object detection and image caption tool?? Will it still answer our query related to object detection and captioning ???
@ComputerVisionEngineer Год назад ⁺¹
Nop, without those tools it won't answer any query related to object detection and captioning.
@adnanahmad4168 Год назад ⁺¹
Bro please please create a video on how to deploy Yolov8 trained custom model on Nano Jetson.
I have submission of my Final Year Project within a week and I can't find help regarding that. Please do it.
@connectrRomania Год назад ⁺²
include installation of transformers in your requirements from the main hugging repo
@ComputerVisionEngineer Год назад ⁺²
Oh, I missed it! Thank you for the heads up! I will update the requirements file shortly. 🙌
@santhosh1387 Год назад ⁺¹
sir , you have put that lane crossing detection video in private can you please put it back sir , my work depends on it and that video has helped me a lot , thank you
@ComputerVisionEngineer Год назад
I am preparing a more recent version of the lane crossing detector 😃. It will be available soon! 🙌💪
@VAIBHAVCHAUDHARY-fl1hn 8 месяцев назад
Can we do this by using yolov8 and using our custom dataset and asking about that product ?
@ComputerVisionEngineer 8 месяцев назад
Do you mean creating a custom tool to perform object detection with yolov8? Yes, it is possible. 🙌
@mrsmile9399 10 месяцев назад
Love your videos
Is there an alternative to chatgpt that you recommend from huggingface?
@ComputerVisionEngineer 10 месяцев назад
Take a look at HuggingChat. 😃🙌
@dhrroovv Месяц назад
when I am trying to publish the repo to github so that we can host it on streamlit cloud, its giving error due to use of OpenAI secret key directly in the code. how can we fix that?
@dhrroovv Месяц назад
okay i fixed that error by using environment variable and not directly exposing the openai secret key to github but now its giving error that I have reached the quota for openai api key, even though i have not used it at all.
@ishasingh1039 5 месяцев назад
Hello , i followed this tutorial and made the app. But for whatever question i ask , its response is caption ( except for object detection) . Like when I asked it to generate story , it returned the caption of the image . I have done everything as same as shown here
@ComputerVisionEngineer 4 месяца назад ⁺¹
Thank you for your feedback, I will try to test it soon and I will update the code if needed.
@rohitpillai2307 4 месяца назад
I have faced the same issue@@ComputerVisionEngineer
@allies4183 Год назад
Which model have you used?
@ComputerVisionEngineer Год назад ⁺¹
I am using the model facebook/detr-resnet-50 for object detection and Salesforce/blip-image-captioning-large for image description. 💪
@tnztanim1623 7 месяцев назад
Can we do this with free hugging Face model instead of Open Ai? Open Ai requires subscription.
@ComputerVisionEngineer 7 месяцев назад
Yes, we could use a free alternative, instead of openai. 🙌
@tnztanim1623 7 месяцев назад
@@ComputerVisionEngineer will you plz implement something instead of openai? I really liked the project and tried my best but couldn't make the project run with a free model.
@rohitchan007 Год назад
I'm getting this error. PermissionError: [Errno 13] Permission denied: 'D:\\Courses\\Computer_vision_engineer\\Ask_image_question\\tmp3chlmrdj'
I have granted read write permission to the folder. Need help
@ComputerVisionEngineer Год назад
What is your OS?
@rohitchan007 Год назад
@@ComputerVisionEngineer Windows
@rohitchan007 Год назад
@@ComputerVisionEngineer I fixed it. it was due to NameTemporaryFile() issue. Replaced with
with open("temp.jpg", "w+b") as f:
@ComputerVisionEngineer Год назад ⁺¹
cool, thanks for the update!
@rohitchan007 Год назад
@@ComputerVisionEngineer no problem.
I really love the projects you're working on and sharing the knowledge

Следующие

Автовоспроизведение

Automatic number plate recognition with Python, Yolov8 and EasyOCR | Computer vision tutorial