Chat with an image | LangChain custom tools tutorial | Python Streamlit | Computer vision

Поделиться
HTML-код
  • Опубликовано: 7 июл 2024
  • Code: github.com/computervisioneng/...
    0:00 Intro
    0:54 Start
    2:29 Project overview
    8:13 Main process
    12:38 Auxiliary functions
    27:06 LangChain custom tools
    35:58 Create agent
    51:32 Demo
    52:40 Let's have some fun!
    54:22 Outro
    #computervision #python #webapp #computervisiontutorial #langchain #streamlit

Комментарии • 51

  • @ComputerVisionEngineer
    @ComputerVisionEngineer  10 месяцев назад +1

    Did you enjoy this video? Try my premium courses! 😃🙌😊
    ● End-To-End Computer Vision: Build and Deploy a Video Summarization API bit.ly/3tyQX0M
    ● Hands-On Computer Vision in the Cloud: Building an AWS-based Real Time Number Plate Recognition System bit.ly/3RXrE1Y
    ● Machine Learning Entrepreneur: How to start your entrepreneurial journey as a freelancer and content creator bit.ly/4bFLeaC
    All my premium courses are available to the Computer Vision Experts in my Patreon. 😉
    www.patreon.com/ComputerVisionEngineer

  • @python360
    @python360 Год назад +2

    Very cool project, video deserves many many views! Subscribed.👍

  • @WelcomeToMyLife888
    @WelcomeToMyLife888 Год назад

    awesome, your hard work is much appreciated!

  • @ingluissantana
    @ingluissantana Год назад +2

    Looks like a great project!! Gracias!!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад +1

      Yeah it is a great project to get more familiar with LangChain! 😃🦾 De nada!

  • @user-wt3fh6kq5e
    @user-wt3fh6kq5e 11 месяцев назад

    it's great,make the knowledge easier and more interesting,thank you very much.

  • @ishujain9343
    @ishujain9343 10 месяцев назад

    Awesome work Sir 💯

  • @sreekartammana
    @sreekartammana Год назад +1

    Waooo, this looks awesome💥

  • @AlessandroTogni
    @AlessandroTogni Год назад +1

    It would be awesome to add a functionality that allows users to edit the uploaded image via prompt. "Cut the [detected object]", "Change the lightning from day to night", etc

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад +2

      Yeah, it would be awesome to add additional image processing functionalities! I will continue improving this project in future tutorials. 💪💪

  • @hiteshram7421
    @hiteshram7421 Год назад +1

    Great Project😃😃

  • @tancheeken
    @tancheeken 11 месяцев назад

    Hi, thank you for the awesome video. I do have a question. I understand that the concept of building an agent with Langchain. In your example, the function returns the caption of an image. The caption is a man on a horse with a dog. If I were to query the color of the dog, it would not be able to re-process the image and focus solely on the dog, right? If so, what's the purpose of using Langchain agent and LLM? Wouldn't it be better to image-to-caption and store the caption to normal database? Or am I missing something here?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  11 месяцев назад +1

      Hey, this tutorial is an example regarding how to use LangChain in a computer vision project. I agree we could continue working on this project and add more features to it. 💪🙌

  • @RudraGoswami-wz8hv
    @RudraGoswami-wz8hv 3 месяца назад

    How to fine tune the model on custom data? As it is specific to the training dataset associated with Open AI

  • @rishabhinc2936
    @rishabhinc2936 Год назад +1

    I have a question ?
    What if we dont create object detection and image caption tool?? Will it still answer our query related to object detection and captioning ???

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад +1

      Nop, without those tools it won't answer any query related to object detection and captioning.

  • @adnanahmad4168
    @adnanahmad4168 Год назад +1

    Bro please please create a video on how to deploy Yolov8 trained custom model on Nano Jetson.
    I have submission of my Final Year Project within a week and I can't find help regarding that. Please do it.

  • @connectrRomania
    @connectrRomania Год назад +2

    include installation of transformers in your requirements from the main hugging repo

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад +2

      Oh, I missed it! Thank you for the heads up! I will update the requirements file shortly. 🙌

  • @santhosh1387
    @santhosh1387 Год назад +1

    sir , you have put that lane crossing detection video in private can you please put it back sir , my work depends on it and that video has helped me a lot , thank you

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад

      I am preparing a more recent version of the lane crossing detector 😃. It will be available soon! 🙌💪

  • @VAIBHAVCHAUDHARY-fl1hn
    @VAIBHAVCHAUDHARY-fl1hn 8 месяцев назад

    Can we do this by using yolov8 and using our custom dataset and asking about that product ?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  8 месяцев назад

      Do you mean creating a custom tool to perform object detection with yolov8? Yes, it is possible. 🙌

  • @mrsmile9399
    @mrsmile9399 10 месяцев назад

    Love your videos
    Is there an alternative to chatgpt that you recommend from huggingface?

  • @dhrroovv
    @dhrroovv Месяц назад

    when I am trying to publish the repo to github so that we can host it on streamlit cloud, its giving error due to use of OpenAI secret key directly in the code. how can we fix that?

    • @dhrroovv
      @dhrroovv Месяц назад

      okay i fixed that error by using environment variable and not directly exposing the openai secret key to github but now its giving error that I have reached the quota for openai api key, even though i have not used it at all.

  • @ishasingh1039
    @ishasingh1039 5 месяцев назад

    Hello , i followed this tutorial and made the app. But for whatever question i ask , its response is caption ( except for object detection) . Like when I asked it to generate story , it returned the caption of the image . I have done everything as same as shown here

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  4 месяца назад +1

      Thank you for your feedback, I will try to test it soon and I will update the code if needed.

    • @rohitpillai2307
      @rohitpillai2307 4 месяца назад

      I have faced the same issue@@ComputerVisionEngineer

  • @allies4183
    @allies4183 Год назад

    Which model have you used?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад +1

      I am using the model facebook/detr-resnet-50 for object detection and Salesforce/blip-image-captioning-large for image description. 💪

  • @tnztanim1623
    @tnztanim1623 7 месяцев назад

    Can we do this with free hugging Face model instead of Open Ai? Open Ai requires subscription.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  7 месяцев назад

      Yes, we could use a free alternative, instead of openai. 🙌

    • @tnztanim1623
      @tnztanim1623 7 месяцев назад

      ​@@ComputerVisionEngineer will you plz implement something instead of openai? I really liked the project and tried my best but couldn't make the project run with a free model.

  • @rohitchan007
    @rohitchan007 Год назад

    I'm getting this error. PermissionError: [Errno 13] Permission denied: 'D:\\Courses\\Computer_vision_engineer\\Ask_image_question\\tmp3chlmrdj'
    I have granted read write permission to the folder. Need help

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад

      What is your OS?

    • @rohitchan007
      @rohitchan007 Год назад

      @@ComputerVisionEngineer Windows

    • @rohitchan007
      @rohitchan007 Год назад

      @@ComputerVisionEngineer I fixed it. it was due to NameTemporaryFile() issue. Replaced with
      with open("temp.jpg", "w+b") as f:

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Год назад +1

      cool, thanks for the update!

    • @rohitchan007
      @rohitchan007 Год назад

      @@ComputerVisionEngineer no problem.
      I really love the projects you're working on and sharing the knowledge