YOLO-World - Real-Time, Zero-Shot Object Detection

  • Published: 25 Nov 2024

Comments • 43

  • @TheJAM_Sr
    @TheJAM_Sr 7 months ago

    I just wanted to say I found your channel this week and really appreciate your classes. I won’t even say they are tutorials because I can take what I learn and easily apply them to my project.

  • @learn_with_gaddal
    @learn_with_gaddal 8 months ago

    Awesome, thank you so much for sharing this information.

  • @arnavthakur5409
    @arnavthakur5409 9 months ago

    Ma'am your work is really incredible

  • @p.logesharavind3528
    @p.logesharavind3528 9 months ago

    This is really cool and interesting!

  • @soravsingla8782
    @soravsingla8782 8 months ago

    Awesome

  • @rickyS-D76
    @rickyS-D76 5 months ago

    Thanks. Do you have a detailed video on video object detection with labels and confidence scores, or any other resource that could be helpful? Thank you.

  • @harshays2873
    @harshays2873 6 months ago

    Please make a video on training this model on custom data.

  • @aneerimmco
    @aneerimmco 5 months ago

    Informative, thank you.

  • @ezequieligomez2135
    @ezequieligomez2135 6 months ago

    Is this pre-trained on O365+GoldG or on the COCO dataset?
    How would I specifically get the one pre-trained on O365+GoldG?

  • @Sunil-ez1hx
    @Sunil-ez1hx 9 months ago

    Amazing video

  • @ShittheswaranSelvakumar
    @ShittheswaranSelvakumar 8 months ago

    Nice explanation, ma'am. Thank you :)

  • @jeffg4686
    @jeffg4686 8 months ago

    Oh nice. How do they come up with these ridiculous names...
    Is this actually better than Grounding DINO, or just faster?
    Also, do they have safetensors?
    Do certain model types not work with safetensors, or is this their new plan to infect all the computers?

  • @pifordtechnologiespvtltd5698
    @pifordtechnologiespvtltd5698 9 months ago

    Nice

  • @himanshudnk
    @himanshudnk 9 months ago

    I'm still not clear on how YOLO-World differs from traditional YOLO models. It looks like we take a pretrained model, give it the classes we want, and it detects them. YOLOv8, for example, is trained on 80 classes, so does YOLO-World simply have more classes?

    • @CodeWithAarohi
      @CodeWithAarohi  9 months ago

      With YOLOv8, you can only detect the classes the model was trained on. If the model is trained on the COCO dataset, you can only detect the 80 classes present in COCO. If you train a custom YOLOv8 model on 5 classes, it will only detect those 5 classes.
      In YOLO-World, you can write the name of any object you want to detect, and the model will look for that object, because YOLO-World is trained on images paired with their text descriptions.
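A minimal sketch of the prompt-based workflow described in the reply above. It assumes the `ultralytics` package, its `YOLOWorld` class, and the `yolov8s-world.pt` checkpoint, none of which are named in this thread. `clean_prompts` and `detect` are hypothetical helper names used only for illustration.

```python
def clean_prompts(prompts):
    """Strip whitespace and drop empty or duplicate (case-insensitive) prompts, keeping order."""
    seen = set()
    cleaned = []
    for prompt in prompts:
        name = prompt.strip()
        key = name.lower()
        if name and key not in seen:
            seen.add(key)
            cleaned.append(name)
    return cleaned


def detect(image_path, prompts, weights="yolov8s-world.pt"):
    """Run YOLO-World on one image with free-text class names.

    Assumption: `ultralytics` is installed and the checkpoint name is valid.
    """
    # Imported lazily so clean_prompts() can be used without ultralytics.
    from ultralytics import YOLOWorld

    model = YOLOWorld(weights)
    # The class list is plain text, not indices into a fixed label set.
    model.set_classes(clean_prompts(prompts))
    results = model.predict(image_path)

    detections = []
    for box in results[0].boxes:
        label = results[0].names[int(box.cls)]
        detections.append((label, float(box.conf), box.xyxy[0].tolist()))
    return detections
```

Calling `detect("a.jpg", ["hard hat", "gloves"])` would return a list of `(label, confidence, [x1, y1, x2, y2])` tuples. How well an unusual prompt is detected depends on the model, so treat the output as a starting point.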

  • @2xback2back14
    @2xback2back14 9 months ago

    Hello, can you please demonstrate how to give custom text in "text to image generation using StackGAN"? Even after 1000 epochs my model doesn't seem to generate bird images.
    Please help me.

    • @CodeWithAarohi
      @CodeWithAarohi  9 months ago

      I will try to cover this topic when I continue with the GAN playlist.

  • @anamikamaurya22
    @anamikamaurya22 8 months ago

    My god... now programmers will become the creators of 2025

  • @iPrashantSmp
    @iPrashantSmp 9 months ago

    How can I find the list of pretrained classes in the YOLO-World model?

    • @CodeWithAarohi
      @CodeWithAarohi  9 months ago

      I am not sure, but YOLO-World is pre-trained on large-scale vision-language datasets, including Objects365, GQA, Flickr30K, and CC3M.

  • @hemachandhers
    @hemachandhers 9 months ago

    Can you make a video on fine-tuning YOLO-World on a custom dataset, ma'am?

    • @CodeWithAarohi
      @CodeWithAarohi  9 months ago

      ruclips.net/video/kl7yszVU6Tg/видео.htmlsi=WRSX79c0QmuMBrWh

    • @Satchi017
      @Satchi017 8 months ago

      @CodeWithAarohi Yes, how do I build a custom YOLO-World model for a totally new class that is not even in the large-scale vision-language datasets (Objects365, GQA, Flickr30K, and CC3M)?

    • @Satchi017
      @Satchi017 8 months ago

      Sorry ma'am, the person class is in the pre-trained classes. I guess the example is biased.
      How can I detect the car FM antenna on your example image?

    • @CodeWithAarohi
      @CodeWithAarohi  8 months ago

      @Satchi017 check this: ruclips.net/video/WbCgU4GrjV4/видео.htmlsi=qbiPic5BmDPTUAPn

    • @Satchi017
      @Satchi017 8 months ago

      @CodeWithAarohi Ma'am, I have watched the video. Rather than detecting "hard hat" and "gloves", how can I detect the object (Red Probe/wire) in the image (a.jpg)?

  • @bb-andersenaccount9216
    @bb-andersenaccount9216 9 months ago

    Good job. However, it is not clear whether setting the classes means giving a description prompt or just picking a pre-trained class as usual. The person class you show in the example might be a typical pre-trained label rather than a description prompt, which makes the example confusing.

  • @Hemamalini-f3i
    @Hemamalini-f3i 9 months ago

    How to convert these detections into annotations?

    • @CodeWithAarohi
      @CodeWithAarohi  9 months ago

      There is no need to convert the detections into annotations for custom object detection. If you still want to do that, you can write a script that fetches the bounding box coordinates and stores them in a file.
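A rough sketch of such a script, under stated assumptions: the detections are already available as `(class_id, (x1, y1, x2, y2))` pairs in pixel coordinates, and the target is the YOLO text label format (one `class cx cy w h` line per box, normalized to the image size). The thread does not specify an annotation format, and `to_yolo_line` and `write_yolo_labels` are hypothetical names.

```python
from pathlib import Path


def to_yolo_line(class_id, xyxy, img_w, img_h):
    """Convert one pixel-space box (x1, y1, x2, y2) into a normalized YOLO label line."""
    x1, y1, x2, y2 = xyxy
    cx = (x1 + x2) / 2 / img_w   # box center x, as a fraction of image width
    cy = (y1 + y2) / 2 / img_h   # box center y, as a fraction of image height
    w = (x2 - x1) / img_w        # box width, normalized
    h = (y2 - y1) / img_h        # box height, normalized
    return f"{class_id} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}"


def write_yolo_labels(detections, img_w, img_h, out_path):
    """Write one label line per detection to out_path and return the lines."""
    lines = [to_yolo_line(cls, box, img_w, img_h) for cls, box in detections]
    Path(out_path).write_text("\n".join(lines) + ("\n" if lines else ""))
    return lines
```

For example, `write_yolo_labels([(0, (100, 50, 300, 250))], 400, 500, "a.txt")` writes a single line for class 0 to `a.txt`. The class ids and box coordinates have to be pulled out of whatever the detector returns before calling it.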

  • @informative7410
    @informative7410 8 months ago

    How can I convert YOLO-World to TFLite?

  • @عدنانمهداوي-ن5ث
    @عدنانمهداوي-ن5ث 8 months ago

    YOLO in real time is very slow, do you know why?