MLT __init__ Session #4 - SSD: Single Shot MultiBox Detector

Поделиться
HTML-код
  • Опубликовано: 27 окт 2024

Комментарии • 32

  • @youssefelassfouri9659
    @youssefelassfouri9659 3 года назад +7

    amasing video , clear and direct. please never stop making these !! thank you

  • @go64bit
    @go64bit Год назад +3

    Amazing lecture. Very good explanation of SSD and its EfficientDet descendants.

  • @logansizemore4735
    @logansizemore4735 3 года назад +3

    Best description I found. Good presentation. I love the visuals.

  • @atticross
    @atticross 3 года назад +7

    You have fantastic pedagogical skills. Thanks for the clear and lovely explanations

  • @gennarofarina94
    @gennarofarina94 2 года назад +1

    Many compliments. This is a great introduction to SSD and its directions for improvement.

  • @hole386
    @hole386 5 месяцев назад

    Finnaly someone explaining how they actually make detections. So many people on here gloss over that part, and explain nothing about how the featuremaps are turned into boxes.

  • @danielkusuma6473
    @danielkusuma6473 3 года назад +2

    Very clear explanation and description of basically the original paper. Thank you!

  • @MyungeinHan
    @MyungeinHan 8 месяцев назад

    This is really ground breaking! I love your explanation. Straight to the point & Simple.

    • @MLTOKYO
      @MLTOKYO  8 месяцев назад

      We'll pass the feedback on! :)

  • @RajkamalUdayasuriyan
    @RajkamalUdayasuriyan 5 месяцев назад

    Thank you for this wonderful Video, May I know how the default bounding boxes are encoded and what are their default location , size and aspect ratio in a feature map ?

  • @AmlanPanigrahi
    @AmlanPanigrahi 3 года назад +2

    This is so clean and thorough!

  • @19916718514
    @19916718514 3 года назад +2

    best explaination on ssd i have ever found!

  • @Paladin1983PL
    @Paladin1983PL 7 месяцев назад

    Where can I get coordinates of the default 8732 boxes?
    I need to interpret the raw output localization tensor (the one with size [1,4,8732]) using C++ (LibTorch) so I can't use nvidia_ssd_processing_utils for that. I understand how to calculate the final bounding boxes but I don't have the default box data (dx, dy, dw, dh).

  • @hanshima_
    @hanshima_ 2 года назад +1

    Thanks for your generosity to teach this topic.

  • @asifmehmood6803
    @asifmehmood6803 3 года назад +2

    Could you please make videos on RCNN and its variants with focus on RPN?

  • @kotanvich
    @kotanvich 2 года назад +1

    Thanks a lot, great and clear explanation

  • @from-chimp-to-champ1
    @from-chimp-to-champ1 2 года назад

    Beautiful lecture, thank you sir! May I just ask a little question?
    On the GIF visualizations that you show, it intuitively seems like we perform convolution of default bounding boxes with the feature maps of different scales. But it is written, that the actual size of convolutional blocks is 3 x 3. Could you clarify, how convolutions of feature maps with 3 x 3 blocks "intuite", with which default bounding box the network is working?
    Thank you very much! Sincerely, Pavel

  • @mariamIbrahim5848
    @mariamIbrahim5848 Год назад +1

    very clear and helpful thank you

  • @granatapfel6661
    @granatapfel6661 3 года назад +1

    Can I use your presentation for my university project?

  • @lackofsleep7653
    @lackofsleep7653 2 года назад

    You’re wonderful and save my life!!!!

  • @hosseinmadadi9141
    @hosseinmadadi9141 2 года назад +1

    it was very useful and thank you for your time!

  • @yasminemohamed5157
    @yasminemohamed5157 2 года назад

    such a great video, thank you!

  • @antonioconsiglio95
    @antonioconsiglio95 2 года назад

    Thanks for sharing, good explanation.

  • @clashtm8210
    @clashtm8210 11 месяцев назад

    Great lecture.

  • @harshdevmurari007
    @harshdevmurari007 11 месяцев назад

    crystal clear explaination

  • @AIandtheworld
    @AIandtheworld 2 года назад

    This was amazing!

  • @rezarawassizadeh4601
    @rezarawassizadeh4601 2 года назад

    Very good explanation, but it seems your layers are not correct they are 6 layers not 5.

  • @budiandraym8451
    @budiandraym8451 2 года назад

    is ssd using FFNN or CNN ?

  • @Ohjjin
    @Ohjjin 3 года назад

    I like the jib of this guy’s jab

  • @lotfilofti6220
    @lotfilofti6220 2 года назад +1

    in 6:27 you have 6 results concatenated in the detection block (detection 8732 by class), but in 11:55 you have only 5 outputs (features maps) ??????????????????????????????????????????????????????????????????????????????????????????????!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

  • @wurstachim
    @wurstachim Год назад +1

    svm = __support__ vector machine.

    • @les3464
      @les3464 Год назад

      I kept thinking that myself lol