DETR: End-to-End Object Detection with Transformers (Paper Explained)

Swin Transformer

DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

Trying EVERY Fast Food Holiday Item!

Marvel Rivals | Winter Celebration, Joyful Jubilation

THE AMAZING DIGITAL CIRCUS - Ep 4: Fast Food Masquerade

DETR - End to end object detection with transformers (ECCV2020)

Nicolas Carion

Просмотров 24 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 16 дек 2024

Комментарии • 25

@fire_nakamura 14 дней назад ⁺¹
I'm fascinated by you and your team members' craft, with tweaks on loss, ideas of encodings and sufficient amount of data, applications will be huge. I would love to learn and explore those possibilities, Isn’t there anyway to be a part of your team or contribute to any related projects?
@kvnptl4400 6 месяцев назад
A very nice presentation with clear visualizations and easy-to-understand explanations! Great Work!!🌟🌟🌟🌟🌟
Smooth animations 👌
@QuintinMassey 2 года назад ⁺³
Outstanding work. I’m also very interested in the, arguably more difficult, small object detection problem.
@syedabdul8509 3 года назад ⁺⁷
Excellent Explanation.
But I want to know the most important thing in this video,
How did you create those cool animations like @1:58-@2:20 and @8:00-@8:05
@praveen9083 3 года назад ⁺²
I'm expecting this answer too!
@nicollenunes4459 11 месяцев назад
@@praveen9083 me 2!
@azharhussian4326 2 месяца назад
anyone has idea?
@MarioHari 4 года назад ⁺²
Nice work!
A small correction to what you said: "Semantic segmentation labels each pixel in the whole image. It is not restricted to only pixels in the background".
@nicolascarion3111 4 года назад ⁺⁵
You're right, my statement is imprecise. I meant that semantic annotations of foreground classes are not used in the panoptic task.
@MarioHari 4 года назад
@@nicolascarion3111 merci infiniment :)
@ujjalkrdutta7854 2 года назад
@@nicolascarion3111 Can we then say that: "Panoptic Segmentation= Instance Segmentation+Semantic Segmentation minus annotations of foreground classes" ?
@Ramakrishnan-bq9is 3 года назад ⁺¹
Thanks for sharing!
Could you please explain what you mean by full differentiable and how other methods might not be fully differentiable?
@goldenshale 2 года назад
This is an end to end neural network defined by functions which all have derivatives. In the R-CNN family of algorithms you have one procedure that produces a bunch of region proposals, then you crop out these regions and feed them to a classifier, and then you run another algorithm to prune out overlapping and low confidence predictions. Since there are multiple steps that have logical rather than mathematical implementations, you can't take derivatives all the way through to back propagate information through the whole system.
@morancium 26 дней назад
WoW thankyou for your contribution!
@Nino234mff 3 года назад
Thank you for the great work and the presentation!
@kaceangelo132 3 года назад
i realize it is quite off topic but do anyone know of a good website to watch new movies online ?
@bakercain265 3 года назад
@Kace Angelo try Flixzone. Just google for it =)
@chandrahasp6697 Год назад
Really good work!
@ujjalkrdutta7854 2 года назад
Elegant explanation. liked it
@rohinim7707 4 года назад ⁺¹
Amazing! What was the main motivation behind using a sequence model for an object detection?
@redjammie8342 4 года назад
It is not a sequence model. It was successfully used for sequences, but it's not a sequence model by definition.
@ZobeirRaisi 4 года назад ⁺¹
What this mean?: "since the transformer is a permutation
equivalent some extra care is required to retain
the 2d structure of the image."
@nicolascarion3111 4 года назад ⁺⁷
The transformer isn't aware of the 2D structure of the image, because 1) we flatten it and 2) permuting the inputs of a transformer simply permutes its outputs (permutation equivariance). That's why we add 2D positional encodings. This is similar to what is done in NLP, to retain the order of the sentence.
@ZobeirRaisi 4 года назад ⁺¹
@@nicolascarion3111 Thanks for your explanation. I have another question: Right now DETR because of rectangle bboxes of COCO-dataset produces rectangle-bboxes outputs, if we had polygon bboxes (8 points), which parts of the architecture must be modified to output a polygon shape bboxes?
@nicolascarion3111 4 года назад ⁺⁴
@@ZobeirRaisi Well you need to modify the regression head as well as the loss and matching function (GiOU may not make sense anymore, so you'll likely have to stick to L1). For this kind of questions, it's best to open an issue on our github. Thanks!

Следующие

Автовоспроизведение

DETR: End-to-End Object Detection with Transformers (Paper Explained)

DETR: End-to-End Object Detection with Transformers (Paper Explained)

Swin Transformer

Swin Transformer

DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

Trying EVERY Fast Food Holiday Item!

Trying EVERY Fast Food Holiday Item!

Marvel Rivals | Winter Celebration, Joyful Jubilation

Marvel Rivals | Winter Celebration, Joyful Jubilation

THE AMAZING DIGITAL CIRCUS - Ep 4: Fast Food Masquerade

THE AMAZING DIGITAL CIRCUS - Ep 4: Fast Food Masquerade

The Witcher 4 - Official Reveal Trailer | The Game Awards 2024

The Witcher 4 - Official Reveal Trailer | The Game Awards 2024

RT DETR - realtime object detection with transformers

RT DETR - realtime object detection with transformers

[Tutorial] Training End-to-end Object Detection with Transformer(DETR) model on custom dataset

[Tutorial] Training End-to-end Object Detection with Transformer(DETR) model on custom dataset

15 НОВЫХ ЗАПРЕТОВ ГИБДД: ксенон, LED, катализатор, видеорегистратор, тонировка, дефлекторы, фаркоп

15 НОВЫХ ЗАПРЕТОВ ГИБДД: ксенон, LED, катализатор, видеорегистратор, тонировка, дефлекторы, фаркоп

End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

[CVPR 2024] RT-DETR, DETRs Beat YOLOs on Real-time Object Detection.

[CVPR 2024] RT-DETR, DETRs Beat YOLOs on Real-time Object Detection.

YOLO Object Detection (Part 1)

YOLO Object Detection (Part 1)

Vision Transformer in PyTorch

Vision Transformer in PyTorch

Object Detection as a Machine Learning Problem - Ross Girshick

Object Detection as a Machine Learning Problem - Ross Girshick

How to Train DETR Object Detection Transformer on Custom Dataset

How to Train DETR Object Detection Transformer on Custom Dataset

Перегон авто на ЖЁСТКОЙ СЦЕПКЕ пошёл не по плану.BMW x5m НАДЕЖНЕЕ Toyota

Перегон авто на ЖЁСТКОЙ СЦЕПКЕ пошёл не по плану.BMW x5m НАДЕЖНЕЕ Toyota

Подземелья Чикен Карри #33 Зверь Фест (Мягкова, Бустер, Котельникова, Гудков, BRB)

Подземелья Чикен Карри #33 Зверь Фест (Мягкова, Бустер, Котельникова, Гудков, BRB)

Doctor is helping to remote controlled Car 👩‍⚕️💉🚗 #builderc

Doctor is helping to remote controlled Car 👩‍⚕️💉🚗 #builderc

7 НАУЧНЫХ способов стать ПРИВЛЕКАТЕЛЬНЫМ | Вячеслав Дубынин

7 НАУЧНЫХ способов стать ПРИВЛЕКАТЕЛЬНЫМ | Вячеслав Дубынин

ОХОТИМСЯ на НОЧНОГО МОНСТРА! - Debt Hunt + MyNeosha, Demaster

ОХОТИМСЯ на НОЧНОГО МОНСТРА! - Debt Hunt + MyNeosha, Demaster

Mache leckere Lutscher mit diesem PRO-Gadget! 🚽🍭

Mache leckere Lutscher mit diesem PRO-Gadget! 🚽🍭

ПРИВЕТИК ЧЕРЕЗ ИНТЕРНЕТИК #натальнаякарта #карцев #журавлев #иванченко #mediumquality

ПРИВЕТИК ЧЕРЕЗ ИНТЕРНЕТИК #натальнаякарта #карцев #журавлев #иванченко #mediumquality

Сдала тест ДНК по приколу

Сдала тест ДНК по приколу