SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow

Object-Centric Learning with Slot Attention (Paper Explained)

DETR: End-to-End Object Detection with Transformers (Paper Explained)

Cristiano Ronaldo scores game-winning goal in 88' against Scotland | UEFA Nations League

4 Days of Training Like a Marine

Kendrick Lamar. Super Bowl LIX Halftime Show

CornerNet: Detecting Objects as Paired Keypoints (Paper Explained)

Yannic Kilcher

Просмотров 13 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 9 сен 2024

Комментарии • 23

@teslaonly2136 4 года назад ⁺²⁸
I almost finished 1/3 of your uploaded videos. It feels like someone is reading papers with me. The feeling is great! Thanks so much Yannic. Keep it up!
@AediWang 4 года назад ⁺⁶
Paper from 1 year ago is now "a bit old". Just amazing how fast the field moves.
@jasdeepsinghgrover2470 4 года назад ⁺²
This shows that there still so much left in DL to be done. One thing I see is that it seems that every point is making a prediction. According to research by Uber, there are location sensitive CNNs which can also be tried in these cases. Would love to see something like a combination of the two ideas.
@siddharthbhargava4857 4 года назад ⁺²
Thank you for the explanation.
@herp_derpingson 4 года назад ⁺²
These embeddings get more interesting the more you think about it. It is essentially two neural networks inventing a language to talk to each other. If we can make this interpretable, it might open up a lot of possibilities.
@kwonohhyeok5016 3 года назад
first subscription in my life. thanks for your video
@ruanjiayang 2 года назад
Corner pooling is a really smart way to largely increase perception field, sort of like deformable convolution. But DETR seems to generally solve the problem in object detection since it make use of the full image as perception field.
@AIwithAniket 4 года назад ⁺¹
Nicely explained
@rickywong8149 4 года назад ⁺²
I really like your content, can you make an explanation of centernet : objects as point i dont really quite get the idea of its loss function
@0lec817 4 года назад ⁺²
Any specific reason you went with this approach over any of the other very similar boxless/keypoint detection approaches like CenterNet ("Objects as Points") or CSPNet ("Center and Scale Prediction") that not even require laborious embeddings while performing equally or even better? Or the "CenterNet: Keypoint Triplets for Object Detection" paper that basically is the combination of the CornerNet and the Center approaches.
I mean they basically all do the same (keypoint detection) which in my opinion is quite different to what you suggested with the cross attention matrix from the attention heads?
@YannicKilcher 4 года назад
Yes this paper didn't turn out to be exactly what I hoped, but still interesting. I chose it just because it sounded like fun.
@larrybird3729 4 года назад ⁺³
The person who put the thumbs down has Oppositional defiant disorder (ODD)🤣
@l33tc0d3 4 года назад ⁺¹
My intuition is that using paired keypoints is cheaper but should be more inaccurate over anchor boxes. For example, It is not clear what the paper does when there are overlapping objects that share the same keypoint (e.g. top-left). Using keypoints is interesting nevertheless. I found another recent paper that just uses keypoints inside transformer to replace RGB tracking and matching pipeline for pose tracking task: arxiv.org/pdf/1912.02323.pdf
@austinmw89 3 года назад
Hey, have you done videos on the older but still heavily used architectures Faster RCNN, SSD, YOLO3, RetinaNet?
@awangprajaanugerah8231 Месяц назад
How can i find the research paper like you do
@efedoganay07 4 года назад ⁺¹
So, does network predict a tensor of WxHxC for heatmap branch ?
@YannicKilcher 4 года назад
Yes, one for top left and one for bottom right
@SadatAShaik 4 года назад ⁺²
Your videos are great!! Keep them up :) Why do you think they decided to go with these push and pull losses instead of using a triplet loss? Seems almost identical to the push + pull losses they propose
@YannicKilcher 4 года назад ⁺³
No idea, but it's either the first thing they tried, or they tried a bunch of things and this worked the best.
@LaoZhao11 4 года назад ⁺¹
now Taiwan (GMT+8) is 11 PM
yt: it's time reading a paper
@hanbrianlee Год назад
embeddings of 1 dimension, not 1 number. 1 number wouldn't work lol
@SethuIyer95 4 года назад ⁺²
first

Следующие

Автовоспроизведение

SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow

SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow

Object-Centric Learning with Slot Attention (Paper Explained)

Object-Centric Learning with Slot Attention (Paper Explained)

DETR: End-to-End Object Detection with Transformers (Paper Explained)

DETR: End-to-End Object Detection with Transformers (Paper Explained)

Cristiano Ronaldo scores game-winning goal in 88' against Scotland | UEFA Nations League

Cristiano Ronaldo scores game-winning goal in 88' against Scotland | UEFA Nations League

4 Days of Training Like a Marine

4 Days of Training Like a Marine

Kendrick Lamar. Super Bowl LIX Halftime Show

Kendrick Lamar. Super Bowl LIX Halftime Show

PlayStation 5 Pro Console - Reveal Trailer

PlayStation 5 Pro Console - Reveal Trailer

Synthesizer: Rethinking Self-Attention in Transformer Models (Paper Explained)

Synthesizer: Rethinking Self-Attention in Transformer Models (Paper Explained)

Learning To Classify Images Without Labels (Paper Explained)

Learning To Classify Images Without Labels (Paper Explained)

CornerNet: Detecting Objects as Paired Keypoints

CornerNet: Detecting Objects as Paired Keypoints

PR-241: Objects as Points

PR-241: Objects as Points

Philipp Krähenbühl - Point-based object detection

Philipp Krähenbühl - Point-based object detection

[Classic] Deep Residual Learning for Image Recognition (Paper Explained)

[Classic] Deep Residual Learning for Image Recognition (Paper Explained)

This is why Deep Learning is really weird.

This is why Deep Learning is really weird.

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Будь счастлива, наша малышка ❤️❤️❤️

Будь счастлива, наша малышка ❤️❤️❤️

Чайник из камня вручную. Мастеру 86 лет #ученые_против_мифов

Чайник из камня вручную. Мастеру 86 лет #ученые_против_мифов

ФОКУС -СВЕТОФОР

ФОКУС -СВЕТОФОР

МАЛЬЧИК ПОНЯЛ, ЧТО ЕГО ОБМАНУЛИ С «ЖАНА КАЗАХСТАНОМ»

МАЛЬЧИК ПОНЯЛ, ЧТО ЕГО ОБМАНУЛИ С «ЖАНА КАЗАХСТАНОМ»

Самый БОЛЬШОЙ iPhone в МИРЕ!

Самый БОЛЬШОЙ iPhone в МИРЕ!

Шок. Никокадо Авокадо похудел на 110 кг

Шок. Никокадо Авокадо похудел на 110 кг

ПОСТОЯННИК ЛОМБАРДА #шоу #юмор #спб #фитнес #вау

ПОСТОЯННИК ЛОМБАРДА #шоу #юмор #спб #фитнес #вау

Как встречать мужа с работы. Психолог Ирина Ковалёва

Как встречать мужа с работы. Психолог Ирина Ковалёва