2:33 Contrastive Learning - Dollar Drawings
3:09 Motivation of Self-Supervised Learning
4:48 Success with DeepMind Control Suite
6:00 MoCo Framework Overview
8:08 Dynamic Dictionary Look-up Problem
8:46 Data Augmentation
9:21 Key Dictionary should be Large and Consistent
10:42 Large Dictionaries
11:32 Dictionary Solutions in MoCo
13:37 Experiments
14:26 Ablations
16:10 MoCov2 with SimCLR extensions
18:24 Training with Dynamic Targets
The queue encoding is FIFO not LIFO, correct me if I'm wrong
You are not.
I was confused by it too...
6:15 The sum in the denominator is not over all _other_ keys, but over _ALL_ keys, including the positive one. From the paper, right under the equation: "The sum is over one positive and K negative samples."
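For reference, this is the contrastive (InfoNCE) loss as written in the paper, with temperature τ; the denominator runs over the one positive key k_+ and all K negatives:

```latex
% InfoNCE loss from the MoCo paper: the positive key k_+ appears both in the
% numerator and as one of the K+1 terms summed in the denominator.
\mathcal{L}_q = -\log \frac{\exp(q \cdot k_+ / \tau)}{\sum_{i=0}^{K} \exp(q \cdot k_i / \tau)}
```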
thanks for taking the time Connor .. still couldn't figure out 2 mysteries from the paper
a) why maintain a dictionary when we are NOT sampling from it? From the pseudocode in the paper, the only time the queue is used is while calculating the -ve logits (which has an additional issue .. if I'm taking all KEYS from the current batch, there will definitely be +ve keys in the queue when I multiply the query into the queue, right? Most will be -ve, but at least the +ve pairs in the batch WILL result in +ve keys)
b) while calculating the loss, the paper uses an N-dim array of 0's .. I understand it specifies the 0th index as the target label, so I can assume the 0th index is 1 and the rest are 0's, BUT one would assume that only the positive logits would need to be closer to the 0th index .. why are they making even the -ve logits come closer to the 0th index? .. I'm quite confused
Hey Vikram, I will try to get around to this. Please feel free to join the Weaviate slack chat to ping me again about this in case I forget.
@@connor-shorten thanks much .. I just re-read the paper and realized that the dictionary is nothing but a big sampler of ALL the -ve keys .. so my understanding is that since the query encoder is being trained to learn the best possible representation of the images, it can only do so if it comes as close as possible to the +ve key and gets as far AWAY as possible from all the -ve keys in the dictionary .. so the more -ve keys it can "escape" from, the better and crisper the image representation gets, enabling the encoder to produce richer image embeddings that can be used on low-volume datasets via supervised learning (instead of using the small dataset to create an overfit model OR, theoretically, using ImageNet's supervised pre-trained models)
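To make question b) concrete, here is a minimal PyTorch-style sketch of the loss step, following the paper's Algorithm 1 (function and variable names are illustrative, not the released code). The positive logit is placed in column 0 of each row, so the all-zeros target just says "class 0 is the correct class": cross-entropy pushes the positive logit up and all K negative logits down, it does not pull the negatives towards index 0.

```python
import torch
import torch.nn.functional as F

# Minimal sketch of the MoCo loss, following Algorithm 1 in the paper.
# q: queries from the query encoder, shape (N, C)
# k: positive keys from the momentum (key) encoder, shape (N, C)
# queue: K past keys acting as negatives, shape (C, K)
def moco_loss(q, k, queue, tau=0.07):
    q = F.normalize(q, dim=1)
    k = F.normalize(k, dim=1).detach()                 # no gradient to the key encoder
    l_pos = torch.einsum("nc,nc->n", q, k)[:, None]    # (N, 1) positive logits
    l_neg = torch.einsum("nc,ck->nk", q, queue)        # (N, K) negative logits
    logits = torch.cat([l_pos, l_neg], dim=1) / tau    # positive sits at index 0
    # Targets are all zeros because the positive occupies column 0 of every row;
    # cross-entropy raises that logit and lowers the K negatives.
    labels = torch.zeros(logits.size(0), dtype=torch.long, device=q.device)
    return F.cross_entropy(logits, labels)
```

On question a): in Algorithm 1 the current batch's keys are enqueued only after the loss and the momentum update, so at loss time the queue does not contain the positives for the current queries and serves purely as the pool of negatives.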
What is the problem with using the same encoder for both keys and queries? Why should they be different?
If I have understood the paper correctly, using the same encoder for keys and queries results in an oscillating loss, because the encoder changes too fast for the "older" keys. (See Section 3.2, momentum update, and Section 4.1, ablation: momentum, in the paper)
Speed.
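For reference, the momentum update from Section 3.2: only θ_q is updated by back-propagation, and the key encoder trails it slowly (the paper uses m = 0.999), which is what keeps the keys in the queue mutually consistent:

```latex
% Momentum update of the key encoder parameters (MoCo, Sec. 3.2).
\theta_k \leftarrow m\,\theta_k + (1 - m)\,\theta_q
```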
Thanks for the video.
Why are the weights computed for the query encoder useful at all for learning the key encoder?
We are aiming for one representation space as the product of this task. The query and key encoders can't be too disentangled from each other, because then the query encoder could learn a trivial solution to map queries to their positive keys.
Good question, it's challenging to answer well; please ask any follow-up questions or comments on this.
@Patrik Vacek Because then you'll either have a small dictionary due to memory constraints, or, if you store past mini-batches, your dictionary will be inconsistent because the stored keys come from outdated versions of the encoder.
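A minimal sketch of how MoCo sidesteps both problems (helper names are illustrative, not from the official repo): the key encoder is a slow momentum copy of the query encoder, and the dictionary is a FIFO queue, so each step the newest keys are enqueued and the oldest, least consistent ones are dropped:

```python
import torch

@torch.no_grad()
def momentum_update(f_q, f_k, m=0.999):
    # Key encoder trails the query encoder instead of being a literal copy.
    for p_q, p_k in zip(f_q.parameters(), f_k.parameters()):
        p_k.data.mul_(m).add_(p_q.data, alpha=1.0 - m)

@torch.no_grad()
def dequeue_and_enqueue(queue, keys):
    # queue: (C, K) dictionary of keys; keys: (N, C) keys from the current batch.
    # FIFO: drop the N oldest columns from the front, append the N newest at the end.
    return torch.cat([queue[:, keys.size(0):], keys.T], dim=1)
```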
Thank you !!!!!
Thank you for watching!
Great summary Connor!
I love that someone is breaking down actually complex topics in AI with this much care and consistency, but it goes completely unnoticed while Siraj + Medium collect views with clickbaity content xd
Thank you, It was really helpful.
Thank you!
Thank you
I did not look at the paper but that looks similar to Siamese neural networks.
Nice
Thank you!
@@connor-shorten How can I contribute?
Thank you!