Absolute gold for developing intuition. Many thanks, professor.
When you explained the mystery of how neural networks generalize so well, I really wanted to know the answer.
Absolute gem of a video!!
Great presentation! Refreshing to go through a math concept without maths :)
Great talk, nice slides, thanks! I am wondering: if it is so difficult to find narrow, sharp minima, why are SVMs finding them?
I think that simple models like SVMs without non-linear kernels just lack the power or expressiveness to generate loss landscapes that even contain large-margin flat minima, since they simply can't linearly separate the data. In some sense, all minima in a linear SVM are equally bad?
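A quick toy sketch of that point (my own illustration, assuming scikit-learn; nothing from the talk): on concentric circles no linear boundary with a real margin exists, so every linear fit is roughly equally bad, while a non-linear kernel lifts the data and recovers a wide margin.

```python
# Hypothetical illustration: a linear SVM on concentric circles, which it
# cannot separate, vs. an RBF-kernel SVM on the same data.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=200, noise=0.05, factor=0.5, random_state=0)

linear = SVC(kernel="linear").fit(X, y)  # no linear margin exists for this data
rbf = SVC(kernel="rbf").fit(X, y)        # the kernel lifts the data; a margin appears

print("linear accuracy:", linear.score(X, y))  # near chance, ~0.5
print("rbf accuracy:   ", rbf.score(X, y))     # near 1.0
```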
Thanks a lot for the awesome talk! I really wish more talks in the area of deep learning would prioritize intuition and general understanding over mathematical proofs.
Absolutely fantastic video, thank you!
amazing talk
Fantastic talk.
Really good lecture, thanks!
Very interesting! So this kinda explains why CNNs tend to overfit on textures or otherwise seek simpler visual features: doing so leads to wider margins and flatter minima.
Also, it seems that for the example at 49:30 we need either more data for the inner red circle (and then we naturally get wider margins for the desirable minimum) or stronger priors. Speaking of the latter case: a "circular pattern" is geometrically much simpler than the cherry-picked formation of arcs that the model found, and we already have two circular formations in the image. Can we integrate these sorts of priors into the loss function via an attention mechanism or in some other way? Can transformers do that?
I guess it would be nice if NNs could somehow "zoom" into the dataset in this case, to artificially make linear separability with a wide margin easier (more likely). No idea how to implement that, though.
Spectacular talk!
So, we know wide-margin minima are good and easy to find when they exist, but I guess the question remains: why do wide-margin flat minima exist in the first place? My bet is that current networks tend to contain at least a few wide layers, wide layers produce high-dimensional outputs, and we know linear separability is easier in higher dimensions. Also, I think that the deeper a network is, the more likely it is that the data becomes easily separable at some layer (and therefore a wide-margin minimum can exist), since layers near the end tend to represent higher-level features.
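A toy sketch of that high-dimension intuition (my own, assuming NumPy and scikit-learn; not from the talk): a single wide, completely untrained random ReLU layer lifts the 2-D circles, which are not linearly separable, into a 1000-dimensional space where a plain linear classifier succeeds.

```python
# Hypothetical illustration of "wide layers make linear separability easier":
# a fixed random wide ReLU layer lifts the data; only a linear head is fit.
import numpy as np
from sklearn.datasets import make_circles
from sklearn.linear_model import LogisticRegression

X, y = make_circles(n_samples=200, noise=0.05, factor=0.5, random_state=0)

rng = np.random.default_rng(0)
W = rng.normal(size=(2, 1000))   # random weights of a width-1000 hidden layer
H = np.maximum(X @ W, 0.0)       # ReLU features; the layer itself is never trained

print("linear on raw 2-D: ", LogisticRegression(max_iter=5000).fit(X, y).score(X, y))  # ~0.5
print("linear on lifted H:", LogisticRegression(max_iter=5000).fit(H, y).score(H, y))  # ~1.0
```

This is essentially the picture behind Cover's theorem: once the number of dimensions exceeds the number of points, almost any labeling of points in general position becomes linearly separable.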
Great talk! Thanks for sharing
Extremely useful