Sadhika Malladi: Mathematical Views on Modern Deep Learning Optimization

Greg Yang: The unreasonable effectiveness of mathematics in large scale deep learning

This is why Deep Learning is really weird.

There Is Something Hiding Inside Earth

🎵 Luigi's Lament 3: BRING BOWSER BACK! 🎵

Perrie - You Go Your Way (Official Video)

Greg Yang: The unreasonable effectiveness of mathematics in large scale deep learning

Sydney Mathematical Research Institute - SMRI

Просмотров 1,8 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 4 окт 2024
Speaker: Greg Yang, xAI
Date: 13 September 2023
Abstract: Recently, the theory of infinite-width neural networks led to the first technology, muTransfer, for tuning enormous neural networks that are too expensive to train more than once. For example, this allowed us to tune the 6.7 billion parameter version of GPT-3 using only 7% of its pretraining compute budget, and with some asterisks, we get a performance comparable to the original GPT-3 model with twice the parameter count. In this talk, I will explain the core insight behind this theory. In fact, this is an instance of what I call the *Optimal Scaling Thesis*, which connects infinite-size limits for general notions of “size” to the optimal design of large models in practice. I'll end with several concrete key mathematical research questions whose resolutions will have incredible impact on the future of AI.
Seminar series website: sites.google.c...

Комментарии •

Следующие

Автовоспроизведение

Sadhika Malladi: Mathematical Views on Modern Deep Learning Optimization

Sadhika Malladi: Mathematical Views on Modern Deep Learning Optimization

Greg Yang: The unreasonable effectiveness of mathematics in large scale deep learning

Greg Yang: The unreasonable effectiveness of mathematics in large scale deep learning

This is why Deep Learning is really weird.

This is why Deep Learning is really weird.

There Is Something Hiding Inside Earth

There Is Something Hiding Inside Earth

🎵 Luigi's Lament 3: BRING BOWSER BACK! 🎵

🎵 Luigi's Lament 3: BRING BOWSER BACK! 🎵

Perrie - You Go Your Way (Official Video)

Perrie - You Go Your Way (Official Video)

We Put Two Electric Motors in Hondas Smallest Sports Car and it Rips!

We Put Two Electric Motors in Hondas Smallest Sports Car and it Rips!

Neel Nanda: Mechanistic Interpretability & Mathematics

Neel Nanda: Mechanistic Interpretability & Mathematics

A mathematical theory of learning in deep neural networks - Surya Ganguli

A mathematical theory of learning in deep neural networks - Surya Ganguli

The Race to Harness Quantum Computing's Mind-Bending Power | The Future With Hannah Fry

The Race to Harness Quantum Computing's Mind-Bending Power | The Future With Hannah Fry

Greg Yang - "Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer"

Greg Yang - "Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer"

Computer Scientist Explains One Concept in 5 Levels of Difficulty | WIRED

Computer Scientist Explains One Concept in 5 Levels of Difficulty | WIRED

Why Vertical LLM Agents Are The New $1 Billion SaaS Opportunities

Why Vertical LLM Agents Are The New $1 Billion SaaS Opportunities

The Grandfather Of Generative Models

The Grandfather Of Generative Models

Greg Yang | Large N Limits: Random Matrices & Neural Networks | The Cartesian Cafe w/ Timothy Nguyen

Greg Yang | Large N Limits: Random Matrices & Neural Networks | The Cartesian Cafe w/ Timothy Nguyen

Terence Tao at IMO 2024: AI and Mathematics

Terence Tao at IMO 2024: AI and Mathematics

ЗАЧЕМ МЫ ЕГО КУПИЛИ ??? ДЛЯ ЖЕЛЕЙНОГО МЕДВЕДЯ ВАЛЕРЫ

ЗАЧЕМ МЫ ЕГО КУПИЛИ ??? ДЛЯ ЖЕЛЕЙНОГО МЕДВЕДЯ ВАЛЕРЫ

КОТЯТА НАУЧИЛИСЬ ГОВОРИТЬ#cat

КОТЯТА НАУЧИЛИСЬ ГОВОРИТЬ#cat

Нельзя смеяться | Смех с водой | 83 #shorts

Нельзя смеяться | Смех с водой | 83 #shorts

🤣 Придумали, как зарабатывать, ничего не делая! И всё получилось! | Новостничок

🤣 Придумали, как зарабатывать, ничего не делая! И всё получилось! | Новостничок

Те самые соседи в три часа ночи #умихрум

Те самые соседи в три часа ночи #умихрум

ВКУСНАЯ ВОДА! Кулинарка с @dimapozov #кулинарка #кулинария #кулинарноешоу

ВКУСНАЯ ВОДА! Кулинарка с @dimapozov #кулинарка #кулинария #кулинарноешоу

Russian fighter jet executes risky manoeuvre near US aircraft

Russian fighter jet executes risky manoeuvre near US aircraft

Их надо запретить | Николай Чубаров @hudeuotremonta

Их надо запретить | Николай Чубаров @hudeuotremonta