Neel Nanda: Mechanistic Interpretability & Mathematics

Paul Christiano: Formalizing Explanations of Neural Network Behaviors

Why Does Diffusion Work Better than Auto-Regression?

4 dead, 9 injured in mass shooting at North Birmingham nightclub

Ignition Teaser: A Name Forged in Flames | Genshin Impact #Ignition #Teaser #GenshinImpact

Destiny 2: Echoes | Act 2 Trailer

Sadhika Malladi: Mathematical Views on Modern Deep Learning Optimization

Sydney Mathematical Research Institute - SMRI

Просмотров 359

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 27 сен 2023
Speaker: Sadhika Malladi, Princeton University
Date: 28 September 2023
Abstract: This talk focuses on how rigorous mathematical tools can be used to describe the optimization of large, highly non-convex neural networks. We start by covering how stochastic differential equations (SDEs) provide a rigorous yet flexible model of how deep networks change over the course of training. We then cover how the SDEs yield practical insights into scaling training to highly distributed settings while preserving generalization performance. In the second half of the talk, we will explore the new deep learning paradigm of pre-training and fine-tuning large language models. We show that fine-tuning can be described by a very simplistic mathematical model, and insights allow us to develop a highly efficient and performant optimizer to fine-tune LLMs at scale. The talk will focus on various mathematical tools and the extent to which they can describe modern day deep learning.
Seminar series website: sites.google.com/view/m-ml-sy...
Наука

Комментарии •

Следующие

Автовоспроизведение

Neel Nanda: Mechanistic Interpretability & Mathematics

Neel Nanda: Mechanistic Interpretability & Mathematics

Paul Christiano: Formalizing Explanations of Neural Network Behaviors

Paul Christiano: Formalizing Explanations of Neural Network Behaviors

Why Does Diffusion Work Better than Auto-Regression?

Why Does Diffusion Work Better than Auto-Regression?

4 dead, 9 injured in mass shooting at North Birmingham nightclub

4 dead, 9 injured in mass shooting at North Birmingham nightclub

Ignition Teaser: A Name Forged in Flames | Genshin Impact #Ignition #Teaser #GenshinImpact

Ignition Teaser: A Name Forged in Flames | Genshin Impact #Ignition #Teaser #GenshinImpact

Destiny 2: Echoes | Act 2 Trailer

Destiny 2: Echoes | Act 2 Trailer

How To Use All 14 Elden Ring DLC Sorceries...

How To Use All 14 Elden Ring DLC Sorceries...

Shane G. Henderson: A Tutorial and Perspectives on Monte Carlo Simulation Optimization

Shane G. Henderson: A Tutorial and Perspectives on Monte Carlo Simulation Optimization

Has Generative AI Already Peaked? - Computerphile

Has Generative AI Already Peaked? - Computerphile

Víctor Elvira: State-space models as graphs

Víctor Elvira: State-space models as graphs

US New Supersonic Passenger Jet Finally Makes Its First Flight!

US New Supersonic Passenger Jet Finally Makes Its First Flight!

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Francois Charton: Transformers for maths, and maths for transformers

Francois Charton: Transformers for maths, and maths for transformers

Stanford's FREE data science book and course are the best yet

Stanford's FREE data science book and course are the best yet

2 Years of My Research Explained in 13 Minutes

2 Years of My Research Explained in 13 Minutes

Colorful Vulcan w rtx 4070ti Super

Colorful Vulcan w rtx 4070ti Super

3 месяца подготовки и тут.. Выпуск уже на канале! #litvin #unit_ru #iphone #litenergy #shortvideo

3 месяца подготовки и тут.. Выпуск уже на канале! #litvin #unit_ru #iphone #litenergy #shortvideo

Неожиданная концовка. $5000 или что-то из apple магазина? #shorts #опрос #сигма #телефон #rec #fyp

Неожиданная концовка. $5000 или что-то из apple магазина? #shorts #опрос #сигма #телефон #rec #fyp

👍САМЫЙ ПРИКОЛЬНЫЙ СМАРТФОН В 2024 ГОДУ!

👍САМЫЙ ПРИКОЛЬНЫЙ СМАРТФОН В 2024 ГОДУ!

Worlds smallest 4K headset 😎 Visor.com #tech #vr #technology #virtualreality #insideout2

Worlds smallest 4K headset 😎 Visor.com #tech #vr #technology #virtualreality #insideout2

😱Самая ДОРОГАЯ мышка от китайцев WLmouse Beast X Max 8K

😱Самая ДОРОГАЯ мышка от китайцев WLmouse Beast X Max 8K

TECNO CAMON 30 smartfoni eng bejetni smartfon BEKMOBIL do’konlarida🤩🔥

TECNO CAMON 30 smartfoni eng bejetni smartfon BEKMOBIL do’konlarida🤩🔥

Как думаете, КС потянет? 😂 #shorts #gaming #pc #asus #cs2 #csgo

Как думаете, КС потянет? 😂 #shorts #gaming #pc #asus #cs2 #csgo