Paul Christiano: Formalizing Explanations of Neural Network Behaviors

  • Published: 30 Oct 2023
  • Paul Christiano (Alignment Research Center): October 26
    Abstract: Existing research on mechanistic interpretability usually tries to develop an informal human understanding of “how a model works,” making it hard to evaluate research results and raising concerns about scalability. Meanwhile, formal proofs of model properties seem far out of reach both in theory and in practice. In this talk I’ll discuss an alternative strategy for “explaining” a particular behavior of a given neural network. This notion is much weaker than proving that the network exhibits the behavior, but it may still provide similar safety benefits. This talk will primarily motivate a research direction and a set of theoretical questions rather than present results.
    Course homepage: sites.google.com/view/m-ml-sy...