Open Problems in Mechanistic Interpretability: A Whirlwind Tour | Neel Nanda | EAGxVirtual 2023

  • Published: 22 Aug 2024
  • Mechanistic Interpretability is a subfield of AI Alignment that studies trained neural networks and tries to reverse-engineer the algorithms they have learned. In this talk, Neel Nanda gives an overview of the field, key works, and some of its open problems.
    Learn more about effective altruism at: www.effectivealtruism.org
    Find out more about EA Global conferences at: www.eaglobal.org
