TL;DR -> This video discusses mechanistic interpretability, a set of methods for reverse engineering AI models to understand their internal computations. It also discusses instrumental convergence: the idea that an AI system could have goals, understand its context, and be competent enough to use that understanding to deceive. Mechanistic interpretability could be the right tool for uncovering a model's goals and the algorithms it follows, and could help distinguish deceptive AI systems from honest ones.
The example with Fourier transforms used to solve addition is really interesting. Our current AIs have truly alien ways of thinking, and perhaps by studying them we can also learn about our own blind spots. What's simple for us isn't necessarily objectively simple. For example, graphs and visualizations are a form of data analysis that leans on our innate visual abilities to make connections; frankly, it would be simpler to ingest the data and spit out the analysis directly, save for the fact that evolution did not bless us with a data-analysis organ.
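For anyone curious what the Fourier trick looks like concretely: the grokking work found a network computing (a + b) mod p by representing numbers as rotations on a circle, so that angle-addition identities turn addition into multiplication of trig features. Here's a minimal sketch of that "clock" algorithm in plain NumPy — this is an illustration of the idea, not the network's actual weights, and the modulus/frequency are just assumptions for the example:

```python
import numpy as np

p = 113              # modulus for the toy task (assumed for this sketch)
w = 2 * np.pi / p    # base frequency: one full rotation = p steps

def add_mod_p(a, b):
    """Compute (a + b) mod p using only trig features of a and b."""
    # Angle-addition identities recover sin/cos of w*(a+b) from the
    # individual features, with no explicit addition of a and b:
    sin_sum = np.sin(w * a) * np.cos(w * b) + np.cos(w * a) * np.sin(w * b)
    cos_sum = np.cos(w * a) * np.cos(w * b) - np.sin(w * a) * np.sin(w * b)
    # Read the resulting angle back off the circle and convert to an integer.
    angle = np.arctan2(sin_sum, cos_sum)
    return round(angle / w) % p

print(add_mod_p(100, 50))  # → 37, since (100 + 50) % 113 == 37
```

The alien-ness is exactly this: instead of carrying digits, the model composes rotations, which is a perfectly valid (and for a neural net, apparently natural) way to do modular arithmetic.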
Was it mentioned in a podcast how he gets funding for his independent research?
The Effective Altruism Long-Term Future Fund. He wrote about it briefly in his LessWrong posts.
Really interesting talk, I’m looking forward to the next part!