Definitely good news!
Thank you very much for the quality video, man
Enjoyed this a lot! Pls do more
Great! That's the plan
Great video!
Makes me wonder if there'll always be a fundamental tradeoff between interpretability and goodness of fit for a model
When you’re getting ready to go as sexy Pugsley to a costume party just as Anthropic cracks mech interp:
Recent papers are really starting to steal all my good ideas. (Or at least confirming that my ideas were good and novel to begin with)
I don't think I investigated this particularly far, since I didn't feel the implications were that meaningful. Having these features, and even a map of the neurons associated with each feature, doesn't seem that useful to me in itself. By "not that useful" I mean that actually using this to make models safer is still extremely difficult, and there are like 20 easier avenues to better model performance, so I'm pretty unsure whether they'll find ways to use this in practice that don't eventually lead to dead ends.
I am having exactly the same experience, it's crazy. Notepads full of ideas that turn up on arXiv sometimes months later. Best move: actualise autonomously.
always good content
always good comments
Since we're talking about probabilities of the next token at different levels of attention, Large Text Models only create the illusion of intelligence for their users. LTMs are great for transcription, translation, or as NLP interfaces to other systems, but there's no real need for mechanistic interpretability. A far better goal would be to give them reasoning capacity; then you could simply ask them about their introspection 🎉
"They" did not say that mechanistic interpretability would never be achieved, "they" said that it should have been pursued as a fundamental step.
🎉
I dig the hair. It ages you (in a good way).
Thanks, glad you like it!
You are cute!