Dimitris Papailiopoulos - "Self-Improving Transformers: Overcoming Length Generalization Challenges"

  • Published: Feb 9, 2025
  • Time: Wednesday, Jan 29th, 12:30-1:30 pm
    Speaker: Dimitris Papailiopoulos (UW-Madison)
    Title: Self-Improving Transformers: Overcoming Length Generalization Challenges
    Abstract: Large language models can perform algorithmic tasks through test-time computation but struggle to generalize far beyond the task difficulty of the training distribution. These limitations manifest even on simple tasks like arithmetic, string manipulation, and maze solving, where transformers learn shortcuts rather than the underlying algorithms. While prior solutions modify transformer architectures with task-specific engineering, we overcome these limitations with a general-purpose, self-improvement approach using standard transformers. Our method starts with models trained on simple problems, then iteratively uses them to generate training data for progressively harder tasks. Scaling this weak-to-strong training approach yields (seemingly) unbounded improvements in both length and hardness generalization, allowing models to solve problem instances far exceeding the difficulty of those in the training data distribution. We find that "controlled sampling" of problem difficulty and the ability to filter out "negative" self-labeled examples are key; without them, generalization performance plateaus. Our results show that careful self-supervision allows small transformers to transcend superficial pattern-matching failures and learn multi-step algorithms.
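
    The loop the abstract describes, i.e. train on easy problems, self-label slightly harder ones, filter out bad self-labels, and retrain, can be sketched in a few lines. The sketch below is an illustrative assumption, not the authors' actual pipeline: the task sampler, the `train`/`verify` callables, and the difficulty schedule are placeholders.

    ```python
    # Minimal sketch of a weak-to-strong self-improvement loop, assuming a
    # generic "difficulty" knob (e.g. operand length, string length, maze size).
    # All callables here are hypothetical stand-ins for the paper's components.
    import random
    from typing import Callable, List, Tuple

    Model = Callable[[str], str]  # maps a problem string to a candidate answer


    def sample_problems(difficulty: int, n: int = 100) -> List[str]:
        """'Controlled sampling': draw problems at one target difficulty level."""
        alphabet = "abcdefghij"
        return ["".join(random.choices(alphabet, k=difficulty)) for _ in range(n)]


    def self_improve(
        model: Model,
        train: Callable[[List[Tuple[str, str]]], Model],  # retrain on (problem, answer) pairs
        verify: Callable[[str, str], bool],               # filters "negative" self-labels
        start_difficulty: int,
        rounds: int,
    ) -> Model:
        difficulty = start_difficulty
        for _ in range(rounds):
            # Step only slightly beyond the current training distribution.
            difficulty += 1
            problems = sample_problems(difficulty)

            # Self-label the harder problems with the current model.
            candidates = [(p, model(p)) for p in problems]

            # Keep only examples that pass the filter; per the abstract,
            # skipping this step makes generalization plateau.
            data = [(p, a) for p, a in candidates if verify(p, a)]

            # Retrain (or fine-tune) on the filtered, self-generated data.
            model = train(data)
        return model
    ```

    In this reading, the two ingredients the abstract calls out map directly to `sample_problems` (controlled difficulty steps) and `verify` (discarding negative self-labeled examples) before each retraining round.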

Comments • 2

  • @sosson97 10 days ago +1

    great work!!

  • @pelayocf4558 7 days ago +2

    I wonder if the techniques shown in the paper can be applied to tasks other than arithmetic operations. The way language models multiply numbers reflects a general problem in how they work, and if we could teach them to operate in the way the paper suggests for any task, without having to repeat the techniques for each specific task type, it would represent a major advance in the utility of language models.