I think a more symbolically grounded environment approach to getting smaller models to train themselves on better problem-solving processes would work quite well in a variety of domains.
So I'm really glad to see rStar-Math serve as the proof of concept for such an approach on sub-10B models.
Was waiting for this one 🥳
Great job! Interesting paper :)
If humanity survives, 2025 is going to be great.
One of the reasons self-evolution performs suboptimally on other datasets could be that the best answers are already ranked highest, so forcing answers makes it worse.
While the rStar-Math approach is intellectually interesting and proves what's possible, it's not particularly accessible for most practitioners. Thanks for the update anyway.
Shoutout to the "lean-star" researchers. Shoutout to Microsoft too; their research takes it to the next level, but lean-star ignited the engine.