Data Science for Infrastructure w Pixie CEO Zain Asgar | Stanford MLSys #47

MedAI #41: Efficiently Modeling Long Sequences with Structured State Spaces | Albert Gu

Hardware-aware Algorithms for Sequence Modeling - Tri Dao | Stanford MLSys #87

Noob To Pro With DRAGON REWORK in Blox Fruits

The Battle Over NYC Congestion Pricing

Nardwuar vs. Chappell Roan

Efficiently Modeling Long Sequences with Structured State Spaces - Albert Gu | Stanford MLSys #46

Stanford MLSys Seminars

Просмотров 20 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 15 янв 2025

Комментарии • 14

@tarepan_YT 3 года назад ⁺⁹
Impressive works, clear presentation, and intriguing discussions!
Thanks for sharing great seminar.
@BREAKDRS 2 года назад
Very well organized and easy to follow. Thanks!
@salehgholamzadeh3368 2 года назад ⁺³
A really Great Talk.
Can S4 be integrated into Reinforcement Learning?
@brandomiranda6703 2 года назад ⁺²
How do you think it will compare with memorizing transformers?
@m.d.4979 9 месяцев назад
Hello! Great talk! I am currently studying your SSM-related works. They are amazing! Please share your ideas, challenges, and outcomes for implementing your MAMBA model into human(sports athlete) action forecasting. Thank you for your kind reply!
@sabrango 8 месяцев назад
Amazing
@gebob19 2 года назад
really great talk
@halilibrahimakgun7569 2 месяца назад
can yu share slides
@stergiosbachoumas2476 Год назад ⁺⁵
With regards to the stability question and I repeat: "Are Hippo A matrices stable?": The answer is that they are not stable or Hurwitz as we say in Control Theory because their eigenvalues are outside the unit circle. This is trivial to show as they are Lower triangular and therefore their eigenvalues are sitting on the diagonal. Thus the eigenvalues are 1,2,...,n+1 for an (nxn) matrix. Unfortunately, the organizers did not let Albert share his screen to show the form of the A matrix again. With this information now it would be very interesting to talk again about stability because Albert said that they are stable, well in what sense? Also, it's very interesting that other stable matrices do not lead to good learning.
@jonathanballoch Год назад ⁺²
what are the implications of this instability
@stergiosbachoumas2476 Год назад ⁺¹⁵
@@jonathanballoch What my comment above says is all wrong, the HiPPO matrix is stable because the eigenvalues are -1,-2,...,-n+1 all in the Left hand plane (i.e. negative). I forgot to come back and delete this comment but I will leave it here to remind myself that I must be more careful next time.
@simonl1938 Год назад
I'm trying to implement the S4 myself in C right now and have the issue of the state exploding, I don't see how the matrix is stable at all. Do you have any suggestions on what I should look into?
@swfsql 11 месяцев назад
@@stergiosbachoumas2476 Thanks for your update. Just a question, by being on the left hand plane are you referring to the root places in space state control theory?
@rohanasokan7338 9 месяцев назад ⁺²
@@simonl1938 To add to Stergios. It seems Gu keeps the matrix form exploding by keeping the matrix in the left hand plane and he is doing that by limiting the real part of the diagonal to -1/2. There are some ablations he does to this in his dissertation if you are interested. And because it is on the left hand plane, the entire formulation will transform to the complex unit circle. In positive real space, you will always have the state explosion problem.

Следующие

Автовоспроизведение

Data Science for Infrastructure w Pixie CEO Zain Asgar | Stanford MLSys #47

Data Science for Infrastructure w Pixie CEO Zain Asgar | Stanford MLSys #47

MedAI #41: Efficiently Modeling Long Sequences with Structured State Spaces | Albert Gu

MedAI #41: Efficiently Modeling Long Sequences with Structured State Spaces | Albert Gu

Hardware-aware Algorithms for Sequence Modeling - Tri Dao | Stanford MLSys #87

Hardware-aware Algorithms for Sequence Modeling - Tri Dao | Stanford MLSys #87

Noob To Pro With DRAGON REWORK in Blox Fruits

Noob To Pro With DRAGON REWORK in Blox Fruits

The Battle Over NYC Congestion Pricing

The Battle Over NYC Congestion Pricing

Nardwuar vs. Chappell Roan

Nardwuar vs. Chappell Roan

Boston FBI announce arrest of two Iranians in connection with fatal drone strike

Boston FBI announce arrest of two Iranians in connection with fatal drone strike

Notes on AI Hardware - Benjamin Spector | Stanford MLSys #88

Notes on AI Hardware - Benjamin Spector | Stanford MLSys #88

Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83

Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83

Structured State Space Models for Deep Sequence Modeling (Albert Gu, CMU)

Structured State Space Models for Deep Sequence Modeling (Albert Gu, CMU)

Text2SQL: The Dream versus Reality - Laurel Orr | Stanford MLSys #89

Text2SQL: The Dream versus Reality - Laurel Orr | Stanford MLSys #89

Monarch Mixer: Making Foundation Models More Efficient - Dan Fu | Stanford MLSys #86

Monarch Mixer: Making Foundation Models More Efficient - Dan Fu | Stanford MLSys #86

Deep Papers Episode 2 - Hungry Hungry Hippos, aka H3

Deep Papers Episode 2 - Hungry Hungry Hippos, aka H3

The State Space Model Revolution, with Albert Gu

The State Space Model Revolution, with Albert Gu

The Next 100x - Gavin Uberti | Stanford MLSys #92

The Next 100x - Gavin Uberti | Stanford MLSys #92

Lecture 2 | Image Classification

Lecture 2 | Image Classification

✈️ МЫ ЗАСНЯЛИ САМОЛЁТ ПОЖИРАТЕЛЬ НА СКРЫТЫЕ КАМЕРЫ В МАЙНКРАФТ! ШЕДИ ЛЕСКА И НУБИК MINECRAFT

✈️ МЫ ЗАСНЯЛИ САМОЛЁТ ПОЖИРАТЕЛЬ НА СКРЫТЫЕ КАМЕРЫ В МАЙНКРАФТ! ШЕДИ ЛЕСКА И НУБИК MINECRAFT

МИМ! ЛУЧШЕ ИГНОРИРУЙ ЕГО...СТРАШНАЯ ИСТОРИЯ НА НОЧЬ

МИМ! ЛУЧШЕ ИГНОРИРУЙ ЕГО...СТРАШНАЯ ИСТОРИЯ НА НОЧЬ

REAL MEWING TUTORIAL💀#real #mewing #tutorial #jawline #shelove

REAL MEWING TUTORIAL💀#real #mewing #tutorial #jawline #shelove

بوتش يتفاعل مع أحذية DIY من الماجستير

بوتش يتفاعل مع أحذية DIY من الماجستير

Squid Game Mingle Music Minecraft Art? 😳 #Shorts

Squid Game Mingle Music Minecraft Art? 😳 #Shorts

БЬЕМ РЕКОРД СМАЕВА! Сарычев, Цыпленков, Ловчев, Чуботару, Токарев

БЬЕМ РЕКОРД СМАЕВА! Сарычев, Цыпленков, Ловчев, Чуботару, Токарев

SUBO НЕ МОЖЕТ? MELLSTROY ПОМОЖЕТ! / ТАМАЕВ vs ВЕНГАЛБИ / МЕНЯ КУПИЛ АРУТ?

SUBO НЕ МОЖЕТ? MELLSTROY ПОМОЖЕТ! / ТАМАЕВ vs ВЕНГАЛБИ / МЕНЯ КУПИЛ АРУТ?

Магнитные РАСШИРИТЕЛИ для НОСА… **это сработало** // тгк: маркус тут✨

Магнитные РАСШИРИТЕЛИ для НОСА… **это сработало** // тгк: маркус тут✨