Sepp Hochreiter: Memory Architectures for Deep Learning

  • Published: 12 Feb 2024
  • Currently, the most successful Deep Learning architecture is the transformer. The attention mechanism of the transformer is equivalent to modern Hopfield networks and is therefore an associative memory. However, this associative memory has disadvantages: quadratic complexity in the sequence length when mutually associating sequence elements, a restriction to pairwise associations, limited ability to modify the memory, and insufficient abstraction capabilities. In contrast, recurrent neural networks (RNNs) such as LSTMs have linear complexity, associate each sequence element with a representation of all previous elements, can directly modify memory content, and have high abstraction capabilities. However, RNNs cannot store sequence elements that were rare in the training data, since RNNs have to learn to store. Transformers can store rare or even new sequence elements, which, besides their high parallelization, is one of the main reasons why they outperformed RNNs in language modelling. Future successful Deep Learning architectures should comprise both of these memories: attention for implementing episodic memories and RNNs for implementing short-term memories and abstraction. (A minimal sketch contrasting the two memory types follows the links below.)
    👉 More information about the lecture series "Machines that understand?": dm.cs.univie.a...
    👉 Research Group Data Mining and Machine Learning at the University of Vienna: dm.cs.univie.a...
    👉 Playlist Machines that understand? • Was bedeutet Generativ...
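To make the contrast concrete, here is a minimal NumPy sketch (not code from the lecture; the function names, the softmax temperature beta, and the plain tanh recurrence standing in for an LSTM cell are illustrative assumptions). It shows attention as modern-Hopfield-style associative retrieval, which compares every query with every stored pattern and is therefore quadratic in the sequence length, versus an RNN that compresses all previous elements into one fixed-size state with a single update per element, which is linear.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

# Attention as associative-memory retrieval (modern-Hopfield-style update):
# every query is associated with every stored pattern, hence O(L^2) pairs.
def attention_retrieve(queries, keys, values, beta=1.0):
    scores = beta * queries @ keys.T          # (L, L) pairwise similarities
    weights = softmax(scores, axis=-1)        # one softmax per query
    return weights @ values                   # weighted recall of stored values

# RNN-style memory: each element updates one fixed-size state, hence O(L).
# (Plain tanh recurrence as an illustrative stand-in for an LSTM cell.)
def rnn_scan(xs, W_h, W_x):
    h = np.zeros(W_h.shape[0])
    states = []
    for x in xs:                              # one state update per element
        h = np.tanh(W_h @ h + W_x @ x)
        states.append(h)
    return np.stack(states)

L, d = 6, 4
rng = np.random.default_rng(0)
X = rng.normal(size=(L, d))
out_attn = attention_retrieve(X, X, X)        # touches all L*L element pairs
out_rnn = rnn_scan(X, rng.normal(size=(d, d)), rng.normal(size=(d, d)))
print(out_attn.shape, out_rnn.shape)          # (6, 4) (6, 4)
```

The sketch also reflects the storage argument in the description: the attention memory keeps every sequence element verbatim in its keys and values, so rare or new elements remain retrievable, whereas the RNN must learn to encode them into its compressed state.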
