RoPE Rotary Position Embedding to 100K context length

  • Published: 22 May 2024
  • RoPE (Rotary Position Embedding) explained in simple terms: calculating self-attention in Transformers with a relative position encoding for the extended context lengths of LLMs.
    All rights w/ authors:
    ROFORMER: ENHANCED TRANSFORMER WITH ROTARY POSITION EMBEDDING (RoPE)
    arxiv.org/pdf/2104.09864
    #airesearch
    #aiexplained
  • Science
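For readers skimming past the description: the mechanism the video covers can be sketched in a few lines. This is a minimal illustration based on the RoFormer paper, not the video's own code; it assumes the standard base of 10000 and the interleaved-pair convention (some implementations rotate the two halves of the vector instead).

```python
import math

def rope_rotate(x, pos, base=10000.0):
    """Apply RoPE to one token vector x (even length) at position pos.

    Each consecutive pair (x[2i], x[2i+1]) is rotated by the angle
    pos * theta_i, where theta_i = base ** (-2*i / d).
    """
    d = len(x)
    out = []
    for i in range(0, d, 2):
        theta = base ** (-i / d)          # frequency of this dimension pair
        angle = pos * theta
        c, s = math.cos(angle), math.sin(angle)
        # standard 2-D rotation of the pair (x[i], x[i+1])
        out.extend([x[i] * c - x[i + 1] * s,
                    x[i] * s + x[i + 1] * c])
    return out

q = rope_rotate([1.0, 0.0, 1.0, 0.0], pos=3)  # rotated query vector
```

Because each pair is rotated by an angle proportional to the position, the dot product between a rotated query at position m and a rotated key at position n depends only on the offset m - n, which is what makes the encoding relative.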

Comments • 8

  • @desmur36 • a month ago

    Amazing content! The explanations are SO clear! Thank you!

  • @LamontCranston-qh2rv • a month ago +1

    Thank you SO MUCH for providing such high quality content! Very much enjoying all your many videos! If you have a chance, I'd love to see you discuss the recent work in giving AI spatial reasoning, i.e. artificial "imagination". (In its natural form, very much a core feature of human thought.) Perhaps one might think about the creation of a "right brain" to go along with the "left brain" language models we have now? (Please forgive the over-simplification of human neuroscience.) Thanks again! All the best to you sincerely!

  • @AYUSHSINGH-db6ev • a month ago

    Hi Sir! Really love your videos! How can we access your presentation slides?

  • @mshonle • a month ago

    If one rotation is good, how about going into three dimensional rotations and using quaternions? Is there any work using that?

  • @paratracker • a month ago +1

    Maybe it's obvious to YOU that the solution is that complex exponential, but I wish you hadn't assumed that WE would all find it as self-evident as you do.

    • @code4AI • a month ago +9

      I see what you mean. You know, I spent some days finding simple explanations for the not-so-self-explanatory RoPE algorithm, especially since I will build on this in my second video, where we examine more complex, more recent ideas about RoPE. I decided on an approach that will enable my audience to understand the main ideas and methods and go from there. I recorded 90 minutes for the second part, and I am currently cutting it down to a maximum of 60 minutes, striking a balance of providing insights for all my viewers. I'll try harder ....

  • @hangjianyu • 10 days ago

    There is a mistake: the smaller dimensions change more quickly, and the larger dimensions change more slowly.
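The commenter's point can be checked directly from the RoPE frequency schedule theta_i = base ** (-2*i/d): the lowest dimension pairs get the largest angular frequency and therefore rotate fastest. A small sketch, assuming the standard base of 10000:

```python
def pair_frequencies(d, base=10000.0):
    """Angular frequency theta_i = base ** (-2*i/d) for each of the d//2 dimension pairs."""
    return [base ** (-2 * i / d) for i in range(d // 2)]

freqs = pair_frequencies(8)
# freqs[0] equals 1.0 (the fastest-rotating pair); the frequencies then
# decay geometrically toward 1/base for the highest dimension pairs.
```

So the first (smallest-index) dimensions sweep through their rotation angles quickly as the position grows, while the last dimensions rotate very slowly, which is exactly the behavior the comment describes.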