HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

What is Retrieval Augmented Generation (RAG) and JinaAI?

But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

We Were Wrong About Gold's Origin

UFC 302 RECAP: Islam Makhachev scores late submission of Dustin Poirier to retain title | CBS Sports

BRUTAL KO | Zhilei Zhang vs. Deontay Wilder Highlights (Queensberry vs. Matchroom - Riyadh Season)

Inside the LLM: Visualizing the Embeddings Layer of Mistral-7B and Gemma-2B

Chris Hay

Просмотров 5 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 9 мар 2024
We look deep into the AI and look at how the embeddings layer of a Large Language Model such as Mistral-7B and Gemma-2B actually works.
You will learn how tokens and embeddings work and even extract out and load the embeddings layer from Gemma and Mistral into your own simple model, which we will use to visualize the model
You will see how an AI clusters terms together and how it can cluster similar words, build connections which cover not just similar words but also grouping of concepts such as colors, hotel chains, programming terms.
If you really want to understand how an LLM's works or even build your own LLM then starting with the first layer of a Generative AI model is the best place to start.
Github
-----------
github.com/chrishayuk/embeddings
Наука

Комментарии • 30

@chrishayuk 2 месяца назад ⁺²
this is the github repo: github.com/chrishayuk/embeddings
@scitechtalktv9742 2 месяца назад ⁺³
Fantastic video !
I am wondering: I think it would also be very interesting to also be able have a visualization of not only the static embeddings you already did, but also a visualization of the so-called contextualized embeddings in a later layer of the model! These are the embeddings that are exposed to the attention mechanism. That why they are also called dynamic embeddings.
It adds another layer of abstraction, but are better embeddings because they are able to distinguish between homonyms: words that are the same but have completely other meanings if used in another context. A good example is the word “bank”, that has several different meanings when used in another context (for example financial institution or river bank and several other meanings! ). As a consequence the word “bank” will be represented by several different vectors in embedding space, depending on the context it is used in!
This technique is called Word Sense Disambiguation (WSD).
Would it be possible to visualize that too? I am curious….
@chrishayuk 2 месяца назад ⁺¹
yep, you got what i'm doing... i'm literally walking the stack
@chrishayuk 2 месяца назад ⁺¹
so those videos will be coming
@scitechtalktv9742 2 месяца назад ⁺¹
@@chrishayukFantastic ! Those embeddings are crucially important for the workings of Large Language Models !
@sumandawnmobile 2 месяца назад ⁺¹
Its an great video to understand the internals via the visualization. Thanks Chris.
@rajneesh31 2 дня назад
Damn, thank you RUclips for recommending this channel. @chrishayuk is a gun. Thanks Chris
@NERDDISCO 2 месяца назад ⁺³
This came to the absolute right time! Thank you very much! I was just trying to understand this. Now I know how it works ❤
@chrishayuk 2 месяца назад ⁺¹
Glad it was helpful!
@johntdavies 2 месяца назад ⁺²
Great insight, thanks for posting this. It would be interesting to show how a fine-tuned model differs in similarities and "vocabulary". I'm also curious on the effects of quantisation, i.e. Q4, Q6, Q8, fp16 etc. on the internal "workings" of the LLM. Thanks again.
@chrishayuk 2 месяца назад ⁺¹
It’s almost like you’re reading my roadmap
@khalilbenzineb 2 месяца назад ⁺²
I was playing a bit with finetuning to force an output schema for some 7B Models, but lately I discovered schema grammar, which is a way to dynamically play with the EOS tokens, by limiting them to a specific set of tokens, to generate the output you want, This is very stable and way efficient for many cases that we may think it requires finetuning, For me it felt like a new dimension to get the model intentions inline, I loved the unique and efficient way you create your videos, So I wanted to ask you if possible to create a video for us about this, I feel it's very important
@chrishayuk 2 месяца назад ⁺²
that's a good shout
@khalilbenzineb 2 месяца назад
Thx@@chrishayuk
@andypai 2 месяца назад ⁺¹
Thank you! Great video!
@chrishayuk 15 дней назад
thank you, glad it was useful
@kenchang3456 2 месяца назад ⁺¹
Thanks the visualization really helped me.
@chrishayuk 2 месяца назад ⁺¹
so glad, seeing it at a lower level really demystifies what's going on
@Memes_uploader 2 месяца назад ⁺¹
Thank you so much! Thank you youtube algorithm for showing such a great video!
@chrishayuk 2 месяца назад
Glad you enjoyed it!
@lfzuniga31 2 месяца назад ⁺¹
based
@enlightenment5d 2 месяца назад ⁺¹
Good! Where can I find your programs?
@chrishayuk 15 дней назад
in my github repo github.com/chrishayuk
@gregherringer7700 2 месяца назад ⁺¹
This helps thanks!
@chrishayuk 2 месяца назад
Glad it helped! :)

Следующие

Автовоспроизведение

HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

What is Retrieval Augmented Generation (RAG) and JinaAI?

What is Retrieval Augmented Generation (RAG) and JinaAI?

But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

We Were Wrong About Gold's Origin

We Were Wrong About Gold's Origin

UFC 302 RECAP: Islam Makhachev scores late submission of Dustin Poirier to retain title | CBS Sports

UFC 302 RECAP: Islam Makhachev scores late submission of Dustin Poirier to retain title | CBS Sports

BRUTAL KO | Zhilei Zhang vs. Deontay Wilder Highlights (Queensberry vs. Matchroom - Riyadh Season)

BRUTAL KO | Zhilei Zhang vs. Deontay Wilder Highlights (Queensberry vs. Matchroom - Riyadh Season)

Can Bryson DeChambeau Beat Me With Kids’ Clubs?

Can Bryson DeChambeau Beat Me With Kids’ Clubs?

Ollama 0.1.26 Makes Embedding 100x Better

Ollama 0.1.26 Makes Embedding 100x Better

Orignal transformer paper "Attention is all you need" introduced by a layman | Shawn's ML Notes

Orignal transformer paper "Attention is all you need" introduced by a layman | Shawn's ML Notes

Merge LLMs to Make Best Performing AI Model

Merge LLMs to Make Best Performing AI Model

Mistral 7B Dolphin Uncensored - Is This The New SMALL KING? 👑

Mistral 7B Dolphin Uncensored - Is This The New SMALL KING? 👑

The Best Tiny LLMs

The Best Tiny LLMs

The Most Important Algorithm in Machine Learning

The Most Important Algorithm in Machine Learning

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Generative AI in a Nutshell - how to survive and thrive in the age of AI

MAMBA from Scratch: Neural Nets Better and Faster than Transformers

MAMBA from Scratch: Neural Nets Better and Faster than Transformers

World’s Fastest Talking AI: Deepgram + Groq

World’s Fastest Talking AI: Deepgram + Groq

🌑 Невероятный кубический дециметр на 3D принтере Creality K1 Max #3dprinting #Shorts Игорь Белецкий

🌑 Невероятный кубический дециметр на 3D принтере Creality K1 Max #3dprinting #Shorts Игорь Белецкий

ЧАСТЬ 5 СОБРАЛ ИГРОВОЙ ПК ЗА 15000 РУБЛЕЙ #сборкапк #пк #собратькомпьютер

ЧАСТЬ 5 СОБРАЛ ИГРОВОЙ ПК ЗА 15000 РУБЛЕЙ #сборкапк #пк #собратькомпьютер

⚡️⚡️⚡️ MITSUBISHI 🔴 MD165144 or (MD175515)❗️❗️❗️ #automobile #restoreECU

⚡️⚡️⚡️ MITSUBISHI 🔴 MD165144 or (MD175515)❗️❗️❗️ #automobile #restoreECU

🔥Студийный микрофон за 1700 руб 😳 WB 176666441 ⬆️

🔥Студийный микрофон за 1700 руб 😳 WB 176666441 ⬆️

Полезные программы для Windows

Полезные программы для Windows

AMD врывается с двух ног на Computex 2024 (мама - это точно для учёбы!)

AMD врывается с двух ног на Computex 2024 (мама - это точно для учёбы!)

Полгода с iPhone 15 Pro Max от профессионала. Продал!

Полгода с iPhone 15 Pro Max от профессионала. Продал!

Power up all cell phones.

Power up all cell phones.