Man I don't know anyone who explains things in a clear, technical, and straightforward way. This is great stuff.
Thank you! I try my best to make the content digestible
These videos are amazing! You are an absolutely brilliant teacher. I have just made the viewing of this video mandatory for my entire team. The ability to explain complex topics in an understandable way is a sign of genius - thank you.
Thank you! Hopefully the training corpus one is as helpful
Edit: My father-in-law had a heart attack this morning. Fortunately he is going to be fine, but the universe is opposed to me being on schedule this week. My apologies all.
Hey all, the LoRA training video is almost done! I decided to add some extra material on how to prepare and preprocess datasets, since the video didn’t feel super cohesive without it. I figured you all would prefer quality over rushing, so I’ll have it out by the morning.
This one was easy to follow and understand. Thank you for the great tutorial!
Thank you!
Wow! Love your detailed explanations and code examples. Subscribed ❤
Thank you, I’m glad it was helpful!
Yet another easy to understand video. Thank you for the time and effort you put into these!
Thanks, it means a lot!
Japanese is read left to right, unless you are talking about vertical text (in that case it is top to bottom, right to left), but most Japanese text online, which is what an LLM would be trained on, would be regular left to right.
I was thinking about tategaki, I should have been more specific. Thanks for the correction!
Great video!
Are sentence embeddings simply constructed from aggregating word embeddings and applying some operation, such as mean pooling or max pooling?
Indeed, typically you can capture the output from attention and use that as an embedding
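To make the pooling idea concrete, here’s a minimal NumPy sketch of mean and max pooling over token embeddings. The numbers are made up for illustration; in practice the token embeddings would come from a transformer’s last hidden state, and you would mask out padding tokens just like this:

```python
import numpy as np

# Toy token embeddings for a 4-token sequence, embedding dim 3.
# In a real model these come from the transformer's last hidden state.
token_embeddings = np.array([
    [0.1, 0.2, 0.3],
    [0.4, 0.5, 0.6],
    [0.7, 0.8, 0.9],
    [1.0, 1.1, 1.2],
])
attention_mask = np.array([1, 1, 1, 0])  # last token is padding

# Mean pooling: average only over the non-padding tokens.
masked = token_embeddings * attention_mask[:, None]
sentence_embedding = masked.sum(axis=0) / attention_mask.sum()

# Max pooling alternative: element-wise max over the real tokens.
max_pooled = token_embeddings[attention_mask == 1].max(axis=0)

print(sentence_embedding)  # [0.4 0.5 0.6]
print(max_pooled)          # [0.7 0.8 0.9]
```

Libraries like sentence-transformers do essentially this (usually mean pooling with the attention mask) as a pooling layer on top of the encoder.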
I had been meaning to learn about LoRA for the past several days, and you just rocked it. It is so simple. Thanks a lot, and I really appreciate your videos.
One more thing: I am currently doing my PhD, and my topic is vision-language pre-trained models. However, my main problem so far is that I have limited resources; at most I can get two 3090 GPUs. I would appreciate any useful suggestions regarding pre-training large vision-language models with such resources. I would also welcome any collaborations in this regard. Thank you so much again!
ViTs use a ton of resources for training, though you do have some options.
1. If you have some funds, you could use Lambda Labs which lets you rent an A100 for as little as $1.10/hr. I use them for work and they have been great.
2. You could always try 8/4-bit quantizing them, GPTQ for LLaMA has a great example of how to implement the algorithm.
3. There is the Hyena paper which shows some ways to diagonalize the attention layer, but there is no algorithm for it yet.
My discord is Aemon Algiz#0033 if you’d like to chat.
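To give a feel for what low-bit quantization in option 2 is doing, here’s a toy round-to-nearest 4-bit sketch in NumPy. This is the naive baseline that GPTQ improves on (GPTQ additionally works column-by-column and corrects the remaining weights for accumulated quantization error); the weight values here are made up:

```python
import numpy as np

# A toy weight row; real quantizers operate on full weight matrices.
weights = np.array([0.12, -0.53, 0.97, -1.40, 0.08, 0.66])

# Symmetric round-to-nearest signed 4-bit quantization: integers in [-8, 7].
n_levels = 2 ** 4 // 2 - 1                     # 7 positive levels
scale = np.abs(weights).max() / n_levels       # one scale per row
q = np.clip(np.round(weights / scale), -n_levels - 1, n_levels).astype(np.int8)

# Dequantize to see how much precision was lost.
dequantized = q * scale
max_error = np.abs(weights - dequantized).max()
```

Each weight is stored as a 4-bit integer plus a shared scale, which is where the memory savings come from; the error of round-to-nearest is bounded by half the quantization step.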
@AemonAlgiz Thank you, I will explore each of these options, especially 4-bit quantization with GPTQ for LLaMA.
Such a good explanation! By the way, is instructor-xl multilingual?
You know, I’m honestly not sure, though I suspect it’s primarily English. There are multilingual embedding models, though. I’ll check when I get home!
@AemonAlgiz Thank you!
So, it looks like it is English, though this one is multilingual:
huggingface.co/sentence-transformers/stsb-xlm-r-multilingual
I would personally test instructor-xl and see if it works for multilingual text.
This is exactly where my research is lacking.
I’m glad it was helpful!
Hey, thanks for the great video, but the IDE font is a little bit small. The code part is fine, but the project file names and debugger information are hard to read.
Hey mate, your audio and video are not in sync, maybe out by a second.
Huh, I wonder why that’s happened. It seemed fine before I uploaded it.
I’m glad I discovered this valuable resource. Your simple, straightforward explanations are very helpful. One suggestion: The thin white scribe pen you use is difficult to follow at times, so if you could supplement it with a surrounding halo or some other highlight those of us with slight visual impairment would appreciate it. 🫡
I can do that! Do you have an example of what would be best for you?