Are Claude 3.5 Sonnet, Llama-3 and Gemini choosing speed over quality?

  • Published: 10 Jul 2024
  • In this video, Chris looks at how model providers are trending toward grouped-query attention (GQA) over traditional multi-head attention (MHA) in transformer models, and how this is impacting output in areas such as summarization. Chris shows that you get more coherent output from models such as Llama-2 or Claude 3 Opus than from newer models such as Llama-3, Gemini, or Gemma. In the end, in certain scenarios such as summarization or generative content, GPT-4o still beats Sonnet.
    repo
    github.com/chrishayuk/mha_gqa...
  • Science
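The description's core contrast can be sketched in a few lines: in multi-head attention every query head has its own key/value head, while grouped-query attention shares each key/value head across a group of query heads. The following is a minimal NumPy sketch, not the repo's actual code; all shapes and names are illustrative.

```python
import numpy as np

def attention(q, k, v):
    # q: (n_heads, seq, d); k, v: (n_kv_heads, seq, d)
    # If n_kv_heads == n_heads this is plain MHA; if fewer, it's GQA.
    n_heads, seq, d = q.shape
    n_kv = k.shape[0]
    group = n_heads // n_kv          # query heads per KV head
    out = np.empty_like(q)
    for h in range(n_heads):
        kv = h // group              # MHA: kv == h; GQA: several h share one kv
        scores = q[h] @ k[kv].T / np.sqrt(d)
        w = np.exp(scores - scores.max(-1, keepdims=True))
        w /= w.sum(-1, keepdims=True)        # row-wise softmax
        out[h] = w @ v[kv]
    return out

rng = np.random.default_rng(0)
q = rng.standard_normal((8, 4, 16))          # 8 query heads, seq 4, dim 16
k8, v8 = rng.standard_normal((2, 8, 4, 16))  # MHA: one KV head per query head
k2, v2 = rng.standard_normal((2, 2, 4, 16))  # GQA: 2 KV heads, shared by groups of 4
print(attention(q, k8, v8).shape, attention(q, k2, v2).shape)  # both (8, 4, 16)
```

The output shape is identical in both modes; what changes is how much distinct key/value state the model carries, which is exactly the trade-off the video explores.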

Comments • 14

  • @makepeace88
    @makepeace88 8 days ago +1

    I just attended a detailed anatomy-of-an-LLM session.. and it's just wow! Nobody else is telling these details. Thanks very much Chris ❤

    • @chrishayuk
      @chrishayuk  8 days ago

      Glad it was useful. I skipped a lot of details, as I wanted to keep the focus on MHA vs GQA. I'll probably do some other videos on the other details

  • @trsd8640
    @trsd8640 9 days ago +1

    Great video! I didn't understand it fully and had to watch it again, but I'm getting an idea of what is happening! Thank you!

    • @chrishayuk
      @chrishayuk  9 days ago +2

      it was quite a tough one to record, as i'm trying to avoid explaining the entire transformer architecture and attention fully (i'll do that in another video), while still showing how this architectural change affects model output. it was a weird balance, and apologies that i never explained it enough
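The practical pressure behind the switch is worth a back-of-the-envelope calculation: at inference time every layer caches keys and values per KV head, so cutting KV heads shrinks the KV cache proportionally. The configs below (32 layers, head dim 128, fp16, 32 vs 8 KV heads, roughly Llama-2-7B-like vs Llama-3-8B-like) are illustrative assumptions, not measurements from the video.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len, bytes_per_param=2):
    # Factor of 2 covers keys AND values; fp16 -> 2 bytes per parameter.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_param

seq = 8192
mha = kv_cache_bytes(32, 32, 128, seq)  # MHA-style: 32 KV heads
gqa = kv_cache_bytes(32, 8, 128, seq)   # GQA-style: 8 KV heads
print(mha / 2**30, gqa / 2**30)  # 4.0 GiB vs 1.0 GiB: a 4x memory saving
```

That 4x smaller cache is what buys the speed and batch-size gains, and the video's argument is that it may come at some cost to output coherence.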

  • @danielhenderson7050
    @danielhenderson7050 9 days ago +2

    This was very interesting

    • @chrishayuk
      @chrishayuk  9 days ago

      Glad you enjoyed, definitely a fun rabbit hole

  • @everyhandletaken
    @everyhandletaken 9 days ago +1

    Interesting!
    Claude 3.5 Sonnet is definitely great for code, much better than GPT-4o, and has really helped me solve things that are well beyond my brain capacity in the last few days.

    • @chrishayuk
      @chrishayuk  9 days ago

      totally agree, much better for code than gpt-4o

  • @Leo-ph7ow
    @Leo-ph7ow 9 days ago +2

    Excellent content! Thanks!

  • @seanknowles9985
    @seanknowles9985 9 days ago

    Intel agencies are having their fill first. It's obviously being slowed down so three-letter agencies can get ahead of this.

    • @chrishayuk
      @chrishayuk  9 days ago

      lol, i'm sure three-letter agencies are having their say, but i suspect it's not on MHA vs GQA. would love to hear that conversation if they were