Very interesting presentation, thx
00:04 Cerebras aims to revolutionize AI compute with a co-designed architecture
02:06 Architecture focused on neural networks
06:25 Memory bandwidth enables full performance in neural network computation
08:36 Cerebras core hardware architecture flexibility
13:08 Cerebras chip has 84 dies with 850,000 cores on a single 300mm wafer
15:27 Homogeneous array of cores across the wafer for unprecedented fabric performance
19:21 Cerebras architecture utilizes dataflow mechanisms for weight computations
21:12 Single chip enables high-performance neural networks
25:02 Scalable clustering and wafer-scale chips enable large model access to everyone
Hi. There's something that a few people were wondering about: why is the Wafer-Scale Engine square, when it looks like there's room for ~28 more complete, attached tiles?
It's a good question! The answer is rather prosaic, we're afraid. If the WSE weren't rectangular, power delivery, I/O, mechanical integrity, and cooling would all become much more difficult, to the point of impracticality.
Take a look at the virtual teardown on our website and you may get a feel for some of these challenges: www.cerebras.net/cs2virtualtour
The upshot is that a mere 850,000 cores will just have to suffice. ;)
@@CerebrasSystems I think I get the idea, thanks.
@@CerebrasSystems Would it be possible to lop off some of those edge tiles to make mini engines?
I didn't catch that much about the routing protocol and how the dies actually communicate on the WSE-2. You guys have a lot of things going on. Congratulations 🎊 😊
Incredible work. How do you scale a trained model down so that you can put it in something smaller and run inference in real time for control of a system?
Is the CS-2 used only for training?
Will there come a time when this architecture is applicable to massively concurrent inference?
Hi Ralph, good question. The vast bulk of our customers have used our systems for training LLMs or for HPC applications.
We have had a couple of projects using it for inference, like one with Lawrence Livermore National Laboratory where they offloaded an unwieldy inference step from many nodes of their Lassen supercomputer to one of our systems. You can read the case study here: www.cerebras.net/cerebras-customer-spotlight-overview/spotlight-lawrence-livermore-national-laboratory/
But in principle, our architecture should make a terrific concurrent inference platform, because we can run many inferences (hundreds or even thousands, depending on the model) in parallel across our massive array of cores.
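To illustrate what that data-parallel serving pattern might look like, here is a minimal Python sketch. The replica count, the toy model, and the thread-pool stand-in are all hypothetical; on the actual hardware each replica would map to a group of cores on the wafer, not an OS thread.

from concurrent.futures import ThreadPoolExecutor

NUM_REPLICAS = 8  # hypothetical; the reply above describes hundreds to thousands

def model_forward(x):
    # Stand-in for one replica's forward pass over a single request.
    return x * 2

def serve(requests):
    # Fan independent requests out across the replicas; each request is
    # handled in parallel, so throughput scales with the replica count.
    with ThreadPoolExecutor(max_workers=NUM_REPLICAS) as pool:
        return list(pool.map(model_forward, requests))

print(serve(range(16)))  # 16 requests served by 8 parallel replicas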
Re: the die-to-die interface at about 15:15.
You mentioned you use an upper metal layer to cross the scribe lines between the dies. What does the reticle look like for this? Is this a regular mask whose alignment is just offset so it straddles the scribe lines of the rest of the wafer? Is this something TSMC does regularly for other products? Or is this a new process to have reticles on the same wafer that don't align on top of each other?
Aloha and thanks! Way to go! Just imagine what you will be doing ten years from now! Do you have a public roadmap?
Thanks, 808 Big Island! Sadly, no public roadmap. You'll just have to keep watching!
Was wondering if MemoryX is actually an independent device outside of the WSE-2 wafer? The fact that it has better sparse performance at the hardware level is very interesting.
What is the yield of that wafer, sir? Thank you.
Super fast, lightning-speed AI system. Great!
👀👀👀👀👀👀
If this WSE is really that good, why is nobody talking about Cerebras AI while Nvidia is still printing money?
Because they are f-u-c-k-e-d up
Because today's biggest models don't fit on one Cerebras chip
@@Marqui17 Hmm. So it's not possible to put together more of them in order to make the models fit on such a system?
@@hg6996 I guess you should be able to interconnect them and split the model across them, but then you are introducing the same complexities Nvidia has, taking away Cerebras' main advantage
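To make the trade-off in this thread concrete, here is a minimal Python sketch of layer-wise (pipeline) model parallelism. The device names, the toy layers, and the two-way split are all hypothetical; the stage boundary marks where the inter-chip transfers (the complexity mentioned above) would occur in a real system.

# Toy "model": four layers, assumed too big to fit on one device.
layers = [lambda x: x + 1, lambda x: x * 3, lambda x: x - 2, lambda x: x ** 2]

# Hypothetical partition of the layers across two wafer-scale devices.
stages = {"wafer_0": layers[:2], "wafer_1": layers[2:]}

def run_pipeline(x):
    for device, stage in stages.items():
        # In a real multi-chip system, crossing this boundary means a
        # network transfer between chips, the same cost that multi-GPU
        # model splitting pays.
        for layer in stage:
            x = layer(x)
    return x

print(run_pipeline(2))  # ((2 + 1) * 3 - 2) ** 2 = 49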