AI Hardware, Explained.

a16z

30K views

Published: Dec 30, 2024

Comments • 44

@a16z 1 year ago ⁺⁴
For a sneak peek into parts 2 and 3, they're already live on our podcast feed! Animated explainers coming soon.
a16z.simplecast.com/
@cmichael981 1 year ago
doesn't look like part 2/3 are up on the podcast feed (anymore at least) - any chance those video explainers are coming out still?
@a16z 1 year ago ⁺⁶
Timestamps:
00:00 - AI terminology and technology
03:54 - Chips, semiconductors, servers, and compute
05:07 - CPUs vs GPUs
06:16 - Future architecture and performance
07:12 - The hardware ecosystem
09:20 - Software optimizations
11:45 - What do we expect for the future?
14:25 - Sneak peek into the series
@jack_fischer 1 year ago ⁺¹²
The music is very distracting. Please tone down in the future
@NarsingRaoschoolknot 9 months ago ⁺¹
Well done, very clean and clear. Love your simplicity
@AlexHirschMusic 11 months ago ⁺³
This is highly informative and easy to understand. As an idiot, I really appreciate that a lot.
@Inclinant 10 months ago
With floating-point numbers usually represented in 32 bits, is this why quantized LLMs can be so much smaller, at around 4 bits with ExLlama, which makes it so much easier to fit models into the smaller amounts of VRAM that consumer GPUs have?
Incredible video, the interviewer asks really thought-provoking and relevant questions while the interviewee is extremely knowledgeable as well. It's broken down so well too!
Also, extremely grateful to a16z for supporting The Bloke's work in LLM quantization! High-quality quantization and simplified instructions make LLMs so much easier to use for the average joe.
Thanks for creating this video.
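A rough back-of-the-envelope sketch of the size question above, in Python. The 7-billion-parameter count is a hypothetical example, not a figure from the video, and the estimate covers the weights only (no activations, KV cache, or runtime overhead).

```python
def weight_memory_gib(num_params: int, bits_per_weight: int) -> float:
    """Memory for the model weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 2**30


# Hypothetical model size, chosen only for illustration.
params = 7_000_000_000

for bits in (32, 16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_memory_gib(params, bits):.1f} GiB")
```

Going from 32-bit to 4-bit weights cuts the weight storage by a factor of 8, which is the main reason a quantized model may fit into consumer VRAM when the full-precision one does not.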
@msclrhd 7 months ago
It's a trade-off between accuracy and space/performance (i.e. being able to fit the model on local hardware). A 1-bit number could represent (0, 1) or (0, 0.5) as it only has 2 values. With 2 bits you can store 4 values, so you could represent (0, 1, 2, 3), signed values (-2, -1, 0, 1), float between 0 and 1 (0, 0.25, 0.50, 0.75), etc. depending on the representation. The more bits you have the better the range (minimum, maximum) of values you can store, and the precision (gap or distance) between each value.
Ideally you want enough bits to keep the weights of the model close to their trained values so you don't significantly alter the behaviour of the network. Generally a quantization of 6-8 bits offers comparable accuracy (perplexity score) to the original, and below that you get an exponential degradation in accuracy, with below 4 bits being far worse.
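The trade-off described above can be sketched in a few lines of Python. This is a minimal illustration using plain uniform (evenly spaced) quantization, which is an assumption made here for simplicity; real LLM quantizers are more elaborate. The function names and the toy weight list are hypothetical.

```python
def levels(bits, lo=0.0, hi=1.0):
    """The 2**bits evenly spaced values starting at lo, as in the 2-bit example above."""
    n = 2 ** bits
    step = (hi - lo) / n
    return [lo + i * step for i in range(n)]


def quantize_dequantize(weights, bits):
    """Snap each weight to the nearest of 2**bits values spanning [min, max]."""
    lo, hi = min(weights), max(weights)
    steps = 2 ** bits - 1
    scale = (hi - lo) / steps if hi > lo else 1.0
    return [lo + round((w - lo) / scale) * scale for w in weights]


def max_error(weights, bits):
    """Largest absolute difference between a weight and its quantized value."""
    restored = quantize_dequantize(weights, bits)
    return max(abs(w - q) for w, q in zip(weights, restored))


# Toy "weights": 101 evenly spaced values from -0.5 to 0.5.
weights = [i / 100 for i in range(-50, 51)]

print(levels(2))  # [0.0, 0.25, 0.5, 0.75]
for bits in (2, 4, 8):
    print(f"{bits}-bit max error: {max_error(weights, bits):.4f}")
```

Each extra bit doubles the number of representable values, so the worst-case rounding error shrinks as the bit width grows. How much that error hurts a given model is an empirical question this sketch does not answer.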
@lnebres 1 year ago ⁺¹
An excellent primer for beginners in the field.
@TINTUHD 1 year ago ⁺²
Great video. Tip of the computation innovation
@Matrix1Gamer 11 months ago
Guido Appenzeller is speaking my language. The lithography of chips is shrinking while they consume lots of power. Parallel computing is definitely going to be widely adopted going forward. RISC-V might replace the x86 architecture.
@lerwenliu9263 10 months ago
Love this Channel! Could we also look at the hunger for energy and its impact on climate change?
@dinoscheidt 1 year ago
1:24 Ehm… I would like to know, what camera and lens/focal length you use to match the boom arm and background bokeh so perfectly 🤐
@StephSmithio 1 year ago ⁺³
I use the Sony a7iv camera with a Sony FE 35mm F1.4 lens! I should note that good lighting and painting the background dark does wonders though too
@kymtoobe 6 months ago ⁺¹
This is a good video.
@nvr1618 1 year ago
Excellent video. Thank you and well done
@Doggieluv25 1 year ago ⁺¹
Really helpful thank you!
@adithyan_ai 1 year ago
Incredibly useful!! Thanks.
@AnthatiKhasim-i1e 4 months ago
"To remain competitive, large companies must integrate AI into their supply chain management, optimizing logistics, reducing costs, and minimizing waste."
@stachowi 1 year ago
This was very good
@LeveragedFinance 1 year ago
Great job
@thirukaruna7469 1 year ago
Good one, Thx.!
@IAMNOTRANA 1 year ago ⁺³
No wonder Nvidia doesn't care about consumer GPUs anymore.
@stachowi 1 year ago
Yup, cash grab
@SynthoidSounds 1 year ago
A slightly different way of looking at Moore's Law is not about it being "dead", but rather becoming irrelevant. Quantum computing operates very differently from binary digital computation, so it's irrelevant to compare these two separate domains in terms of "how many transistors" can fit into a 2D region of space, or FLOPS performance. Aside from the extreme parallelism available in QC, the next stage from "here" is optical computing, utilizing photons instead of electrons as the computational mechanism. Also, scalable analog computing ICs (for AI engines) are being developed (by IBM, for example) . . . Moore's Law isn't relevant in any of these.
@billp37abq 4 months ago
This video makes clear WHY DSP [digital signal processing] chips were implementing sum{a[i]*b[i]} in hardware!
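The sum in the comment above is a dot product, built from repeated multiply-accumulate (MAC) steps. A minimal Python sketch of that operation follows; the function names `mac_dot` and `dense_layer` are hypothetical, and the loop only mirrors the arithmetic that such hardware performs, not any particular chip's design.

```python
def mac_dot(a, b):
    """Compute sum(a[i] * b[i]) one multiply-accumulate step at a time."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    acc = 0.0
    for x, y in zip(a, b):
        acc += x * y  # one multiply-accumulate step
    return acc


def dense_layer(weights, inputs):
    """Matrix-vector product: one dot product per row of weights."""
    return [mac_dot(row, inputs) for row in weights]


print(mac_dot([1, 2, 3], [4, 5, 6]))                     # 32.0
print(dense_layer([[1, 0], [0, 1], [1, 1]], [2, 3]))     # [2.0, 3.0, 5.0]
```

A neural network layer without its activation function is this matrix-vector product, and each row's dot product is independent of the others, which is why the work parallelizes well.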
@MegaVin99 1 year ago ⁺¹
Thanks for video but 4 mins before getting to any details in a 15 min video?
@vai47 1 year ago
Older Vox style animations FTW!
@chenellson489 1 year ago
See you at NY Tech Week
@billp37abq 4 months ago
Do AI and cloud computing face the same power supply issue as cryptocurrencies?
"Cryptocurrency mining, mostly for Bitcoin, draws up to 2,600 megawatts from the regional power grid, about the same as the city of Austin."
@shwiftymemelord261 5 months ago
it would be so cool if this main speaker was a clone
@LeveragedFinance 1 year ago ⁺²
Huang's law
@sergiocayuqueov 7 days ago
Interesting
@RambleStorm 3 months ago
GeForce 256 aka GeForce 1 wasn't even Nvidia's first GPU, let alone the first ever PC GPU... 😅😂
@joshuatruong2001 1 year ago ⁺¹
The Render network token solves this
@gracekim2863 1 year ago
Back to School Giveaway
@antt8550 1 year ago
The future
@billp37abq 4 months ago
Has AI's power consumption doomed it to failure before it has started?
ruclips.net/video/lRy5Sy9Elbw/видео.html
@1SlipperyPenguin 18 days ago
Sounds like a bad nightclub, stop the music
@mr.wrongthink.1325 3 months ago
The music is unnecessary and actually annoying.