Wow, wasn't expecting a new video. Love the way you explain things, keep it up!
Glad this was useful!
Excellent summary regarding randomness in NN training and generative AI. Very good illustrations as well.
Thanks for the kind words!
Amazing new video recapping areas of randomness in deep neural nets. I do have a question regarding top-k sampling: why do we have to renormalize the top-k choices in the vocabulary? Can't we just randomly choose between the top-k choices?
Good question. This is more for interpretability purposes, but you are right, you can skip the normalization step.
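A minimal sketch of what that looks like, assuming PyTorch and made-up logits over a tiny vocabulary (not the video's code). Option A renormalizes the top-k logits with softmax; option B skips the explicit renormalization, since torch.multinomial only needs non-negative weights and normalizes them internally, so both sample the same distribution:

```python
import torch

# Illustrative logits over a tiny vocabulary (placeholder values)
logits = torch.tensor([2.0, 1.0, 0.5, -1.0, -3.0])
k = 3

# Keep only the top-k logits and their vocabulary indices
top_logits, top_indices = torch.topk(logits, k)

# Option A: renormalize via softmax so the k probabilities sum to 1
probs = torch.softmax(top_logits, dim=-1)
next_token = top_indices[torch.multinomial(probs, num_samples=1)]

# Option B: skip the explicit renormalization; torch.multinomial
# normalizes the non-negative weights internally, so sampling in
# proportion to exp(logit) yields the same distribution
weights = torch.exp(top_logits)
next_token_b = top_indices[torch.multinomial(weights, num_samples=1)]
```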
Amazing video.
Only one thing I'm wondering: what if we start by sampling different seeds to initialize the weights and biases, feed forward once for each, and see which one results in the lowest loss? The seeds could be a range of numbers, e.g. 1-100, or a set of random numbers. Do you think this is useful in practice?
That's a good question. And yes, it can be useful. Actually, I use that for creating confidence intervals, for example. E.g., see section 4 here: github.com/rasbt/MachineLearning-QandAI-book/blob/main/supplementary/q25_confidence-intervals/1_four-methods.ipynb
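For the confidence-interval idea, here is a minimal sketch of the seed-based approach (not the notebook's exact method): train the same model with several random seeds, collect the test accuracies, and compute an approximate 95% interval from them. The accuracy values below are placeholders, not real results:

```python
import numpy as np

# Hypothetical test accuracies from the same model trained with
# different random seeds (placeholder values)
accuracies = np.array([0.912, 0.905, 0.918, 0.909, 0.915])

mean = accuracies.mean()
# Standard error of the mean across seeds
sem = accuracies.std(ddof=1) / np.sqrt(len(accuracies))

# Approximate 95% confidence interval (normal approximation)
lower, upper = mean - 1.96 * sem, mean + 1.96 * sem
print(f"accuracy: {mean:.3f} ({lower:.3f}, {upper:.3f})")
```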
Should we tune the seed for better results?😂
Haha, believe it or not, I once reviewed a paper where the seed was a hyperparameter.