AI Basics: Accuracy, Epochs, Learning Rate, Batch Size and Loss

How to train a model to generate image embeddings from scratch

ML Was Hard Until I Learned These 5 Secrets!

Roman Reigns, Jimmy & Jey Uso Help Bloodline Lose Tag Titles! | WWE SmackDown 10/25/24 | WWE on USA

The True Scale Of Modern Nuclear Weapons

Dragon Age: The Veilguard | Official Launch Trailer

The Wrong Batch Size Will Ruin Your Model

Underfitted

Просмотров 18 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 27 окт 2024

Комментарии • 32

@ErlendDavidson 2 года назад ⁺²³
If you scale the batch size by the learning rate (i.e. lr=(batch_size/32.)*0.01) then the stochastic gradient descent looks sort of okay here.
@underfitted 2 года назад
Interesting :)
@jasdeepsinghgrover2470 Год назад ⁺²
I completely agree ... Because the number of updates happening depend on batch size and even the size of the update. So if the learning rate is scaled according to batch size linearly the model can perform very well even with much smaller batches.
@OliverHennhoefer 2 года назад ⁺⁴
Really like the videos. However, I want to warn against the general statement that a batch size of one is not recommended. It really depends on the problem/data. So don't simply dismiss stochastic gradient descent, try it!
@underfitted 2 года назад ⁺¹
I think that’s fair. I’ve never used it in any of the problems I’ve worked on, but you are right.
@edmundfreeman7203 Год назад ⁺²
This is the kind of thing that I hate about deep learning. A single parameter in the optimization method can completely change the results. Batches should be small but not too small. How small? That's for heuristics but will change on different data sets.
@Metryk 9 месяцев назад ⁺²
Hi! Maybe you can help me with this one: if I want to test an already pre-trained image classifier, how do I proceed regarding the amount of images used? The set containing test images has 100k images, I guess it wouldn't make any sense to load them all at once, so how do I proceed? Thanks!
@johnmoustakas8897 2 года назад ⁺²
Good work, hope your channel gets more attention
@underfitted 2 года назад
Thanks, John! It takes time and work but I’ll make it happen.
@ErlendDavidson 2 года назад ⁺⁵
What do you think of (artificially) adding noise to the learning rate. I feel like it used to be more popular to do that, but almost never see it these days.
@underfitted 2 года назад ⁺²
Yeah… never seen that honestly. I’ve used schedules to decrease the learning rate over time, but never read about adding noise to it.
@lakeguy65616 Год назад ⁺³
so, what is the optimal batch size?
@underfitted Год назад ⁺¹
It depends. Start with 32 and experiment from there.
@lakeguy65616 Год назад ⁺¹
@@underfitted Does the amount of main memory Ram or GPU ram make a difference? (great videos!)
@underfitted Год назад ⁺²
It does! Your batch has to fit in memory, or it won't work. When you are working with images, for example, you'll quickly find that your batch size can't be too large if you want to fit it in the GPU's memory.
@Agrover112 2 года назад ⁺²
Hey love this video! Was losing touch of the basics !
@underfitted 2 года назад ⁺¹
Glad it was helpful!
@Darkraak 12 дней назад
Great video man 👏
@axelanderson2030 Год назад
If you generate a dummy dataset and set a static learning rate, then smaller batch sizes work better? wtf?
@Levy957 2 года назад ⁺¹
Amazing!!
Did u know why the batch size os always 32, 64, 128?
@underfitted 2 года назад ⁺²
I read somewhere about the ability to fit batches in a GPU... can't remember where exactly. That being said, I've seen experiments that show that it really doesn't matter much (if at all.)
@MrAleksander59 Год назад ⁺¹
It's better for memory usage. GPU, CPU, hard drives, SSD and other in the current 2-bit logic uses memory blocks with sizes of power 2. 2^5 = 32, 2^6=64, 2^7=128 etc. You always want maximum usage of memory. For example you have array with floats, each float will take 32 bits. So, at least it divisible by 32.
@OmarBoukchana Год назад
i didnt see a helpful video like this one in the entire internet, thank you ♥
@underfitted Год назад
Glad it was helpful!
@muhammadtalmeez3276 Год назад
Your videos are amazing. Thank you so much for this great knowledge and beautiful videos.
@underfitted Год назад ⁺¹
Glad you like them!
@ziquaftynny9285 Год назад
I love your presentation style! Very energetic :)
@underfitted Год назад
Thanks
@akshay0072 5 месяцев назад ⁺¹
Good content. Try improving ur way of teaching. Learning should in relaxed tone
@underfitted 5 месяцев назад ⁺¹
Thanks! This was an old video. I’ve tried to improve in the latest few.
@michaelsprinzl9045 6 месяцев назад ⁺¹
A new cat video. Cute.
@sarahpeterson2702 Год назад
the question is whether if you use a batch and reach the global minimum is your model functionally equivalent to one that didn't batch? Are the weights identical... no they aren't . if your model is generative you don't have equivalence with batch/non batch.

Следующие

Автовоспроизведение

AI Basics: Accuracy, Epochs, Learning Rate, Batch Size and Loss

AI Basics: Accuracy, Epochs, Learning Rate, Batch Size and Loss

How to train a model to generate image embeddings from scratch

How to train a model to generate image embeddings from scratch

ML Was Hard Until I Learned These 5 Secrets!

ML Was Hard Until I Learned These 5 Secrets!

Roman Reigns, Jimmy & Jey Uso Help Bloodline Lose Tag Titles! | WWE SmackDown 10/25/24 | WWE on USA

Roman Reigns, Jimmy & Jey Uso Help Bloodline Lose Tag Titles! | WWE SmackDown 10/25/24 | WWE on USA

The True Scale Of Modern Nuclear Weapons

The True Scale Of Modern Nuclear Weapons

Dragon Age: The Veilguard | Official Launch Trailer

Dragon Age: The Veilguard | Official Launch Trailer

Bamboozled! 🎋 | Ep. 2 | Wild Life

Bamboozled! 🎋 | Ep. 2 | Wild Life

Epochs, Iterations and Batch Size | Deep Learning Basics

Epochs, Iterations and Batch Size | Deep Learning Basics

Epoch, Batch, Batch Size, & Iterations

Epoch, Batch, Batch Size, & Iterations

Early Stopping. The Most Popular Regularization Technique In Machine Learning.

Early Stopping. The Most Popular Regularization Technique In Machine Learning.

AI Just Solved a 53-Year-Old Problem! | AlphaTensor, Explained

AI Just Solved a 53-Year-Old Problem! | AlphaTensor, Explained

136 understanding deep learning parameters batch size

136 understanding deep learning parameters batch size

All Machine Learning algorithms explained in 17 min

All Machine Learning algorithms explained in 17 min

The Most Important Algorithm in Machine Learning

The Most Important Algorithm in Machine Learning

How To Self Study AI FAST

How To Self Study AI FAST

154 - Understanding the training and validation loss curves

154 - Understanding the training and validation loss curves

Ани Лорак круто перепела Уитни Хьюстон на МУЗЛОФТЕ😍

Ани Лорак круто перепела Уитни Хьюстон на МУЗЛОФТЕ😍

ПОЛИЦЕЙСКАЯ ТОЛКАЕТ ПОКУПАТЕЛЯ и КИНУЛА СВОЁ УДОСТОВЕРЕНИЕ! ПЫТАЮТСЯ ДОГОВОРИТЬСЯ? ОБВИНИЛИ В КРАЖЕ

ПОЛИЦЕЙСКАЯ ТОЛКАЕТ ПОКУПАТЕЛЯ и КИНУЛА СВОЁ УДОСТОВЕРЕНИЕ! ПЫТАЮТСЯ ДОГОВОРИТЬСЯ? ОБВИНИЛИ В КРАЖЕ

Мальчик с птенчиком. Эту сцену "Тупой и еще тупее" ты никогда не видел!

Мальчик с птенчиком. Эту сцену "Тупой и еще тупее" ты никогда не видел!

🧐ВОЗМОЖНО ЛИ, ПЕРЕКРАСИТЬ ТАНК💥 В РАЗНЫХ GTA? #gta #shorts

🧐ВОЗМОЖНО ЛИ, ПЕРЕКРАСИТЬ ТАНК💥 В РАЗНЫХ GTA? #gta #shorts

MAGIC TIME ⁠@Whoispelagheya

MAGIC TIME ⁠@Whoispelagheya

новое испытание

новое испытание

SOUTH KOREA AND NORTH KOREA HOLD THEIR BREATH 🇰🇷 🇰🇵 #countryhumans

SOUTH KOREA AND NORTH KOREA HOLD THEIR BREATH 🇰🇷 🇰🇵 #countryhumans

ITZY 예지한테 AI 메이크업하기💖 #shorts

ITZY 예지한테 AI 메이크업하기💖 #shorts