Lesson 7: Practical Deep Learning for Coders 2022

  • Published: 1 Aug 2024
  • 00:00 - Tweaking first and last layers
    02:47 - What are the benefits of using larger models
    05:58 - Understanding GPU memory usage
    08:04 - What is GradientAccumulation?
    20:52 - How to run all the models with specifications
    22:55 - Ensembling
    37:51 - Multi-target models
    41:24 - What does `F.cross_entropy` do
    45:43 - When do you use softmax and when not to?
    46:15 - Cross-entropy loss
    49:53 - How to calculate binary cross-entropy
    52:19 - Two versions of cross-entropy in PyTorch
    54:24 - How to create a learner for predicting two targets
    1:02:00 - Collaborative filtering deep dive
    1:08:55 - What are latent factors?
    1:11:28 - Dot product model
    1:18:37 - What is embedding
    1:22:18 - How do you choose the number of latent factors
    1:27:13 - How to build a collaborative filtering model from scratch
    1:29:57 - How to understand the `forward` function
    1:32:47 - Adding a bias term
    1:34:29 - Model interpretation
    1:39:06 - What is weight decay and how does it help
    1:43:47 - What is regularization
    Transcript thanks to nikem, fmussari, wyquek, bencoman, and gagan from forums.fast.ai
    Timestamps based on notes by Daniel from forums.fast.ai

Comments • 14

  • @yoverale
    @yoverale 4 months ago +2

    This course is truly priceless, much deeper and more didactic than a lot of paid courses out there 🤩 thanks Jeremy

  • @sunderrajan6172
    @sunderrajan6172 2 years ago +22

    You are amazing as always! We are all so gifted and blessed to have you teaching these classes. I am truly amazed by your level of commitment to society.

  • @tumadrep00
    @tumadrep00 1 year ago +6

    Jeremy my man, you are truly one hell of a human being. I wish you the best

  • @maraoz
    @maraoz 1 year ago +9

    I love how Jeremy explains techniques like gradient accumulation. He makes them seem so obvious and powerful that it's hard to forget them. Never again will I think big models are out of scope for my experiments! :D

  • @merelogics
    @merelogics 1 year ago +8

    "At this point if you've heard about embeddings before you might be thinking: that can't be it. And yeah, it's just as complex as the rectified linear unit which turned out to be: replace negatives with zeros. Embedding actually means: “look something up in an array”. So there's a lot of things that we use, as deep learning practitioners, to try to make you as intimidated as possible so that you don't wander into our territory and start winning our Kaggle competitions." 🤣

  • @JohnSmith-he5xg
    @JohnSmith-he5xg 1 year ago +1

    Tremendous content!

  • @pranavdeshpande4942
    @pranavdeshpande4942 1 year ago +3

    I loved the collaborative filtering stuff and your explanation of embeddings!

  • @mukhtarbimurat5106
    @mukhtarbimurat5106 1 year ago

    Great, thanks!

  • @vinodjoshi9127
    @vinodjoshi9127 1 year ago

    Jeremy - In the deep learning implementation of collaborative filtering, the input is the concatenated embeddings of users and items. However, my understanding is that the model is not learning the embedding matrix here; instead it's learning the weights (176 * 100) in the first layer and (100 * 1) in the second layer. Am I missing something? Appreciate your input.
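    On the question of whether the embeddings are learned: in a PyTorch model like this, the embedding tables are ordinary `nn.Embedding` layers, so their weights are parameters and backprop updates them along with the two linear layers. A rough sketch of the idea (the class name `CollabNN` and the sizes 74, 102, and 100 are illustrative guesses, not the lesson's exact code):

    ```python
    import torch
    import torch.nn as nn

    class CollabNN(nn.Module):
        """Sketch of a neural-net collaborative filtering model; sizes are illustrative."""
        def __init__(self, n_users, n_items, user_dim=74, item_dim=102, n_hidden=100):
            super().__init__()
            self.user_emb = nn.Embedding(n_users, user_dim)   # learnable lookup table
            self.item_emb = nn.Embedding(n_items, item_dim)   # learnable lookup table
            self.layers = nn.Sequential(
                nn.Linear(user_dim + item_dim, n_hidden),     # e.g. 176 -> 100
                nn.ReLU(),
                nn.Linear(n_hidden, 1),                       # e.g. 100 -> 1
            )

        def forward(self, user_ids, item_ids):
            x = torch.cat([self.user_emb(user_ids), self.item_emb(item_ids)], dim=1)
            return self.layers(x)

    model = CollabNN(n_users=944, n_items=1665)
    # The embedding weights are ordinary parameters, so backprop trains them
    # together with the two linear layers:
    for name, p in model.named_parameters():
        print(name, tuple(p.shape))
    ```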

  • @toromanow
    @toromanow 10 months ago +1

    Hello, where can I find the notebook for this? I found Road to the Top Part 1 and Part 2 but can't find Part 3 anywhere.

  • @tljstewart
    @tljstewart 10 months ago

    Accumulated gradients are a nice trick; however, for sufficiently large datasets and run times, your memory bandwidth latency will increase by the same multiple you accumulate.

  • @matthewrice7590
    @matthewrice7590 1 year ago

    I understand the advantage of gradient accumulation in terms of being able to run your training on smaller GPUs by "imitating" a larger batch size when calculating the gradients, but wouldn't a major drawback of gradient accumulation be an increase in training time and ultimately in energy use? I.e., isn't your training going to run half as fast when accum is set to 2? And the more you increase the accum number, the slower the training gets, because your actual batch sizes are getting smaller and smaller?

    • @ChristopheMeyerPro
      @ChristopheMeyerPro 1 year ago

      No, the total amount of work to be done is basically the same. There might be some extra overhead from more frequent data transfers and fewer opportunities to optimize parallelism, but it's not like you're multiplying the work by the accum amount. You just do the same total work in smaller batches at a time.
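      To make that concrete, here is a bare-bones sketch of a gradient-accumulation training loop (the toy model, data, and the `accum` variable are made up for illustration, not fastai's actual implementation). Every mini-batch still gets exactly one forward and one backward pass; only the optimizer step happens less often, so the total compute per epoch is roughly unchanged.

      ```python
      import torch
      import torch.nn as nn

      # Toy setup: a tiny model and some random data, just to show the loop structure.
      model = nn.Linear(10, 1)
      opt = torch.optim.SGD(model.parameters(), lr=0.1)
      loss_fn = nn.MSELoss()
      data = [(torch.randn(16, 10), torch.randn(16, 1)) for _ in range(8)]

      accum = 2  # accumulate gradients over this many mini-batches

      opt.zero_grad()
      for i, (xb, yb) in enumerate(data):
          loss = loss_fn(model(xb), yb) / accum  # scale so the summed gradient matches
          loss.backward()                        # one batch of size 16 * accum
          if (i + 1) % accum == 0:               # each batch still does one forward/backward;
              opt.step()                         # only the optimizer step is less frequent
              opt.zero_grad()
      ```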