L17.5 A Variational Autoencoder for Handwritten Digits in PyTorch -- Code Example

  • Published: 10 Jan 2025

Comments • 18

  • @raghavamorusupalli7557
    @raghavamorusupalli7557 3 years ago +4

    Thank you for hand-holding the DL aspirants to reach new destinations. A great service to knowledge.

  • @736939
    @736939 3 years ago +4

    3:59 In the first decoder's linear layer you have only 2 neurons. I mean, if you have 2 neurons from z_mean and 2 neurons from z_log_var, then the decoder's linear layer must contain 4 neurons instead of 2. I don't get it.

    • @SebastianRaschka
      @SebastianRaschka  3 years ago +5

      Good question! The best way to see how these actually get used is by looking at the forward method:

      def forward(self, x):
          x = self.encoder(x)
          z_mean, z_log_var = self.z_mean(x), self.z_log_var(x)
          encoded = self.reparameterize(z_mean, z_log_var)
          decoded = self.decoder(encoded)
          return encoded, z_mean, z_log_var, decoded

      So here we can see that these (z_mean and z_log_var) get passed to self.reparameterize, which returns "encoded", which is then passed to the decoder.
      Upon inspecting self.reparameterize you will see that we use z_mean and z_log_var as parameters of a normal distribution to sample the vector "z" (same as "encoded"):

      def reparameterize(self, z_mu, z_log_var):
          eps = torch.randn(z_mu.size(0), z_mu.size(1)).to(z_mu.get_device())
          z = z_mu + eps * torch.exp(z_log_var/2.)
          return z

      In other words, the two-dimensional vectors z_mean & z_log_var are not passed directly to the decoder but are just used to sample from a 2D Gaussian distribution via torch.randn to get a 2D vector z. I.e., the input to the decoder is the 2D vector "z" (aka "encoded").
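
      To make the shapes concrete, here is a minimal standalone sketch (illustrative only, assuming a 2-dimensional latent space; not the exact class from the video):

      import torch

      torch.manual_seed(123)
      batch_size, latent_dim = 4, 2

      # Stand-ins for the two encoder heads; in the model these come from
      # the separate linear layers self.z_mean and self.z_log_var.
      z_mean = torch.zeros(batch_size, latent_dim)
      z_log_var = torch.zeros(batch_size, latent_dim)

      # Reparameterization: sample eps ~ N(0, I), then shift and scale it.
      eps = torch.randn(batch_size, latent_dim)
      z = z_mean + eps * torch.exp(z_log_var / 2.)

      print(z.shape)  # torch.Size([4, 2]): one 2D vector per sample goes into the decoder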

  • @cricketcricket20
    @cricketcricket20 1 year ago

    Hello, at 13:53 you said that you are summing over the latent dimension. But aren't the z_mean and z_log_var tensors of the shape (batch size, channels, latent dimension)? In that case wouldn't you sum over axis = 2? Thanks a lot for the videos!

    • @cricketcricket20
      @cricketcricket20 1 year ago

      Following up on this, I think the sum over axis = 1 is correct because it carries out the KL divergence formula element-wise, the way it should be done. This outputs a tensor of shape (batch size, channels, latent dimension), and then you compute the average of this tensor. This is analogous to taking the MSE loss (with reduction = 'mean'), which first computes the squared differences element-wise and then takes the average.
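
      For reference, a minimal sketch of the KL term as it is commonly computed for this kind of VAE, assuming z_mean and z_log_var have shape (batch_size, latent_dim), so the sum runs over the latent dimension (axis 1) and the mean over the batch (an illustration, not necessarily the exact code from the video):

      import torch

      batch_size, latent_dim = 4, 2
      z_mean = torch.randn(batch_size, latent_dim)
      z_log_var = torch.randn(batch_size, latent_dim)

      # KL divergence between N(z_mean, exp(z_log_var)) and N(0, I),
      # computed element-wise and summed over the latent dimension.
      kl_per_sample = -0.5 * torch.sum(
          1 + z_log_var - z_mean**2 - torch.exp(z_log_var), dim=1
      )  # shape: (batch_size,)

      kl_loss = kl_per_sample.mean()  # average over the batch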

  • @vineetgundecha7872
    @vineetgundecha7872 1 year ago

    Thanks for the explanation! Unlike the reconstruction loss, which is interpretable, how should we interpret the KL divergence loss? What is an acceptable value? How would the sampled images look if we had a low reconstruction error but a high KL divergence?

  • @MohitGupta-zf8kx
    @MohitGupta-zf8kx 3 years ago +4

    Your video is really amazing. Thank you very much for giving us so much knowledge. Can you please tell us how we can get the validation loss curves?
    Thanks :)

    • @SebastianRaschka
      @SebastianRaschka  3 years ago

      Glad to hear you are liking it. I plotted the losses with matplotlib; you can find the code here at the top: github.com/rasbt/stat453-deep-learning-ss21/blob/main/L17/helper_plotting.py
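
      For a quick starting point, a minimal sketch of plotting such curves with matplotlib (not the exact helper from the linked repo; the loss values below are placeholders, assuming one training and one validation loss was logged per epoch):

      import matplotlib.pyplot as plt

      # Placeholder values; replace with the losses you logged during training.
      train_losses = [0.95, 0.71, 0.58, 0.51, 0.47]
      valid_losses = [0.98, 0.75, 0.63, 0.57, 0.55]

      epochs = range(1, len(train_losses) + 1)
      plt.plot(epochs, train_losses, label="Training loss")
      plt.plot(epochs, valid_losses, label="Validation loss")
      plt.xlabel("Epoch")
      plt.ylabel("Loss")
      plt.legend()
      plt.show()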

  • @yogendra-yatnalkar
    @yogendra-yatnalkar 10 months ago

    Thanks a lot for the VAE series. A small question: since we need the encoder output to be as close to a standard normal distribution as possible, why don't we enforce an activation function on the encoder's linear layers? E.g., the mean layer would have a sigmoid activation function and the variance layer a tanh ... something like that?

  • @prashantjaiswal5260
    @prashantjaiswal5260 1 year ago

    Running the code on Google Colab, it shows an error in the model.to(DEVICE) part. How can it be corrected?
    set_all_seeds(RANDOM_SEED)
    model = VAE()
    model.to(DEVICE)
    optimizer = torch.optim.Adam(model.parameters(), lr=LEARNING_RATE)
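
    A common cause of an error at model.to(DEVICE) is that DEVICE is undefined in the current session, or refers to a GPU that the Colab runtime does not have. That is only an assumption about this particular error, but a defensive definition looks like this:

    import torch

    # Fall back to the CPU when no GPU is available in the runtime.
    DEVICE = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    print(DEVICE)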

  • @yeahjustin3388
    @yeahjustin3388 1 month ago

    I am wondering how the backward pass works for the random sampling (torch.randn) function.
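
    As a rough illustration (this is the standard reparameterization-trick argument, not a quote from the video): no gradient flows through torch.randn itself; eps is treated as a constant sample, and the gradients flow through z_mean and z_log_var instead.

    import torch

    z_mean = torch.zeros(1, 2, requires_grad=True)
    z_log_var = torch.zeros(1, 2, requires_grad=True)

    eps = torch.randn(1, 2)  # no requires_grad: a fixed sample, not a learnable quantity
    z = z_mean + eps * torch.exp(z_log_var / 2.)

    z.sum().backward()
    print(z_mean.grad)     # gradients reach z_mean ...
    print(z_log_var.grad)  # ... and z_log_var, but none are needed for eps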

  • @siddhantverma532
    @siddhantverma532 3 years ago +1

    First of all, thanks a lot! The scatter plot really gives a nice intuition about the latent space. But it got me thinking: will every trained 2D latent space look like this, or does it depend on how someone designed the architecture or trained it? Then I saw that your plot was different from mine, so I guess it's not universal. If it were universal, it would be a huge thing!
    Another thing: if I'm not wrong, we are trying to learn a probability distribution. I want to know and visualise the distribution that our network has learnt. How can we do that? Since it's in 2D, it could be visualised as a 3D graph.

    • @SebastianRaschka
      @SebastianRaschka  3 years ago

      The latent space will depend a bit on the weight of the KL-divergence term (if it is too weak, the latent space will resemble a 2D Gaussian less). Also, since random sampling is involved, the plot may look different every time. Btw, regarding the plot: to plot the distribution in 3D, you'd need some sort of density estimation. This reminds me, I actually wrote a blog post about this a long time ago: sebastianraschka.com/Articles/2014_kernel_density_est.html
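
      A minimal sketch of such a density estimate for a 2D latent space, using scipy's gaussian_kde rather than the approach from the blog post (the latent_z array below is a random placeholder; in practice you would collect the encoded points from the trained model):

      import numpy as np
      import matplotlib.pyplot as plt
      from scipy.stats import gaussian_kde

      latent_z = np.random.randn(1000, 2)  # placeholder for the encoded data points

      kde = gaussian_kde(latent_z.T)

      # Evaluate the estimated density on a grid and show it as a heatmap.
      xs, ys = np.mgrid[-3:3:100j, -3:3:100j]
      density = kde(np.vstack([xs.ravel(), ys.ravel()])).reshape(xs.shape)

      plt.imshow(density.T, origin="lower", extent=(-3, 3, -3, 3))
      plt.xlabel("latent dim 1")
      plt.ylabel("latent dim 2")
      plt.show()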

  • @hillarykavagi7349
    @hillarykavagi7349 2 years ago

    Hi Sebastian, I like your videos; they have helped me. I am working on a personal project on variational autoencoders using a Dirichlet distribution, and I am stuck at the point of calculating the binary cross-entropy loss. I would kindly like to request assistance.

  • @jalv1499
    @jalv1499 1 year ago

    Thank you for the video! What's the formula for backpropagation? I did not see the code for the backward-propagation part.
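
    There is no hand-written backward pass; PyTorch's autograd derives all gradients from the forward computation when loss.backward() is called. Below is a minimal, self-contained sketch of one training step with a toy stand-in model (the architecture, MSE reconstruction loss, and hyperparameters are illustrative placeholders, not the ones from the video):

    import torch
    import torch.nn.functional as F

    # Tiny stand-in model and data so the snippet runs on its own;
    # in the real code, `model` is the VAE and `features` is a minibatch of images.
    torch.manual_seed(1)
    features = torch.rand(8, 784)

    class TinyVAE(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.encoder = torch.nn.Linear(784, 16)
            self.z_mean = torch.nn.Linear(16, 2)
            self.z_log_var = torch.nn.Linear(16, 2)
            self.decoder = torch.nn.Linear(2, 784)

        def forward(self, x):
            x = self.encoder(x)
            z_mean, z_log_var = self.z_mean(x), self.z_log_var(x)
            eps = torch.randn_like(z_mean)
            encoded = z_mean + eps * torch.exp(z_log_var / 2.)
            decoded = self.decoder(encoded)
            return encoded, z_mean, z_log_var, decoded

    model = TinyVAE()
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    # One training step: forward pass, combined loss, backward pass, update.
    encoded, z_mean, z_log_var, decoded = model(features)
    kl_div = -0.5 * torch.sum(1 + z_log_var - z_mean**2 - torch.exp(z_log_var), dim=1).mean()
    recon_loss = F.mse_loss(decoded, features)
    loss = recon_loss + kl_div

    optimizer.zero_grad()
    loss.backward()   # autograd computes all gradients; no manual backprop formula is needed
    optimizer.step()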