Variational Autoencoders

  • Published: 27 Nov 2024

Comments • 26

  • @wilsonlwtan3975
    @wilsonlwtan3975 10 months ago

    This is a gem. Finally, someone that is able to give concise teaching well! Thank you!

  • @gorgolyt
    @gorgolyt 3 years ago +13

    Best explanation on YouTube. Exactly what I was looking for. Thorough, logical, intuitive.

  • @bluestar2253
    @bluestar2253 3 years ago +5

    One of the best explanations on VAE on YT. Thank you and keep up the good work!

  • @pietrocestola7856
    @pietrocestola7856 1 year ago

    Clear, concise and very accurate. Thank you so much for sharing with us this wonderful explanation.

  • @amirhosseinramazani757
    @amirhosseinramazani757 2 years ago +1

    I enjoyed your explanation. I needed something like this video to get a little deeper into the theory of the VAEs. Thank you!

  • @sucramgnat8157
    @sucramgnat8157 2 years ago +1

    Thank you so much for your lecture. You truly have a talent for teaching!

  • @yurigansmith
    @yurigansmith 4 months ago +1

    Training of generative models starts here: 6:26

  • @Procuste34iOSh
    @Procuste34iOSh 1 year ago

    thank you so much. so underrated

  • @yurigansmith
    @yurigansmith 4 months ago

    Very good presentation. Thanks a lot!

  • @sahhaf1234
    @sahhaf1234 1 year ago +2

    How do we know that p(x|z) is normally distributed? Do we just assume it?
    x|z is just a neural network, and I don't see any reason for p(x|z) to be normally distributed. Actually, the relation between x and z must be deterministic.

    • @AmanSharma-ug6sr
      @AmanSharma-ug6sr 3 months ago +1

      Not apparent in the video, but the x|z neural network is actually outputting the mean of the distribution of x for that z, which is a Gaussian. This means a given target image can be generated by multiple z's (and thus by multiple means). The loss function has two opposing terms: the reconstruction error, which minimizes the distance between this mean and the target image, putting the generated mean in the right place, and the KL divergence term between q(z|x) and the standard normal distribution, which tries to bring the means for similar images closer together.
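
      If it helps, here is a minimal sketch of that two-term loss, assuming a fixed-variance Gaussian decoder and a PyTorch-style setup; the function and variable names are illustrative, not from the video:

      ```python
      import torch
      import torch.nn.functional as F

      def vae_loss(x_hat_mean, x, mu, logvar):
          # Reconstruction term: pulls the decoder's mean for this z toward the target image.
          # With a fixed-variance Gaussian p(x|z), the negative log-likelihood reduces to a squared error.
          recon = F.mse_loss(x_hat_mean, x, reduction="sum")
          # KL(q(z|x) || N(0, I)) for a diagonal Gaussian q with parameters (mu, logvar):
          # keeps the latent codes of similar images close together near the prior.
          kl = -0.5 * torch.sum(1.0 + logvar - mu.pow(2) - logvar.exp())
          return recon + kl
      ```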

  • @MeowlaMars
    @MeowlaMars 1 year ago

    This is clear and awesome

  • @gomctigger4439
    @gomctigger4439 2 years ago +1

    Hi @Paul Hand,
    thank you for the lecture.
    What is the intuition behind using q(z|x) in the expectation or the expectation at all? I see that it makes sense mathematically, but how would one get the idea?
    In contrast, there is a derivation of the ELBO via importance sampling and then applying Jensen's inequality, or via the optimal sampler.
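
    For what it's worth, the importance-sampling + Jensen route mentioned above makes the role of q(z|x) explicit: it is the proposal distribution the expectation is taken under, and any choice of q gives a valid lower bound:

    ```latex
    \log p_\theta(x)
      = \log \int p_\theta(x,z)\,dz
      = \log \mathbb{E}_{q_\phi(z|x)}\!\left[\frac{p_\theta(x,z)}{q_\phi(z|x)}\right]
      \;\ge\; \mathbb{E}_{q_\phi(z|x)}\!\left[\log p_\theta(x,z) - \log q_\phi(z|x)\right]
      = \mathrm{ELBO}(\theta,\phi;x)
    ```

    The bound is tightest when q_phi(z|x) matches the true posterior p_theta(z|x), which is one intuition for why q is conditioned on x.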

  • @user-or7ji5hv8y
    @user-or7ji5hv8y 3 years ago +1

    wow, this is so well explained.

  • @rishidixit7939
    @rishidixit7939 1 month ago

    What is the meaning of Latent Code?

  • @slemanbisharat6390
    @slemanbisharat6390 1 year ago

    Thank you, excellent explanation!!

  • @trongduong1047
    @trongduong1047 3 years ago

    very nice explanation!

  • @hubertnguyen8855
    @hubertnguyen8855 3 years ago +1

    Very nice and comprehensive lecture. Thanks

  • @oFabianLoL
    @oFabianLoL 1 year ago

    I don't understand what phi and theta mean. "The parameters of the model": does that mean the weights of the neural network? Or the parameters of the distribution, e.g. if it is Gaussian, the parameters correspond to a mu and sigma?
    I'd appreciate it if anyone can clarify, thank you!

    • @ThatQCboy
      @ThatQCboy 1 year ago

      Parameters of the model. We use MLE principles to find the optimal phi and theta.

    • @doyney
      @doyney 1 year ago +1

      I'm pretty sure phi and theta represent the parameters in terms of weights and biases in the encoder/decoder neural networks.
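
      To make that concrete, a minimal illustrative sketch (sizes and names are hypothetical, not from the video): phi is the set of weights and biases of the encoder network that maps x to the parameters (mu, logvar) of q(z|x), and theta is the set of weights and biases of the decoder network that defines p(x|z).

      ```python
      import torch.nn as nn

      latent_dim, data_dim, hidden = 16, 784, 256  # illustrative sizes

      # phi: all weights/biases of the encoder, which maps x to (mu, logvar) of q_phi(z|x)
      encoder = nn.Sequential(
          nn.Linear(data_dim, hidden), nn.ReLU(),
          nn.Linear(hidden, 2 * latent_dim),  # first half = mu, second half = logvar
      )

      # theta: all weights/biases of the decoder, which maps z to the mean of p_theta(x|z)
      decoder = nn.Sequential(
          nn.Linear(latent_dim, hidden), nn.ReLU(),
          nn.Linear(hidden, data_dim),
      )

      phi = list(encoder.parameters())    # "parameters of the model", encoder side
      theta = list(decoder.parameters())  # "parameters of the model", decoder side
      ```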

  • @maximmaximov4147
    @maximmaximov4147 1 year ago

    It would be really perfect if someone started giving some examples for each step, since we are talking about real things that exist in the world. Each step has its meaning and intention and is made to overcome challenges or obstacles that come up along the way. I want to know what we are doing and what the purpose is, and what would happen if we didn't do it this way. I cannot find anything non-abstract; I need examples to anchor my imagination on. It is clear and good only if you have prior knowledge of the things being discussed. Otherwise there are a million ways to interpret things and even more ways to get lost.

    • @maximmaximov4147
      @maximmaximov4147 1 year ago

      At 11:00 it seems like, if we are talking about pictures, the formula written in blue should generate an image of pure random noise, which doesn't make sense. It should have been done differently, as is said in other articles, so that the random distributions of different images (sets of parameters or pixels) overlap, and the result is not purely random noise, which is not what we're trying to achieve.

  • @madhusudanverma6564
    @madhusudanverma6564 3 years ago

    24:48, how does maximizing the VLB roughly maximize p(x)? Since x is given, p(x) should be constant.

    • @josephpalermo8898
      @josephpalermo8898 2 years ago

      p(x) is actually parameterized, therefore it's not constant.
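
      The identity behind that reply (it holds for any choice of q_phi):

      ```latex
      \log p_\theta(x)
        = \underbrace{\mathbb{E}_{q_\phi(z|x)}\!\left[\log \frac{p_\theta(x,z)}{q_\phi(z|x)}\right]}_{\mathrm{ELBO\ (VLB)}}
        + \mathrm{KL}\!\left(q_\phi(z|x)\,\|\,p_\theta(z|x)\right)
      ```

      Since the KL term is non-negative, the VLB is a lower bound on log p_theta(x); and because p_theta(x) depends on the decoder parameters theta, it is not a constant, so pushing the bound up tends to push log p_theta(x) up as well.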