New Easy VAE Workflow (Stable Diffusion)

Поделиться
HTML-код
  • Опубликовано: 9 июн 2024
  • Using a custom VAE can improve Stable Diffusion images significantly. We walkthrough how to use a custom VAE with the AUTOMATIC1111 webui and also explain what the heck a VAE is and why it helps.
    Discord: / discord
    0:00 - Intro
    1:00 - What is a VAE
    8:32 - How to use a VAE
    11:17 - Comparison
    ------- Links -------
    Comparison Images: / vae_comparison
    AMAZING Video on Variational Autoencoders: • Variational Autoencode...
    Good generalist VAE by Stability.AI: huggingface.co/stabilityai/sd...
    The waifudiffusion VAE I used: huggingface.co/hakurei/waifu-...
    AUTOMATIC1111 Webui: github.com/AUTOMATIC1111/stab...
    ------- Music -------
    Music from freetousemusic.com
    ‘Branch’ by ‘LuKremBo’: • (no copyright music) c...
    ‘Butter’ by LuKremBo: • lukrembo - butter (roy...
    ‘Daily’ by ‘LuKremBo’: • (no copyright music) c...
    ‘Onion’ by LuKremBo: • (no copyright music) l...
    ‘Rose’ by ‘LuKremBo’: • lukrembo - rose (royal...
    ‘Sunset’ by ‘LuKremBo’: • (no copyright music) j...
    Many thanks to LuKremBo
    #stablediffusion #aiart #xformers #tutorials #techtutorials
  • НаукаНаука

Комментарии • 72

  • @zoot.589
    @zoot.589 Год назад +9

    Very Informative! I've been seeing alot about VAEs but have been struggling to understand them. This video helped me out tremendously! Love the content, Keep it up!

  • @TheCopernicus1
    @TheCopernicus1 Год назад +4

    Is there anything you CAN'T explain? Amazing mate!!!!!

    • @lewingtonn
      @lewingtonn  Год назад +2

      Yes lol, why aitrepreneur has so many more subs than me :'(

    • @TheCopernicus1
      @TheCopernicus1 Год назад +1

      @@lewingtonn mate forget the subs they will come, do what mr beast does and translate in multiple languages!!

    • @lewingtonn
      @lewingtonn  Год назад +1

      @@TheCopernicus1 ............... huh

    • @TheCopernicus1
      @TheCopernicus1 Год назад +1

      @@lewingtonn I joined your discord! Also what I meant regarding Mr Beast was the technique he uses for most of his video's is he translates them into multiple spoken languages as there are many ML enthusiasts around the world. He figured he had more non-english speaking friends watching his channel than originally anticipated!

  • @acemax5248
    @acemax5248 11 месяцев назад +1

    I just really like the way you explain complex stuff. really appreciate it.

    • @lewingtonn
      @lewingtonn  11 месяцев назад

      hawhahah really? I'll have to visit sometime

    • @acemax5248
      @acemax5248 11 месяцев назад

      @@lewingtonn Sure, That would be fantastic! Let me know when you're thinking of coming 😀

  • @friendofai
    @friendofai Год назад

    Great video as always Koiboi, always looking forward to what you are creating.

  • @Qubot
    @Qubot Год назад +5

    Nice explaination, nice end cut too !

    • @lewingtonn
      @lewingtonn  Год назад +1

      lol, what the heck, sorry!

  • @ZeroIQ2
    @ZeroIQ2 Год назад +1

    This is very cool, I love the details you go into.

  • @gunseekers
    @gunseekers Год назад

    thx for the expalanation man, very informative and easy to understand

  • @techviking23
    @techviking23 Год назад +1

    Love your explanation!

  • @GuyEshet
    @GuyEshet Год назад +3

    Perfect explanation! You got me to read the actual paper and your video helped me get to the Aha moment!

    • @lewingtonn
      @lewingtonn  Год назад +1

      thanks for commenting it out loud dude, literally so good to hear!

  • @user-hb6dd9iu9g
    @user-hb6dd9iu9g Год назад

    Thank you for this video.

  • @g.kirilov1352
    @g.kirilov1352 Год назад +2

    Pretty much one of the very best content around. Could you do some intro to upscalers, there is a lot of controversy out there regarding those as well.

  • @autonomousreviews2521
    @autonomousreviews2521 Год назад

    Fantastic :) Thank you!

  • @pol3055
    @pol3055 Год назад

    very helpful, thank you

  • @vladimir4614
    @vladimir4614 Год назад +3

    nanomachines?

  • @mrrealpx3189
    @mrrealpx3189 Год назад

    thanks for the easy answer.

  • @alexpangilinan3785
    @alexpangilinan3785 Год назад

    thank you i recently pick up sd and having a problem like washed out color of everything i generate. this actually solve it for me thank you

  • @gdizzzl
    @gdizzzl Год назад +3

    im convinced thats what midjourney v4 is, just a new vae

  • @BoolitMagnet
    @BoolitMagnet Год назад +3

    7:11 You mentioned the Encoder converts the latent back to the exact original image; it actually only returns a very close approximation of the original.

    • @lewingtonn
      @lewingtonn  Год назад +2

      yeah, good point, I should have been a bit more clear about that hey. I should have said it TRIES to convert it back or something.

  • @devnull_
    @devnull_ Год назад +2

    Thank funny hat man. BTW is the VAE technically lossy, so with encoding ---> decoding, when it gets to the end result, is the image a good learning based guess or 1:1 copy of original?

    • @lewingtonn
      @lewingtonn  Год назад

      exactly, VAE are very lossy, it's a good learning based guess!

  • @diego.spirit
    @diego.spirit Год назад

    and if I put in the vae, the automatic option? what will he use?

  • @LM-zj7xp
    @LM-zj7xp Год назад +1

    The zigzag just looks like normal raster pixels in a low resolution image. Most raster images have them in higher contrast areas. To make a diagonal you need a series of offset square pixels, after all.

    • @lewingtonn
      @lewingtonn  Год назад

      that's exactly what I was trying to point out (I need to work on being clearer): how a diffusion model would have to learn how to offset square pixels to create a diagonal line visual effect, when really it shouldn't have to worry about such details

  • @Nairb932
    @Nairb932 Год назад +1

    Do you happen to know the "Quicksettings list"(for those who don't know, this is a thing in the settings that adds stuff at the top of the webUI) value for VAE and clip skip ?
    Is it SD_VAE and SD_Clip_Skip ?

    • @Shadow_Shinigami
      @Shadow_Shinigami Год назад

      CLIP_stop_at_last_layers, sd_vae

    • @riggitywrckd4325
      @riggitywrckd4325 Год назад

      It is sd_vae I found it by looking at the web page source and searching vae. For those that are like WTH where is this folder and why don't I have stable diffusion section on your automatic1111 don't forget to do a git pull.

  • @texx8205
    @texx8205 Год назад

    So, how to actually make or extract VAE from the full unpruned model?

  • @user-hb6dd9iu9g
    @user-hb6dd9iu9g Год назад

    Could VAE be a PT format?
    One more question: Do you know anything about Stable Warpfusion? Is it another AI or version of SD or it is a model, embedding or promt?

  • @Thefan
    @Thefan Год назад

    I wonder if the Lorenz is related in any way to the guy who had a fractal model named after them?

  • @devnull_
    @devnull_ Год назад +1

    Could you consider doing these virtual chalk board thingies on white background? I may not be majority, but my eyes can't take that black background...

    • @lewingtonn
      @lewingtonn  Год назад +1

      that's weird, I can't staaaand white background. It could be a bit more visible though, I'll try thicker lines or something

  • @SaintMatthieuSimard
    @SaintMatthieuSimard Год назад +2

    Now you got a do muscular Kamala, for equity, y'know.

  • @lithium534
    @lithium534 Год назад +1

    could you do a video on embeddings. I have tested some but it seems they do nothing. Why do we have them?

    • @devnull_
      @devnull_ Год назад

      Why not check Automatic1111's wiki? There's a whole page about textual inversion.

    • @lewingtonn
      @lewingtonn  Год назад

      i literally did one!!!

    • @lithium534
      @lithium534 Год назад

      @@lewingtonn great.
      Did in it's already out or did in it's coming next?

    • @lewingtonn
      @lewingtonn  Год назад

      @@lithium534 it's this one: ruclips.net/video/9zYzuKaYfJw/видео.html&ab_channel=koiboi (aesthetic embeddings = aesthetic gradients), I assume that's what you mean by "embeddings"

    • @lithium534
      @lithium534 Год назад +1

      @@lewingtonn Thanks.
      I was searching embeddings. So this is the other name for it.
      Know I know. Thanks again keep the great content coming.

  • @-Belshazzar-
    @-Belshazzar- Год назад

    Hey, I wonder why do you use 1.3? Is that a better model in your opinion?better than 1.4 and 1.5?

    • @lewingtonn
      @lewingtonn  Год назад +1

      I used waifu diffusion 1.3, which is the most modern version of waifu diffusion (which is a specially finetuned version of stable diffusion 1.4)

    • @-Belshazzar-
      @-Belshazzar- Год назад

      @@lewingtonn ahh I see thanks!

  • @DJVARAO
    @DJVARAO Год назад +1

    I am a bit confused.
    I see no significant differences between your before and after images.
    Shouldn't the "waifu diffusion" model be used as your main model in the prompt-to-text page of the GUI?

    • @lewingtonn
      @lewingtonn  Год назад +1

      You can see a more close-up comparison of the images linked in the description, I think the changes were significant in some cases. I did end up using waifu diffusion when I actually generated the images, but you can use any VAE with any diffusion model. Hope that cleared things up a little.

    • @DJVARAO
      @DJVARAO Год назад

      @@lewingtonn Thanks!

  • @FilmFactry
    @FilmFactry Год назад +4

    Your 100% wrong. Latent Diffusion is magic.

    • @mhnoni
      @mhnoni Год назад

      It's a panadora box, it was a gift from aliens.

  • @gbennett1000
    @gbennett1000 Год назад

    man, you are hilarious

  • @yoavco99
    @yoavco99 Год назад +3

    guys be honest, we all simp 2minutespaper here

    • @lewingtonn
      @lewingtonn  Год назад +1

      especially me :'(

    • @HB-kl5ik
      @HB-kl5ik Год назад +1

      Dear fellow scholars, do you want a 2 minutepapers replacement here?

    • @TheCopernicus1
      @TheCopernicus1 Год назад

      @@HB-kl5ik YES!

  • @VanadiumBromide
    @VanadiumBromide Год назад +3

    Big government got me 🥵🥵

  • @PeppePascale_
    @PeppePascale_ Год назад

    hold down to your papers and beers.. cheers XD

  • @mcgibs
    @mcgibs Год назад +2

    Am I the only one who calls it Auto Eleven?

    • @dbseraph
      @dbseraph Год назад

      only the one eyes pirates that don't see the other two ones...

    • @devnull_
      @devnull_ Год назад

      You are not the only one. 😅

    • @lewingtonn
      @lewingtonn  Год назад +1

      damn, that's way better!

    • @mcgibs
      @mcgibs Год назад +1

      @@lewingtonn "Automatic One-One-One-One" doesn't quite roll off the tongue.

    • @amafuji
      @amafuji Год назад

      A-Quad-1

  • @philosophicalgamer2564
    @philosophicalgamer2564 Год назад

    Donald Trump would win that fight 😏