New Easy VAE Workflow (Stable Diffusion)
- Published: 9 Jun 2024
- Using a custom VAE can improve Stable Diffusion images significantly. We walk through how to use a custom VAE with the AUTOMATIC1111 webui (a code sketch follows the links below) and also explain what the heck a VAE is and why it helps.
Discord: / discord
0:00 - Intro
1:00 - What is a VAE
8:32 - How to use a VAE
11:17 - Comparison
------- Links -------
Comparison Images: / vae_comparison
AMAZING Video on Variational Autoencoders: • Variational Autoencode...
Good generalist VAE by Stability.AI: huggingface.co/stabilityai/sd...
The waifudiffusion VAE I used: huggingface.co/hakurei/waifu-...
AUTOMATIC1111 Webui: github.com/AUTOMATIC1111/stab...
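For anyone who wants to try the same swap outside the webui, here is a minimal sketch using Hugging Face's diffusers library. The model IDs, prompt, and filenames are illustrative assumptions, not the exact files from the video:

```python
# Sketch: swapping a custom VAE into a Stable Diffusion pipeline with diffusers.
# The webui does the equivalent when you pick a VAE in its settings.
import torch
from diffusers import AutoencoderKL, StableDiffusionPipeline

# Load a standalone VAE (Stability AI's general-purpose fine-tune).
vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse")

# Attach it in place of the checkpoint's built-in VAE.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", vae=vae
)
pipe = pipe.to("cuda" if torch.cuda.is_available() else "cpu")

image = pipe("portrait photo, soft window lighting").images[0]
image.save("with_custom_vae.png")
```

Any VAE with matching latent dimensions can be dropped in this way, which is why the same VAE works across different fine-tuned checkpoints.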
------- Music -------
Music from freetousemusic.com
‘Branch’ by ‘LuKremBo’: • (no copyright music) c...
‘Butter’ by LuKremBo: • lukrembo - butter (roy...
‘Daily’ by ‘LuKremBo’: • (no copyright music) c...
‘Onion’ by LuKremBo: • (no copyright music) l...
‘Rose’ by ‘LuKremBo’: • lukrembo - rose (royal...
‘Sunset’ by ‘LuKremBo’: • (no copyright music) j...
Many thanks to LuKremBo
#stablediffusion #aiart #xformers #tutorials #techtutorials - Science
Very informative! I've been seeing a lot about VAEs but have been struggling to understand them. This video helped me out tremendously! Love the content, keep it up!
Is there anything you CAN'T explain? Amazing mate!!!!!
Yes lol, why aitrepreneur has so many more subs than me :'(
@@lewingtonn mate forget the subs, they will come, do what Mr Beast does and translate into multiple languages!!
@@TheCopernicus1 ............... huh
@@lewingtonn I joined your discord! Also what I meant regarding Mr Beast was the technique he uses for most of his video's is he translates them into multiple spoken languages as there are many ML enthusiasts around the world. He figured he had more non-english speaking friends watching his channel than originally anticipated!
I just really like the way you explain complex stuff. really appreciate it.
hawhahah really? I'll have to visit sometime
@@lewingtonn Sure, That would be fantastic! Let me know when you're thinking of coming 😀
Great video as always Koiboi, always looking forward to what you are creating.
Nice explanation, nice end cut too!
lol, what the heck, sorry!
This is very cool, I love the details you go into.
thx for the explanation man, very informative and easy to understand
Love your explanation!
Perfect explanation! You got me to read the actual paper and your video helped me get to the Aha moment!
thanks for commenting it out loud dude, literally so good to hear!
Thank you for this video.
Pretty much some of the very best content around. Could you do an intro to upscalers? There is a lot of controversy out there regarding those as well.
Fantastic :) Thank you!
very helpful, thank you
nanomachines?
thanks for the easy answer.
thank you, i recently picked up sd and was having a problem where everything i generate had washed-out colors. this actually solved it for me, thank you
i'm convinced that's what midjourney v4 is, just a new vae
[citation needed]
7:11 You mentioned the Decoder converts the latent back to the exact original image; it actually only returns a very close approximation of the original.
yeah, good point, I should have been a bit more clear about that hey. I should have said it TRIES to convert it back or something.
Thanks, funny hat man. BTW, is the VAE technically lossy? So with encoding ---> decoding, when it gets to the end result, is the image a good learning-based guess or a 1:1 copy of the original?
exactly, VAEs are very lossy, it's a good learning-based guess!
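To see that lossiness directly, here is a small sketch that round-trips an image through the VAE and measures the reconstruction error. It assumes diffusers and torchvision are installed; "input.png" is a placeholder for any image whose sides are divisible by 8:

```python
# Sketch: encode an image to latents, decode it back, and measure the loss.
import torch
from diffusers import AutoencoderKL
from diffusers.utils import load_image
from torchvision.transforms.functional import to_tensor

vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse")

# Scale pixels to the [-1, 1] range the VAE expects, add a batch dimension.
img = to_tensor(load_image("input.png")).unsqueeze(0) * 2.0 - 1.0

with torch.no_grad():
    latents = vae.encode(img).latent_dist.sample()  # 8x smaller per side
    recon = vae.decode(latents).sample

# A nonzero error: the decoder returns a close approximation, not a 1:1 copy.
print("mean abs error:", (recon - img).abs().mean().item())
```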
and if I set the VAE option to "Automatic", what will it use?
The zigzag just looks like normal raster pixels in a low resolution image. Most raster images have them in higher contrast areas. To make a diagonal you need a series of offset square pixels, after all.
that's exactly what I was trying to point out (I need to work on being clearer): how a diffusion model would have to learn how to offset square pixels to create a diagonal line visual effect, when really it shouldn't have to worry about such details
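A quick back-of-the-envelope for why the latent space spares the model those raster-level details (the numbers are for SD 1.x: 8x spatial downsampling into 4 latent channels):

```python
# Pixel space vs. SD 1.x latent space for a 512x512 RGB image.
pixels  = 512 * 512 * 3   # raw values a pixel-space model would have to handle
latents = 64 * 64 * 4     # the 8x-downsampled, 4-channel latent grid
print(pixels / latents)   # 48.0 -> the VAE absorbs the pixel-level detail
```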
Do you happen to know the "Quicksettings list"(for those who don't know, this is a thing in the settings that adds stuff at the top of the webUI) value for VAE and clip skip ?
Is it SD_VAE and SD_Clip_Skip ?
CLIP_stop_at_last_layers, sd_vae
It is sd_vae. I found it by looking at the web page source and searching for "vae". For those who are like "WTH, where is this folder and why don't I have a Stable Diffusion section" in your AUTOMATIC1111, don't forget to do a git pull.
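For reference, the Quicksettings list is a comma-separated field under Settings → User interface; combining the two keys mentioned above, the value would look like:

```
sd_vae, CLIP_stop_at_last_layers
```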
So, how do you actually make or extract a VAE from the full unpruned model?
Could a VAE be in .pt format?
One more question: do you know anything about Stable Warpfusion? Is it another AI or a version of SD, or is it a model, embedding or prompt?
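On the extraction question above: a minimal sketch of one common approach, assuming a standard LDM-style .ckpt where the VAE weights sit under the "first_stage_model." prefix (both filenames are placeholders):

```python
# Sketch: pull the VAE weights out of a full Stable Diffusion checkpoint.
import torch

ckpt = torch.load("full_model.ckpt", map_location="cpu")
sd = ckpt.get("state_dict", ckpt)

# In LDM-style checkpoints the VAE is stored under "first_stage_model.*".
prefix = "first_stage_model."
vae_sd = {k[len(prefix):]: v for k, v in sd.items() if k.startswith(prefix)}

# Saved as .pt -- which also touches the PT-format question: VAEs are
# commonly distributed as .pt / .vae.pt state dicts.
torch.save({"state_dict": vae_sd}, "extracted.vae.pt")
```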
I wonder if the Lorenz is related in any way to the guy who had a fractal model named after him?
Could you consider doing these virtual chalk board thingies on white background? I may not be majority, but my eyes can't take that black background...
that's weird, I can't staaaand white background. It could be a bit more visible though, I'll try thicker lines or something
Now you gotta do muscular Kamala, for equity, y'know.
could you do a video on embeddings? I have tested some but it seems they do nothing. Why do we have them?
Why not check Automatic1111's wiki? There's a whole page about textual inversion.
i literally did one!!!
@@lewingtonn great.
Do you mean it's already out, or is it coming next?
@@lithium534 it's this one: ruclips.net/video/9zYzuKaYfJw/видео.html&ab_channel=koiboi (aesthetic embeddings = aesthetic gradients), I assume that's what you mean by "embeddings"
@@lewingtonn Thanks.
I was searching for embeddings. So this is the other name for it.
Now I know. Thanks again, keep the great content coming.
Hey, I wonder why you use 1.3? Is that a better model in your opinion? Better than 1.4 and 1.5?
I used waifu diffusion 1.3, which is the most modern version of waifu diffusion (which is a specially finetuned version of stable diffusion 1.4)
@@lewingtonn ahh I see thanks!
I am a bit confused.
I see no significant differences between your before and after images.
Shouldn't the "waifu diffusion" model be used as your main model on the text-to-image page of the GUI?
You can see a more close-up comparison of the images linked in the description, I think the changes were significant in some cases. I did end up using waifu diffusion when I actually generated the images, but you can use any VAE with any diffusion model. Hope that cleared things up a little.
@@lewingtonn Thanks!
You're 100% wrong. Latent Diffusion is magic.
It's a Pandora's box, it was a gift from aliens.
man, you are hilarious
guys be honest, we all simp for Two Minute Papers here
especially me :'(
Dear fellow scholars, do you want a Two Minute Papers replacement here?
@@HB-kl5ik YES!
Big government got me 🥵🥵
hold on to your papers and beers.. cheers XD
Am I the only one who calls it Auto Eleven?
only the one-eyed pirates that don't see the other two ones...
You are not the only one. 😅
damn, that's way better!
@@lewingtonn "Automatic One-One-One-One" doesn't quite roll off the tongue.
A-Quad-1
Donald Trump would win that fight 😏