SDXL 1.0 blows away Stable Diffusion 1.5. And here is the testing to prove it.

Поделиться
HTML-код
  • Опубликовано: 7 июн 2024
  • In this video, I will compare the newly released SDXL 1.0 checkpoint to both base Stable Diffusion 1.5 and top tier Stable Diffusion 1.5 checkpoints to see how they compare.
    I start out by discussing the architecture of SDXL compared with SD 1.5 to see how they compare on paper.
    Afterwards, I show the results of extensive testing in Automatic1111 and ComfyUI comparing SDXL to SD 1.5 in terms of maximum image size, generation speed, and VRAM requirements. And finally, do quality testing to see where SDXL is in terms of hand quality and the maximum practical image size compared to SD 1.5.
    Intro - 00:00
    The Nerdy Part - 00:41
    Parameters - 00:53
    Text Encoder - 01:27
    Pipeline - 03:19
    File Size - 04:23
    Maximum Image Size - 05:15
    Minimum VRAM Requirements - 06:41
    Generation Speed - 07:13
    Hand Quality - 09:29
    Twinning/Practical Image Size - 11:03
    Style - 12:43
    Final Thoughts - 13:56

Комментарии • 92

  • @LiLGWaez
    @LiLGWaez 10 месяцев назад +9

    I JUST discovered ur channel, and instantly had to subscribe.
    This channel feels like an absolute GOLDMINE. Thank you so much for covering all of this stuff, it feels like ive stumbled upon something amazing.
    I love how enjoyable ur narration is too. Hahaha.
    Keep it up man. I'll be watching ur videos on hands after this. Ah those pesky hands. One day i'll master them. Hope u have a great day dude.

  • @pon1
    @pon1 10 месяцев назад +1

    Subscribed, best comparison of them all so far!

  • @Modioman69
    @Modioman69 10 месяцев назад +3

    I highly enjoy your style, explanation and methods you use for your content and you deserve massively higher viewer counts/subs. You’re a gem in a community of over saturated content. Keep up the great work. I do still feel 1.5 is way ahead until SDXL gets more trained models which will definitively surpass 1.5 just based on everything you explained here. I will wait to try until then. Excellent video.

  • @marcinszuszkiewicz
    @marcinszuszkiewicz 10 месяцев назад +2

    Thanks for your very informative review; subscribed

  • @flisbonwlove
    @flisbonwlove 10 месяцев назад +1

    Nice review mate !!

  • @CaptDabbs
    @CaptDabbs 24 дня назад

    dude, i just jumped in around Xmas 23 w/ 8g 3060 now its 500gigs of stuff & you just gave the best little darn talk ive seen so far. subbed.

  • @vvidover
    @vvidover 10 месяцев назад

    Darn good breakdown. Well done.

  • @RedmotionGames
    @RedmotionGames 10 месяцев назад

    Thanks for this video. Much more useful info than other people seem to be giving, re file sizes, etc. Sub'd. Will try Comfy (A1111 giving out of memory errors)

  • @ai_and_gaming
    @ai_and_gaming 9 месяцев назад

    Fantastic video!

  • @blitzar8443
    @blitzar8443 9 месяцев назад

    Thanks for the info. I might get back into SD after people have some more time to train SDXL models to experiment with it myself.

  • @3diva01
    @3diva01 10 месяцев назад +9

    My PC struggles hard core with SDXL, even with 12 GB vram. Also I've been keeping an eye on the new images that people have been producing with SDXL and haven't yet seen a huge increase in the quality of the images compared to some of the best 1.5 models. So I'm sticking with SD 1.5 for now. Hopefully by the time the community makes great looking models for SDXL someone will also find a way to make it run better in Automatic1111.

    • @WoodenCreationz
      @WoodenCreationz 10 месяцев назад +1

      Agree! Running 16gbs of Ram and it sucked down all my ram and just locks up. Ripped it out and back to the drawing board.

    • @3diva01
      @3diva01 10 месяцев назад

      @@WoodenCreationz Yeah, it's definitely not worth using right now, IMO. Maybe in a couple of months it will run smoother and have better models to work with. For now I'm definitely sticking with 1.5 as I'm getting great results with it that look much better than what I've been able to get out of SDXL.

    • @user-yj3mf1dk7b
      @user-yj3mf1dk7b 10 месяцев назад

      guys, read docs sometimes.
      -- medvram , will fix all issues + check settings to reduce VRAM usage.
      a lot of shit to reduce VRAM

  • @user-cz3io5tg5l
    @user-cz3io5tg5l 10 месяцев назад

    Hi, awesome comparison, but I have a question. Is it okay that in comfyUI on GPU 3060 12gb 1024 x 1024 image generating for 20-30s but if I change prompt, time increases up to 60-100s for 1 generation and that happens only if refiner connected. And that is a huge problem when I want to experiment with prompts (I am using official workflow)

  • @makebritaingreatagain2613
    @makebritaingreatagain2613 5 месяцев назад +1

    0:05 I prefer the one on the left. It looks way more interesting.

  • @wagmi614
    @wagmi614 10 месяцев назад

    can you make a video on all the text encoders available and see which one can be used in img2img to get the best prompt from the image

  • @user-pc9hm9xg6p
    @user-pc9hm9xg6p 7 месяцев назад

    Tnaks for your Info,explanation best. I have a question how do you compare the sd1.5 and sdxl in size image。I want to know why sd1.5 costs more time than sd2.1 in big more 1 megapixels

  • @Chris3s
    @Chris3s 10 месяцев назад

    Do you know if I should switch to invoke 3.0 (it laos now has nodes) from Auto1111 or just switch to comfyUI (for SD 1.5)? Heard comfy uses less VRAM, how easy is it to use controlNET there? A comparisson video between those might be interesting (all 3 using nodes, with auto1111 using the comfyUI plugin).

    • @user-yj3mf1dk7b
      @user-yj3mf1dk7b 10 месяцев назад

      should you eat with fork or a spoon? who the hell knows. it depends.

  • @genin69
    @genin69 10 месяцев назад +1

    awesome information, thanks for the hard work and nerdy deep dives. ill look at SDXL in about 4months. at the moment its just no good at all. no creativity between generated seeds. mostly the same composition after doing about 40 odd renders on a single prompt. in sd1.5 I would get incredibly diverse images with wild imaginative results that always blow my mind. its like buying a camera, never ever buy the first model. always wait at least 6months to a year and get the version2.

  • @moki123g
    @moki123g 10 месяцев назад +4

    While XL 1.0 does look great, I think I am going to hold off for a while and let people work their magic on it. I subscribed, and thanks for doing these tests!

    • @LagiohX3
      @LagiohX3 10 месяцев назад

      Loras not working on it such a negative its starting from zero again.

  • @mistermcluvin2425
    @mistermcluvin2425 10 месяцев назад +2

    Thanks! Great information. I just started using sdxl yesterday and it's very impressive. The vram requirements are crazy tho. I wonder how this will affect its widespread adoption?

    • @siliconthaumaturgy7593
      @siliconthaumaturgy7593  10 месяцев назад +3

      Right now 8GB seems like a lot of VRAM compared to 4GB for SD 1.5. But keep in mind that Stable Diffusion 1 required 10GB of VRAM when it was originally released. Things have been getting more and more efficient.

    • @ikariameriks
      @ikariameriks 10 месяцев назад

      ​@@siliconthaumaturgy7593I hope so, because sdxl struggles on 12GB VRAM as of now. Could get fixed in a few days though. And not XL itself but the many programs that work it.

    • @Axherion
      @Axherion 10 месяцев назад

      ​@@siliconthaumaturgy7593So you think even 4GB ram will be enough to use that version of XL ?

  • @krystiankrysti1396
    @krystiankrysti1396 10 месяцев назад

    Do You plan to do study on which SD 1.5 models are highest resolution ? I tested some with 600x1200 and some pass and some fail, lot of "best" or most downloaded models fail

  • @Seany06
    @Seany06 10 месяцев назад

    I'm running it on 8gb with base and refiner. Works fine but probably not gonna be able to use controlnet when it arrived unless it gets further optimized, hopefully.

  • @Sheevlord
    @Sheevlord 10 месяцев назад +1

    Thanks for the comprehensive explanation!
    I was worried that my 8 Gb GPU would prevent me from trying SDXL but it looks like it will be just barely enough. I should give it a try

    • @vallejomach6721
      @vallejomach6721 10 месяцев назад +2

      Didn't work for me using A1111. Caused BSOD a couple of times. Had it generate an image a couple of times but then sending to img2img and changing to the refiner model either failed and reverted to the base model, thus didn't work, or crashed.
      First image from start up took about 7 or 8 minutes to load the model and then the actual image generation for a single image took about 10 minutes. Far too slow and painful to use for me.
      Comfy UI may work better but that'll be for another day to look at for me.

    • @Sheevlord
      @Sheevlord 10 месяцев назад

      @@vallejomach6721 Dang, that sounds rough.

    • @LagiohX3
      @LagiohX3 10 месяцев назад +1

      ​@@SheevlordI have a 3090 but only 16gb ram (had to sell ram, had 64gb) and it makes my PC freeze when there is no more ram just by changing to the model. I managed to try it once but i will have to wait till my new ram arrives.

    • @Sheevlord
      @Sheevlord 10 месяцев назад +1

      @@LagiohX3 It's a silly question, but do you have swap partition or file enabled?

  • @user-sh9de8vu5h
    @user-sh9de8vu5h 10 месяцев назад

    Great!

  • @WifeWantsAWizard
    @WifeWantsAWizard 9 месяцев назад +1

    Thanks for the update. A few notes...
    (3:00) Or maybe they were just trying to pad their stats by adding together two parameter counts.
    (3:19) In that diagram, there are inputs for "Prompt 1" and "Prompt 2". But, the user only enters one prompt, right? So, should we trust people who can't put together an accurate flow chart or are we actually doubling up our side of the workflow?
    (3:48) So, actually it's two **products** in one: text-to-image and image-to-image--NOT a new way of doing business with "two different models", right?
    (4:59) A "whopping" 12GB? Starfield is 15.48 GB--and that's just one game. Newegg has 1TB SSDs for $59.
    (7:29) Why did you swap colors between slides?
    (10:00) HAND score? What objective mathematical evaluation does THAT use?

  • @MrSongib
    @MrSongib 10 месяцев назад +1

    I want to try use fine tune sd 1.5 then use the Refiner in img2img or Refiner at low res then into img2img for highres, some people already try this stuff. (seems faster and seems fun)

  • @Bericbone
    @Bericbone 10 месяцев назад +1

    Refiner is NOT meant for img2img. It's meant to interpret leftover noise from the base model. That means you stop the generation before the image is done generation, and pass the leftover latent to the referine to complete the image. If you use the refiner for Img2Img you are going to get inferior results from doing it this way. Also, it's not meant for higher resolutions than what the image is generated at. You should not use it for upscaling.
    Auto1111 has not currently implemented correct use of the refiner.

    • @BIG_PASTA
      @BIG_PASTA 9 месяцев назад

      Thanks for the info! Is there a webui/colab type option out there to use it correctly?

  • @jdnaveen321
    @jdnaveen321 10 месяцев назад +1

    i have 3080ti, i5 12600k, 32gb ddr5 ram yet sdxl model loading alone takes quite long time and generating with it takes huge, deleted them and gonna stay with 1.5 for now since it has long way to go

  • @Steamrick
    @Steamrick 9 месяцев назад

    One thing I've noticed that in prompts describing *two* subjects doing something (for example, mother and son building a sand castle), SDXL blows SD1.5 away by so much, there's no possible comparison.
    SD1.5 can compensate by using regional prompter (or comparable), but it's basically incapable of doing it natively.

  • @TheShadiya
    @TheShadiya 7 месяцев назад

    Updated model that is better than its predecessor? wow!

  • @FusionDeveloper
    @FusionDeveloper 10 месяцев назад +2

    Try Photon model file for SD 1.5 at 1024x1024. It's amazing.

    • @generalawareness101
      @generalawareness101 10 месяцев назад +1

      I just found that one and agreed but it doesn't do something I prompted so I switch back to 2.1 model that does. Shocked at the quality I was getting though for what it did give me.

  • @mistraelify
    @mistraelify 5 месяцев назад

    I really want to tell anyone (and the author) reading my comment that SD 1.5 has it's own advantages against SDXL. Also you cannot compare your SD 1.5 prompts and seeds with SDXL it's bad behavior. They're completely different in terms of understanding how you ask the models and how they're processed. Yes you have better space, Yes you can have more consistent results without needing to upscale, Yes you can add more prompting informations which is more precise. But it has also it's downs.
    Many concepts died with SDXL, Many LoRa's needs compatible models, Weights are very difficult to handle to get what you want, Prompts needs to be more precise according to your model to really achieve something decent.
    For rendering basic prompts with specific art style and LoRa's it's good but going further without training is very difficult to achieve what you want.
    Besides, very good video, just wanted to clarify: NO SD 1.5 is NOT wiped out by SDXL at all !

  • @coloryvr
    @coloryvr 10 месяцев назад

    Big FANX for that great Video! ...so....Just one Question: Can I run Deforum on SDXL?

  • @Axherion
    @Axherion 10 месяцев назад

    I have 3050Ti what do you think should I stay at 1.5 or XL ?

    • @igorthelight
      @igorthelight 10 месяцев назад +1

      You will struggle with SDXL
      Stay on SD 1.5 for now.
      And start saving for RTX 4070Ti (16 Gb version) or at least RTX 3060 (12 Gb version)

    • @Axherion
      @Axherion 10 месяцев назад

      @@igorthelight For laptop version right until I save some money XL will much better also will be free right 🤔

    • @igorthelight
      @igorthelight 10 месяцев назад

      @@Axherion While you have not so powerful PC, Stable Diffusion 1.5 would be your choice ;-)
      You may try SDXL, but most likely it will not work or would work very slow.
      Both are Free and Open Source. Both could be run locally (on your PC instead of from some remote website).

  • @ericneo2
    @ericneo2 8 месяцев назад +1

    I don't understand why these models cannot use system memory and only VRAM. If I have a server with 1024GB of system memory why can I not use it? Why are we being limited to only VRAM?

    • @GooseAlarm
      @GooseAlarm 7 месяцев назад

      I have the same question. :/

    • @ericneo2
      @ericneo2 7 месяцев назад

      @@GooseAlarm GDDR6 only has 2 memory channels most servers have 4-8. It just feels like a manufactured problem to sell more expensive GPUs.

    • @Kaucukovnik666
      @Kaucukovnik666 6 месяцев назад

      @@ericneo2I My thoughts exactly. Feels like AI is just filling the void left by crypto.
      "Hey, crypto is crashing and nearly all games aim at the weakest current console's graphical capabilities, we need to utilize all those overkill specced GPUs somehow. Why not, say, throw machine learning at them? And call it AI, cos it sounds way cooler!"
      "Raytracing" is selling stupidly powerful (and power hungry) GPUs to gamers, and "AI" is doing the same for tinkerers. Not actual artists really, those don't need (or even want) a high resolution output straight from a text prompt.
      In particular Auto1111 seems especially "efficient" at consuming resources. ComfyUI generates 1024x1504 images for me (after zero configuration) while A1111 eats up all my VRAM just trying to load a model, no matter the setup. It doesn't even account for all the memory used, its numbers don't add up to the total memory available, but it needs more anyways.
      Anyone brings up an issue, gets "dude, you need a better GPU" responses and soon after the bug report gets silently closed as inactive. Even in such obvious scenarios like always claiming it needs exactly 20MB more. Doesn't affect 24+GB card owners and they can at least feel good about their purchase. It wasn't an overkill for bragging rights, it was a necessity!

  • @RyokoChanGamer
    @RyokoChanGamer 10 месяцев назад +2

    I gave up using sdxl 1.0 on automatic1111, I tried for hours surrounded by errors, until finally after spending the night awake, I managed to make it work, however, very slow and uses EVERYTHING that my pc has (16gb of ram, 20gb of vram( 12 of the card + 8 shared), half of the cpu, and 100% of disk usage with paging file), I click to generate the image, I release everything and I'm just looking, because it's impossible to use or move the mouse... even if image generation does not take so long (about 40s at 1024x1024) it is not being practical to use, I tried to use it through comfyui and it is working well, but I don't like its interface, I prefer to wait more and follow the evolution, improvements and optimizations until be able to try again to use in automatic1111

    • @TheBobo203
      @TheBobo203 10 месяцев назад +1

      python process takes 20-40gb ram using sdxl on my PC

    • @RyokoChanGamer
      @RyokoChanGamer 10 месяцев назад

      @@TheBobo203 😱🫡

    • @mistertitanic33
      @mistertitanic33 10 месяцев назад +1

      Im running into the same issue. Im so bummed because I really wanted to use Automatic1111 but I guess I may have to use Comfy. Im thinking about just waiting a few months and taking the time to learn the software with smaller models until SDXL because more performant

    • @RyokoChanGamer
      @RyokoChanGamer 10 месяцев назад +1

      @@mistertitanic33 I was testing some prompts and generating some images here in comfyui, the generated images were with much less quality than the ones I generated in automatic1111, I used exactly the same promps, negative and positive (exactly the same, I copied and pasted), cfg, samplers , steps etc... in automatic1111 the images were much prettier... I don't know if I was doing something wrong, but I don't think so, both in automatic1111 and in compfyui, I was using everything in the most basic, fresh installation of webui and comfyui

    • @mistertitanic33
      @mistertitanic33 10 месяцев назад

      @@RyokoChanGamer well I’m definitely gonna wait until I can get it to work on Auto. Btw what are your specs? Im running on a rtx 2070 and 16 gb ram. I have xformers on but that doesn’t seem to be enough

  • @Difdauf
    @Difdauf 10 месяцев назад

    I don't think this comparison is totally fair. We forgot to mention that loading SDXL could freeze your computer for 15 minutes. That thing is far more greedy than just lot of VRAM.
    This isn't exactly "fun" to use.

  • @diyaaelhak
    @diyaaelhak 10 месяцев назад

    do you bleave me, if i told you, i watch your (entire:1) videos, in one sitting, and yes, you give us pure, [valuable|information:0.5], ((so thank you)),
    by the way I am confused in somethings like what is stabel defusion, is it a model or a technology deal with models, is other AI using same SD or they have own AI, why you comparing SDXL with dreamshaper and darksushi, I am literally confused, and google have no answers for the basics.

  • @lakislambrianides7619
    @lakislambrianides7619 10 месяцев назад

    I don't know about parameters but what everybody says about SDXL regarding fingers and humans that are not close up it shucks. Can't understand why #midjourney is always years ahead

  • @coreyhughes1456
    @coreyhughes1456 10 месяцев назад +3

    For now 1.5 is still better, faster, and easier to use. Hoping to be proven wrong in the near future.

    • @TheBobo203
      @TheBobo203 10 месяцев назад +1

      interesting to see our favourite models with 2.5x more precision

  • @zafiralpstv8004
    @zafiralpstv8004 10 месяцев назад

    SDXL 1.0 is even better

  • @user-em2dp7wd4e
    @user-em2dp7wd4e 6 месяцев назад

    cba watching this vid with amount of ads

  • @erics7004
    @erics7004 10 месяцев назад

    I have 4gb vram and I could run SDXL 1.0 with 1024x1024, 4 minutes for a single image, but it's worth it.

    • @panyzhal
      @panyzhal 10 месяцев назад +1

      how much ram? i have 32 and 12vram and it consumes everything when generating

    • @alienrenders
      @alienrenders 10 месяцев назад +2

      ​@@panyzhalI have 11GB on 1080ti and it takes 1 minute to generate. I noticed that using lowvram or medvram made it run out of memory or made it extremely slow. So don't use those vram settings in a1111.

    • @panyzhal
      @panyzhal 10 месяцев назад

      @@alienrenders oh thanks maybe is that, i'll check and comment results

    • @panyzhal
      @panyzhal 10 месяцев назад +1

      ​@@alienrenders It worked; it's on the limit of RAM and VRAM but can make an image in 20s. Just opening the model took too long, around 10 minutes.

  • @warlord76i
    @warlord76i 10 месяцев назад

    Well... eats memory like a hungry dinosaur

  • @yokipop9467
    @yokipop9467 10 месяцев назад +1

    why sd never have version 3 🤣, they always change the name.. sd2.1 to SDXL 1.0.. back to 1 again

  • @jasemali1987
    @jasemali1987 5 месяцев назад

    Again CFG effect is neglected in your video, too bad

  • @abline11
    @abline11 4 месяца назад

    SDXL is still hopeless at photo realism even with the latest models. I’ve given up with it now.

  • @dogecoinx3093
    @dogecoinx3093 10 месяцев назад

    I hate they make the image burry on purpose to create depth it looks awful

    • @igorthelight
      @igorthelight 10 месяцев назад +1

      Add "blured background" as a negative prompt

  • @FlowerPower3000
    @FlowerPower3000 5 месяцев назад

    SDXL epic fail...

  • @justmyopinion9811
    @justmyopinion9811 10 месяцев назад

    Not really. Some custom SD1.5 are still better. Can't wait for fine tuned SDXL

    • @generalawareness101
      @generalawareness101 10 месяцев назад

      FT SDXL requires over 24GB of vram and DB the same. Lora types are not finetunes so we will get those as they require a lot less but we need real models/checkpoints.

  • @zizyip6203
    @zizyip6203 10 месяцев назад +2

    ComfyUi is a joke. And what I want SDXL 1.0 to do it can't it's a complete joke compared to 1.5

  • @marcus_ohreallyus
    @marcus_ohreallyus 8 месяцев назад +1

    I dont know...maybe im doing it wrong, but I'm pretty good with 1.5 and I tried sdxl recently and thought it looks over-stylized.