Tiled Diffusion with Tiled VAE / Multidiffusion Upscaler, the Ultimate Image Upscaling Guide [A1111]
HTML-код
- Опубликовано: 3 июн 2024
- #aiart, #stablediffusiontutorial, #generativeart
This tutorial will cover how to upscale your low resolution images to 4k resolution and above with the Tiled Diffusion with Tile VAE or Multidiffusion extension in A1111. We are going to walk through the workflow in the 1st part of the video, and then go into detail regarding how each setting affects your resulting image. As always, feel free to leave a comment down below. Please leave a like and subscribe if you found my videos helpful. Cheers!
Chapters:
0:00 - Intro
0:30 - Outline of Main Topics
1:08 - Install Tiled Diffusion & Tiled VAE extension
1:48 - Upscalaing Workflow Overview
3:18 - Method #1, Tiled Diffusion & Tiled VAE Only
6:07 - Fast Encoder Color Fix
7:46 - Method #2, Tiled Diffusion + Noise Inversion + Control Net Tiled
11:27 - Upscaling Workflow Recap
12:40 - Detailed Explanation of each parameter within Tiled Diffusion
15:44 - How to install new Upscalers
16:29 - Upscaler Comparison
19:23 - Detailed Explanation of each parameter within Noise Inversion
21:13 - About Tiled VAE
21:45 - About Control Net Tile Down Sampling Rate
22:24 - Quick parameters list/cheatsheet
22:34 - Bonus content: Use Inpainting with Tiled Diffusion & Tiled VAE
My Other Tutorials:
IP Adapter & Control Net: • STOP wasting time with...
Adetailer: • ADetailer in A1111: Ho...
Inpaint Anything Part 1: • How to change ANYTHING...
**If you enjoy my videos, consider supporting me on Ko-fi**
ko-fi.com/keyboardalchemist
Useful links
Tiled Diffusion & Tiled VAE / Multidiffusion github page:
github.com/pkuliyi2015/multid...
Control Net v1.1 models:
huggingface.co/lllyasviel/Con...
Open Model DB - 4x-UltraSharp upscaler download link:
openmodeldb.info/models/4x-Ul...
Open Model DB - 4x-Remacri upscaler download link:
openmodeldb.info/models/4x-Re...
Quick Tiled Diffusion & Tiled VAE settings cheat sheet:
Tiled Diffusion settings
Method: Mixture of Diffusers
Tile Width x Height: 128 x 128
Tile Overlap: 8
Tile Batch Size: 4 or 5
Upscaler: 4x UltraSharp (downloaded) OR R-ESRGAN / SwinIR (stock)
Noise Inversion settings
Inversion Steps: 50
Retouch: 1
Renoise Strength: 0
Tiled VAE - all default settings
Control Net Tiled - all default setting
Control Mode: Control Net is More Important
Resize Mode: Just Resize
**If you enjoy my videos, consider supporting me on Ko-fi**
ko-fi.com/keyboardalchemist
That's a really well made tutorial, i did days of testing before understanding what every parameter do, this video will be a miracle for newbies, good job!
Thanks for your support! I'm glad you liked the tutorial!
i think this is the best ai image workflow kind of video, I do vfx work so you can imagine how I appreciate the different values tests. Subscribed
I'm glad you liked the video! Thanks for your support.
ありがてえ・・・!!!
この情報超助かる!!
Thank you!
Good to see another video from you. Hope this becomes a full time thing for you if that's what you want. Would love to see a video on making videos with animatediff and the alternatives and some workflow tips and tricks and/or how to migrate from A1111 to ComfyUI
Thank you for tuning in! I plan on uploading regularly.
I learned a lot from your video!!!!
Glad to hear that!
thank you sir!
You are welcome!
It's a great TY!
Great video just what I wanted to know about good upscaling methods!
I'm glad this was helpful!
Just one thing İ discovered when using sdxl with tiled vae you have to disable fast decodeing cause it messes up the image unfortunately
@@AresmarThat's good to know! Thank you for sharing!
hello! I really hope that you will have a lesson on restoring badly damaged photos
How to use this in forge diffusion ?
there is a "integrated multidiffusion" but its options are not the same at all. and, I can't install the a1111 extension in it (even after removing the integrated multidiff". that's really annoying.)
I have not tried this in Forge. I read on github that the Forge extension might not be working very well. Link here for reference: github.com/lllyasviel/stable-diffusion-webui-forge/issues/124
Helloo great video! I've been doing the same to upscale my renders. Is there any chance you could show how to do the same process in comfyUI?
I have not dived into ComfyUI very much; maybe in the distant future. Thanks for watching and for the sub!
Thank you for this tutorial and all the others, they are really helpful for beginners like me ! 👌
I have a question regarding the end result, I don't really know how to describe it but when I zoom in a lot, they are little squares edges visible, like cracked skin, tile edge artifacts or I don't know.. is this something normal ? Can we do something to reduce this effect ?
Thanks, I'm glad you liked my tutorials! The edge artifacts are definitely not normal. Which workflow did you follow to get to your final image? It's hard to tell without some additional information, such as which method you used or what settings.
@@KeyboardAlchemistI don't know exactly if it had a link with my issue but I changed tile overlap to 16 and upscale with more resolution and now everything seems ok ! But I don't know if it could also be linked to the checkpoint, as I often change it
@@virtualj8561Both tile overlap value and the checkpoint could be a source of your issue. Most of the time, if you just change one variable at a time, you can figure out what went wrong. Good to know that you solved the problem! Cheers!
Youre the guy when it comes to A1111 turorials. Am i missing something or this is all very similar to SD Ultimate upscale + controlnet tile? What's the point in using this new method rather than SD ultimate upscale + tile?
I'm glad you asked this question. You are right, the tiled diffusion extension is very similar to Ultimate SD Upscale + CN Tile, but I think there are a few advantages for tiled diffusion. First, you can use this extension directly in txt2img, where as Ultimate SD upscale only shows up in img2img. Second, you have the option of using this extension with CN Tile, but it's not a requirement. You can often get better details by using a combination of Noise Inversion and CN Tile. Lastly, this extension comes with a Regional Prompt Control feature, which is such a great feature! I went into detail in one of my videos: ruclips.net/video/3aIEitw5Pt8/видео.html. I like it a LOT as you can probably tell and overall I think this extension is a bit more intuitive to use. I hope this helps you. Cheers!
Great tutorial! Are those methods also possible with sdxl?
Yes, you can use this extension to upscale with SDXL models as well.
is DemoFusion better than this method? Do I work with it other add on extensions or use it by itself?
The Tiled diffusion / Multidiffusion extension works by itself. You don't really need anything else.
I was wondering if I have LoRAs(sometimes multiple of them), how I would go about balancing them out? I noticed that they tended to over saturate or change the style to much depending on how they interact. Anyways, thanks for all these workflows, was wondering how people upscaled with Multidiffusion!
Not sure if I fully understand what you are asking regarding balancing your LoRAs, but I'll take a stab at it. If all of your LoRAs are meant to modify one object or person in the image, then you would want to adjust the weights and have them add up to 1.0. So say you have LoRAs A, B, and C, and say if you want more of A in your main object, then you can do something like A weighted at 0.6, then B and C both at 0.2. This way, the A LoRA will have more influence over your main object.
@@KeyboardAlchemist Oh I meant something along the lines of if I used region prompter or composable lora+latent couple to generated an image with a specific LoRA background and/or a bunch of different characters, how would I go about tackling it?
I was also wondering, what is the tool you're using to slide and compare images?
@@FlawlessMind-lb3tw With the Tiled Diffusion / Multi diffusion extension it's pretty easy to do this. I am working on the video for this, but here is the short version of it. There is the Regional Prompt Control section within Tiled Diffusion, if you expand it, you will see different regions that you can define, and this will allow you to define prompts specifically for a region of your image. Then you can put LoRAs in your regional prompt. Let's say you want to generate three distinctly different characters in the same image, you can draw out 3 different regions and give each region a different prompt and add the LoRAs that you want.
I also described a different method that can do this with the Adetailer extension, here is the link if you want to check it out: ruclips.net/video/6EraysHdhHE/видео.html
You can google "before and after slider" to find the slider tool, there are a lot of them out there. Hope this helps you!
I must be doing something wrong with the Method #2, my GPU usage goes to 15G and uses system ram and it will take like 10 minutes for a single upscale. Is this normal? I have the same parameters as shown in the video, same starting resolution, same parameters for tiled diffusion, tiled vae and controlnet. What could it be? Also, the image changes a bit, specially the face is very different.
You might want to try halving the encoder and decoder tile sizes in the Tiled VAE section. What kind of GPU do you have and how much VRAM?
Can you give me the upScale 4x-Remacri model on google driver, the website won't let me download it anymore. Thankyou...
Tile diffusion dosnt work for me, it won’t stay checked to show it’s enabled. I checked my config and there are no extensions that are disabled. I’ve also uninstalled and reinstalled it, and it still won’t work.
This extension doesn't seem to work with SD Forge. It'll work in A1111.
✨👌😎😮😵😮😎👍✨
👋
Why do you choose „whole picture“ as inpaint area?
'Whole Picture' uses the entire image as reference for inpainting the new thing while 'Only Masked' uses the masked area plus the surrounding padding pixels as reference. In this situation, the Whole Picture option worked better.
could you just leave the controlnet input image empty?
For such image critical videos, upload high bitrate 4k video 60 fps and not 1080p with blurred quality (by RUclips). Hard to spot differences
Yeah, I just wanted to say the same.
1:58 why do you randomly speedup the AI voice? it's distracting. leave the playback speed at 1x please