How Much Information Can Stable Diffusion Handle in a Single Prompt? - Stable Diffusion Deep Dive

SiliconThaumaturgy

6K views

Published: Jun 7, 2024
Have you ever put a complicated prompt into Stable Diffusion and been disappointed by the results? Or was Stable Diffusion unable to portray a relatively simple subject in a certain style? In this video, I do my best to answer the question: how much information can Stable Diffusion handle in a single prompt?
To answer this, I generated thousands of test images to figure out exactly what level of complexity Stable Diffusion can handle with regard to multiple subjects, with and without descriptions. I talk about the design of the experiment, general results, and quantify how often Stable Diffusion is likely to succeed in a variety of scenarios.
Intro - 00:00
Design of Experiment - 01:20
General Results and Discussion - 03:33
Charts and Data - 06:53
Conclusion and Summary - 12:13
Outro - 13:53
#stablediffusion #prompt #aiart #complexity #experiment
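
As a rough illustration of the kind of test design described above, the following is a minimal sketch and not the author's actual script. The subject and descriptor lists, the prompt ending, and the helper names `build_prompts` and `success_rate` are all hypothetical; the video's real vocabulary and scoring method are not given on this page. It enumerates prompts with a chosen number of subjects, with or without a descriptor per subject, and computes a success rate from manually judged outcomes.

```python
from itertools import combinations, product

# Hypothetical vocabulary; the video's actual subjects and descriptors are not listed here.
SUBJECTS = ["dog", "cat", "car", "tree"]
DESCRIPTORS = ["red", "green", "blue"]
PROMPT_ENDING = "high resolution"  # assumed ending that applies to the whole image


def build_prompts(n_subjects, with_descriptors):
    """Return every prompt with n_subjects distinct subjects, optionally described."""
    prompts = []
    for subjects in combinations(SUBJECTS, n_subjects):
        if with_descriptors:
            # Try every assignment of one descriptor per subject.
            for descs in product(DESCRIPTORS, repeat=n_subjects):
                parts = [f"{d} {s}" for d, s in zip(descs, subjects)]
                prompts.append(", ".join(parts + [PROMPT_ENDING]))
        else:
            prompts.append(", ".join(list(subjects) + [PROMPT_ENDING]))
    return prompts


def success_rate(outcomes):
    """Fraction of images judged to contain everything the prompt asked for."""
    return sum(outcomes) / len(outcomes) if outcomes else 0.0
```

For example, `build_prompts(2, True)` yields prompts such as "red dog, red cat, high resolution", and `success_rate([True, False, True, True])` summarises four hand-labelled images for one scenario.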

Comments • 23

@Yeeeeeehaw 8 months ago ⁺²
I've discovered your channel and I'm mind-blown by your hard work!!
Keep going, bro.
@Goodgu3963 1 year ago ⁺⁵
This was an amazing breakdown. I would love to see something of this nature repeated using numbers of descriptors. Prompt artists often use dozens of descriptions and details, and I'm curious how those affect the desired results, and when/if there is a point where it becomes less effective.
@siliconthaumaturgy7593 1 year ago
I might do a followup focusing more on descriptors later on, but for now I'm more focused on ControlNet
@TheRoadTraveledByFew 1 year ago ⁺²
Thanks for making these videos bro, very informative .. appreciate it!
@kridadkool1319 11 months ago
7:00 is how I imagine "Sil" typing in prompts 😂❤
@alexfazio 1 year ago ⁺¹
Your content is invaluable, these are true masterclasses! I have a question about something I find a bit confusing in the way you break down prompts into their elements. In some examples you seem to define a prompt as: Subject(s) + Prompt Ending. But in other parts of the video you talk about the prompt as composed of: Subject(s) + Neutral Elements, Descriptors, Quantity, Style and Artist. Do you reckon the latter elements are all part of the "prompt ending", or are some of these elements part of the Subject(s) description?
@siliconthaumaturgy7593 1 year ago
I consider anything that applies to the whole image to be the prompt ending (neutral descriptors like high resolution, artist, style). The subject is objects, numbers of objects, and specific descriptions (i.e. green dog). Though based on the results for colors I'm not sure SD makes that distinction like we do.
@yigitkaanekimci 11 months ago
Amazing breakdown. As a newbie to SD, I always thought using deepbooru tags, no matter what checkpoint I used, was better than typing directly, especially the long ones. Thanks for the great guide.
@pon1 1 year ago
Very useful! Even when training a model with text descriptions that would allow several subjects it still gets it wrong and duplicates one subject across them.
@SBaldo8 1 year ago
I guess a part 2 of this video including ControlNet capabilities would be extremely interesting to watch
@Space-O-2001 10 months ago
Many thanks for taking the time to do all that (quite the electricity bill :P)
@coloryvr 1 year ago ⁺⁴
Wow! what an elaborate video! Thank you for the work! It will probably stay like this: Before I prompt myself to death, I prefer to use Inpaint, Outpaint, Image to Image and a bit Photoshop ... nevertheless: that was helpful and inspiring!
@blitzar8443 1 year ago
When you use the cutoff extension, it decreases concept bleeding a lot, but I don't know how it actually works
@jonmichaelgalindo 1 year ago ⁺¹
CLIP ViT-L/14 has 8 attention heads. I'm surprised the concept-subject limit is three. (Although this more or less matches my testing.)
@gbennett1000 1 year ago
This is awesome
@lukas5220 1 year ago ⁺¹
great video
@siliconthaumaturgy7593 1 year ago ⁺⁵
Thanks! I was considering doing a followup video to this one but then ControlNet nation attacked.
I was also told to try Latent Couple but I haven't been able to get it to work
@jameshughes3014 1 year ago ⁺³
I always knew patrick was a secret genius.
@gamingthunder6305 1 year ago
I am experimenting a lot with styles and am trying to use descriptors to describe the style I want. Maybe a video on how to craft a prompt to ask ChatGPT for a style description of an artist?
The reason I'm moving away from artists, as you pointed out in your video, is that styles via artist prompts are not universal (the missing cellphone example in your video).
I also noticed that adding "in the style of artist xyz" can influence who is portrayed in the image if it's a person.
Also, some artists are better represented in different token configurations. For example, some work with the phrase "in the art style", some with "style by", some with "art by", and so on.
Bottom line: SD1.5 is currently a mess, and I wish the community would move on to develop tools and extensions for SD2.1.
@MrSongib 1 year ago
Thank god someone did it already, I almost did this stuff myself, and other things. xd
I think you need to post your stuff on Reddit, I think it will help more people.
@siliconthaumaturgy7593 1 year ago ⁺¹
I'm glad you enjoy the videos. I usually post on Reddit when I upload new videos but usually they're not super popular there. My Reddit username is the same as my channel name.
@lenke22 1 year ago
Use Latent Couple if you want to put different things in an image
@Mimeniia 1 year ago
Thanks, my man. I hope the nerds "fix" this pain-in-the-ass liability of the code.
