Flux Dev Models Explained For Webui Forge | Low VRAM GPU Options

  • Published: 27 Nov 2024

Comments • 84

  • @MonzonMedia
    @MonzonMedia  2 months ago +3

    Let me know which Flux Dev model you are using the most and what GPU you have.

    • @jones77wrx
      @jones77wrx 2 months ago +2

      I am using FP16 since it just works like the more recent SDXL and Pony models.
      I have tried FP8 models but quality and consistency are a bit off. I may be doing something wrong.
      Even with Forge this is a lot more complex than running SDXL models and I am leaning towards using Flux models that have VAE/Clip baked in to avoid error messages.

    • @karaghostsongs
      @karaghostsongs 2 months ago +1

      Honestly, I tried all the Flux models on ComfyUI and did not notice any significant difference in image generation speed. I have an RTX 4070 Ti with 12 GB of VRAM and 32 GB of RAM, but on ComfyUI any Flux model runs slow.

    • @cratesify
      @cratesify 2 months ago +2

      I have a different graphics card than most: an Nvidia Quadro RTX 4000 with 8 GB, plus 32 GB of system RAM. My results are very similar to the video in terms of speed. NF4 is the fastest, but I have been using Q4 with a built-in Hyper LoRA for 8 steps. Leads right into your next video. Models that have the Hyper LoRA baked in just seem to work better in Forge than running the LoRA separately... at least for me.

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      @jones77wrx If you can run FP16, it's definitely the better choice. Personally I like using FP8, but I've been using the Q8 GGUF more lately since it's closer to FP16.

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      For your gpu you may as well run the fp8 or Q8 gguf. I think the nf4 model is better for people with 12gb cards or less.

  • @armauploads1034
    @armauploads1034 2 months ago +6

    I'm using Flux Dev FP8 with an Nvidia RTX 2070 (8GB VRAM) and Forge, and it works great.

    • @MonzonMedia
      @MonzonMedia  2 months ago

      Awesome! Curious what speeds you are getting for 1024x1024 at 20 steps?

    • @armauploads1034
      @armauploads1034 1 month ago +1

      @MonzonMedia I just did the stopwatch test for you: 1:29 minutes (1024x1024, 20 steps, Euler/Simple).
      That's OK for me, I'm a very patient person. The main thing for me is that it runs very stably. 🙂

    • @MonzonMedia
      @MonzonMedia  1 month ago

      @armauploads1034 That's not bad at all, and the good thing with Flux is that it's more coherent for most things, so if you have a good prompt you like, it doesn't take too many images to get it right.
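The stopwatch figures quoted in this thread are easier to compare as seconds per step. A minimal sketch of that arithmetic follows; `seconds_per_step` is a hypothetical helper name, not part of Forge or any library, and a wall-clock reading also includes text encoding and VAE decoding, so it overstates the pure sampling rate.

```python
def seconds_per_step(elapsed: str, steps: int) -> float:
    """Convert a stopwatch reading like '1:29' (m:ss) into seconds per sampling step."""
    if steps <= 0:
        raise ValueError("steps must be positive")
    minutes, seconds = elapsed.split(":")
    total_seconds = int(minutes) * 60 + int(seconds)
    return total_seconds / steps


# Figure reported above: 1:29 for 1024x1024 at 20 steps.
print(round(seconds_per_step("1:29", 20), 2))
```

For the 1:29 run at 20 steps this works out to about 4.45 seconds per step, which can be compared against other cards as long as resolution and sampler match.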

  • @RaphaelRema
    @RaphaelRema 2 months ago +3

    Thanks for making everything so clear. I was VERY confused about those formats. You did a great job! 👍

    • @MonzonMedia
      @MonzonMedia  1 month ago

      Late reply but you're welcome and thank you!

  • @TheColonelJJ
    @TheColonelJJ 2 months ago +2

    Thank you for helping me find the GGUF text encoders!!

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      You're welcome! I wish there would just be one central place for everything but then again, the open source community is big so....it's a blessing and a curse. 😊

  • @dreamzdziner8484
    @dreamzdziner8484 2 months ago +2

    This is the comparison we needed. Thank you so much.🤩

    • @MonzonMedia
      @MonzonMedia  2 months ago

      You’re welcome 😊 Glad it was helpful!

  • @vVinchi
    @vVinchi 2 months ago +1

    Man, image examples on your comparison are so good!😍

    • @MonzonMedia
      @MonzonMedia  2 months ago

      Hey bro! Thanks man...more to come!

  • @me-cm8or
    @me-cm8or 2 months ago +1

    Bro damn that’s so helpful I didn’t know about the loras!! Thanks man

    • @MonzonMedia
      @MonzonMedia  2 months ago

      You're welcome! I just posted the vid earlier today! ruclips.net/video/L0pRdKQSNcM/видео.htmlsi=mggZOx6NSleCiMDM Also if you check the description there is a link to an 8 step NF4 Hyper Checkpoint. I just saw it after making the video on the loras....figures! hahaha!

  • @havemoney
    @havemoney 2 months ago +1

    Thank you, good and useful comparison

    • @MonzonMedia
      @MonzonMedia  2 months ago

      Glad it was helpful! And you're so welcome!

  • @bhargavzantye7688
    @bhargavzantye7688 2 months ago +1

    best explanation. thanks

  • @mik3lang3lo
    @mik3lang3lo 2 months ago +2

    Thank you, I really appreciate your work, I’m not a big fan of comfyui

    • @MonzonMedia
      @MonzonMedia  2 months ago

      You're welcome! I'd always choose Forge over Comfyui but we're still waiting for controlnet support. Hopefully it comes soon.

  • @HDLEGSHOW
    @HDLEGSHOW 1 month ago +1

    I don't have the VAE/text encoder bar at the top. How do I add that to my interface?

    • @MonzonMedia
      @MonzonMedia  1 month ago

      Do you have the latest Forge installed? It should be on by default.

    • @HDLEGSHOW
      @HDLEGSHOW 1 month ago +1

      @MonzonMedia I forgot to run the update file first lol

    • @MonzonMedia
      @MonzonMedia  1 month ago

      @HDLEGSHOW So you’re good to go? There should be another update coming soon to use ControlNet for Flux. Hopefully this coming week or next week.

    • @HDLEGSHOW
      @HDLEGSHOW 1 month ago

      @MonzonMedia My only issue now is speed. Unfortunately I only have 4GB of VRAM, so I think any settings I use will be very slow for Flux.

  • @liquidmind
    @liquidmind 2 months ago +2

    With Forge UI I can run Dev FP8 with only 6GB VRAM... it takes 1:50 minutes for 20 steps on Euler/Simple, and I can go as high as 1440x1080, which takes 3:30 minutes... WAY FASTER than Comfy!!! GO FORGE!!!! And in my experience, Q8 is NOT better than FP8 because it's slower with the SAME quality, so...

  • @nikijs877
    @nikijs877 2 months ago

    Great video, but how did you get those speeds on an 8GB card? I have a laptop 3070 with 8GB VRAM and 32GB RAM, and I generated one image with FP8 in over 4 minutes. I used the settings you showed in the video.

    • @MonzonMedia
      @MonzonMedia  2 months ago

      Yeah that's way too long for your card. The only thing I can think of is that I'm using SSD drives to store my models and also to run Forge. Also make sure you are not using CPU for the swap location.

    • @nikijs877
      @nikijs877 2 months ago

      @MonzonMedia All of my models are on the same drive as Forge. I am using shared as the swap location. What should I use for the swap method? I have tried Flux NF4, NF4 v2, FP8 and even the 3- and 4-bit GGUF models. The fastest generation I had was 4 minutes and 27 seconds with the NF4 v2 model. Anything else I could try?

  • @4thObserver
    @4thObserver 2 months ago +1

    I've recently come back to Forge, chose Dev-NF4, and it's okay. Just keep the CFG scale at 1 and sampling steps at 20 and it will make a coherent image. The downside is that the images always seem to have large pores on skin or grain to them, even when you use an upscaler. I found that you can use the NF4 in combo with PNY models and it really speeds up my 1024x1024 generations. So I'm really holding off on Flux until the grainy, large-pore image issues are ironed out.

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      I've noticed that as well with some images, even with FP8. You could use another model like SDXL when upscaling but set the denoise really low so it doesn't change the details too much.

    • @4thObserver
      @4thObserver 2 months ago +1

      @MonzonMedia True, and it only really works well on Euler as of right now. I tried it on DPM2 and the DPM++ variants and it creates incomprehensible noise even with recommended settings. Plus FLUX doesn't allow for negative prompting. What's up with that? I'll just wait till a Pony-ish, highly refined model comes. Because I'll say this about FLUX: it's good, accurate even, but a bit rudimentary IMO.

    • @fixelheimer3726
      @fixelheimer3726 2 months ago

      @4thObserver Pony have announced their next model will be based on AuraFlow. Don't know if Forge already supports that, but others do.

    • @fixelheimer3726
      @fixelheimer3726 2 months ago

      NF4 is fine quality-wise. You might want to try LoRAs for more photorealistic images. With SDXL and prior it also took a while to get realistic skin textures.

    • @azmeenafandi
      @azmeenafandi 2 months ago

      @4thObserver You can get negative prompts enabled for FLUX in Forge by setting CFG to 1.1

  • @danwood4171
    @danwood4171 2 months ago +1

    Just getting into flux. I have flux schnell FP8 1024x1024 4 step running in 2.58 seconds on my 4090.

    • @MonzonMedia
      @MonzonMedia  2 months ago

      Nice! If you're running a 4090 you should be using the full Dev model, which is the FP16, or at the very least FP8.

  • @hmm-m
    @hmm-m 2 months ago +1

    Hi! I still use Fooocus and learned it from your earlier videos, some time ago, before you moved more to Suno and Udio. Is it possible to use Flux in Fooocus (not Ruined Fooocus) on a GTX 1080 8GB?

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      Unfortunately not and most likely it won't happen. Fooocus is coded for SDXL, it would take a lot of work to add Flux as the architecture is different. But who knows? Perhaps the developer (who is the same one for Forge) might consider making a version just for Flux.

    • @hmm-m
      @hmm-m 2 months ago

      @MonzonMedia Thank you. Is WebUI the easiest interface to run it?

    • @hmm-m
      @hmm-m 2 months ago

      I see that WebUI is quite user friendly, also prepared like Fooocus by lllyasviel. I will give it a go! Would you consider making a film about setting up Flux in WebUI Forge?

  • @Kmaroz
    @Kmaroz 2 months ago

    Hi there. Just a quick question: do I need to download the encoder and safetensors files to run GGUF, or can the safetensors file be used standalone?

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      The GGUF files can go into your checkpoints folder, just like any other model; however, you have to use the T5 text encoder, Clip_l and VAE. The first generation will take some time to load but the next one will be faster. Hope that helps.
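The reply above names four pieces a GGUF setup needs: the GGUF model, the T5 text encoder, Clip_l and the VAE. A minimal sketch of a pre-flight check for them follows, assuming you fill in your own paths. The function name `missing_components` and every folder and file name below are hypothetical placeholders, not a statement of Forge's actual layout.

```python
from pathlib import Path


def missing_components(required: dict[str, Path]) -> list[str]:
    """Return the names of components whose file does not exist on disk."""
    return [name for name, path in required.items() if not path.is_file()]


# Placeholder paths: replace them with wherever your own files live.
models = Path("models")
required = {
    "gguf_model": models / "checkpoints" / "flux1-dev-Q8_0.gguf",
    "t5_text_encoder": models / "text_encoder" / "t5xxl.gguf",
    "clip_l": models / "text_encoder" / "clip_l.safetensors",
    "vae": models / "vae" / "ae.safetensors",
}

missing = missing_components(required)
print("Missing:", ", ".join(missing) if missing else "nothing")
```

Running a check like this before the first generation would only tell you that a file is absent, not whether it is the right version.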

  • @Alex_1729
    @Alex_1729 2 months ago +1

    I just installed Pinokio and flux-webui for the first time. I was hoping I could download and run the Juggernaut XL model on 16GB of RAM and 6GB of VRAM. Is it possible? I can't find requirements for these models anywhere.

    • @MonzonMedia
      @MonzonMedia  2 months ago

      You should be fine, although 16GB of system RAM could be a bottleneck for some tasks. Juggernaut is an SDXL model, which is under 7GB.

    • @Alex_1729
      @Alex_1729 2 months ago

      @MonzonMedia Would you mind sharing which tasks those could be? Isn't this just image generation? Also, if a model is under 7GB in size, what does that mean exactly? That the GPU can run it? I'm new to all this so that's why I ask, thanks
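On what a file size means: a model's weights take roughly (parameter count × bits per weight ÷ 8) bytes, and that is about how much memory they occupy once loaded, before activations, text encoders and the VAE are counted. A rough sketch follows, assuming about 12 billion parameters for FLUX.1 dev (a figure not stated in this thread) and nominal bit widths that ignore quantization metadata. `weights_gb` is a hypothetical helper, not part of any tool mentioned here.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in gigabytes (10^9 bytes)."""
    return params_billion * bits_per_weight / 8


# Assumed parameter count for FLUX.1 dev; nominal bits per weight for each format.
FLUX_DEV_PARAMS_BILLION = 12.0
for label, bits in [("FP16", 16), ("FP8", 8), ("NF4", 4)]:
    print(f"{label}: ~{weights_gb(FLUX_DEV_PARAMS_BILLION, bits):.0f} GB")
```

When the estimate is larger than the card's VRAM, part of the model generally has to be offloaded to system RAM, which is slower; that is one reason lower-bit formats tend to be suggested for smaller cards.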

  • @ang454
    @ang454 19 days ago

    Flux and PonyXL restart my PC. I am using an RTX 4060 with 8GB VRAM and 16GB RAM, with Forge. Any idea what's wrong?

  • @Nukaisme1
    @Nukaisme1 2 months ago +1

    Hi, can we manipulate the GPU on Flux? I'm actually using an old mining GPU on A1111 and it runs,
    but when I try to use Forge it detects my P106 as is, so it doesn't work and uses the CPU instead. Any solution other than buying a new one?

    • @MonzonMedia
      @MonzonMedia  2 months ago

      Best to ask in the discussion page on GitHub. Not sure to be honest. github.com/lllyasviel/stable-diffusion-webui-forge/discussions

  • @MdNoman-tl6yf
    @MdNoman-tl6yf 2 months ago

    I have an RTX 4060 Ti 16GB. Can I do ComfyUI Live Portrait with this GPU?

  • @MAG_NUZ
    @MAG_NUZ 2 days ago

    I'm still hoping to use Flux, but so far I keep getting an error. I have an M1 chip and I use Forge.

  • @moonduckmaximus6404
    @moonduckmaximus6404 1 month ago +1

    I can't seem to find FP16 anywhere

    • @MonzonMedia
      @MonzonMedia  1 month ago

      Original unet huggingface.co/black-forest-labs/FLUX.1-dev/tree/main, GGUF version huggingface.co/city96/FLUX.1-dev-gguf/tree/main. Both need to be used with text encoders and VAE.

  • @b.radical
    @b.radical 2 months ago

    All my renders are coming out blurry, no matter which checkpoint I use

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      @b.radical Can’t help you, man, if I have no details. Settings? Size? Context?

    • @b.radical
      @b.radical 2 months ago +1

      @MonzonMedia Rebooting fixed it, I guess I had gunk in my VRAM

  • @OniNylon
    @OniNylon 2 months ago

    Can I run this on my Android phone?

  • @Vanced2Dua
    @Vanced2Dua 2 months ago

    Please share a link for 4GB VRAM

    • @MonzonMedia
      @MonzonMedia  2 months ago +2

      All the files are listed in the Google doc in the description. For your card you can try the lower Q GGUF models. Also download the T5 text encoder that is a GGUF file as well.

    • @fullmatchfullhighlight
      @fullmatchfullhighlight 2 months ago

      @MonzonMedia Which folder is the T5 stored in?

    • @MonzonMedia
      @MonzonMedia  2 months ago

      @fullmatchfullhighlight see 6:05 👍

  • @TheCopernicus1
    @TheCopernicus1 2 months ago +1

    I need a PC! lolll

    • @MonzonMedia
      @MonzonMedia  2 months ago +1

      Yeah that would help! hahaha!

  • @courtmanr
    @courtmanr 2 months ago +2

    dev, schnell, nf4, fp8, gguf, vae, clip... this simpleton is confused

    • @MonzonMedia
      @MonzonMedia  2 months ago +2

      It's a lot to take in if you are new to all this. I'm preparing a beginner's video very soon.

    • @HikingWithCooper
      @HikingWithCooper 1 month ago

      @MonzonMedia Subbing based on this reply. I've been doing this since the first models were released and it's still confusing AF. Then if you ask the interweb why you get a CLIP error, I think the reply is in actual computer code LOL. Using Pinokio helps a lot but its simplicity actually introduces other difficulties (i.e. the CLIP error). It would be cool if your tut included file folder management, because it seems like every time a new model comes out it sets up a new folder structure, so who knows how many 10-25GB unused models we have sitting around on our hard drives. Maybe none? I don't know.

  • @megapin1
    @megapin1 2 months ago +1

    GTX 1060, q4.gguf model, 720x720: 13 min (((