March 2024 - Stable Diffusion with AMD on windows -- use zluda ;)

Поделиться
HTML-код
  • Опубликовано: 3 мар 2024
  • SD is so much better now using Zluda!
    Here is how to run automatic1111 with zluda on windows, and get all the features you were missing before!
    ** Only GPU's that are fully supported or partially supported with ROCm can run this, check if yours is fully or partially supported before starting! **
    check if your gpu is fully supported on windows here:
    rocm.docs.amd.com/projects/in...
    Links to files and things:
    Git for windows: gitforwindows.org/
    Python: www.python.org/downloads/
    Zluda: github.com/lshqqytiger/ZLUDA/...
    AMD HIP SDK: rocm.docs.amd.com/projects/in...
    Add PATH for HIP SDK and wherever you copies Zluda files to
    %HIP_PATH%bin
    C:\path\to\zluda\folder
    Start Automatic 1111 webui
    webui.bat
    copy zluda cublas and cusparse to
    ...\stable-diffusion-webui-directml\venv\Lib\site-packages\torch\lib
    delete cublas64_11 and cusparse64_11
    rename zluda files
    cublas.dll to cublas64_11.dll
    cusparse to cusparse64_11.dll
    back in terminal run webui
    webui.bat --use-zluda
    If you have issues with cudnn
    ...\stable-diffusion-webui-directml\modules\shared_init.py
    Add this after def initialize
    torch.backends.cudnn.enabled = False
    If you have a GPU that is not fully supported in hip SDK follow these instructions
    github.com/vladmandic/automat...
  • НаукаНаука

Комментарии • 634

  • @taffyware1059
    @taffyware1059 2 месяца назад +19

    Performance better, worse or equal to Linux ROCm?

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +8

      about 20% - 25% worse than ROCm on linux I would say...but has all the normal features of automatic without any ONNX or Olive stuff that were very irritating.

    • @taffyware1059
      @taffyware1059 2 месяца назад

      @@FE-Engineer ​ Ig its better than having to the all the optimization stuff over and over again, also likely a lot less space is consumed compared to duel booting linux

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Yes. If you hate the idea of dual booting Linux. Or have other reasons why Linux ROCm is not an option. This is a reasonable work around.

    • @ml-qq5ek
      @ml-qq5ek 2 месяца назад +6

      ​@@FE-EngineerI am only getting 1-2it/s on 6900xt with zluda. What is wrong

    • @HamguyBacon
      @HamguyBacon 2 месяца назад

      @@FE-Engineer I used ventoy to run linux and i don't see where people say its easier to install and use, i had a hard time trying to get SD to even run.

  • @swietypiotrprzykurwiciel6488
    @swietypiotrprzykurwiciel6488 2 месяца назад +12

    I just bought a new card and once again I am back to your tutorials. Your videos helped me before, your tutorials are extremely up to date and easy to follow. Thanks man, you're doing a great job here!

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Whahoo! Glad it worked and went smoothly! :). Thanks for watching!

  • @MathieuCruzel
    @MathieuCruzel 2 месяца назад +4

    Thanks a lot for the tutorial. I could not for the life of me get it to work on Fedora and finally this works really well. I moved from a RTX 2060 to anew 7900XT recently and I was getting 1.5x 2x performance on Comfyui but with this I get at last x5 x6 speed when generating with XL Models.

  • @koxu857
    @koxu857 2 месяца назад +3

    can't even imagine how tough was that to work it out. Thanks!

  • @darkenblade986
    @darkenblade986 2 месяца назад +2

    thanks you so much for this tutorial. this worked for me and i have an unsupported 6700xt. first time i got inpaints and sdxl working properly. you do a good job explaining things but the best is how u put the links to everything in the description. makes my life so much easier.

    • @Eminic112
      @Eminic112 2 месяца назад

      what's your performance like with the 6700xt im curious

    • @2ndGear
      @2ndGear 2 месяца назад

      My 6600 XT does 2/its it sucks. Shouldn't have cheaped out on a card lol.

    • @Jay-js6zr
      @Jay-js6zr 2 месяца назад +2

      I also have a 6700xt and am struggling to make it work, would you be able to share any issues you had while setting this up and how you overcame them please? :)

    • @darkenblade986
      @darkenblade986 18 дней назад

      @@Eminic112 between 1 to 2 iters per sec it depends on the prompt. More tokens takes longer.

    • @darkenblade986
      @darkenblade986 18 дней назад

      @@Jay-js6zr I just followed the guide. Wasn't to hard. Make sure you are following it to the letter.

  • @lifekraft
    @lifekraft 16 дней назад +1

    Ty so much for putting time and effort to help random people figure these things. Almost every single one of your recent video helped me navigate this new world of technology and i wouldnt even be able to try it without you. Ty infinitly

    • @FE-Engineer
      @FE-Engineer  14 дней назад +1

      You are very welcome! I am glad they helped! Thank you for watching!

  • @jinxPad
    @jinxPad 2 месяца назад +7

    great stuff! Great tutorial as always, thank you.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Thank you so much for watching :)

  • @Gawdzend
    @Gawdzend 2 месяца назад +1

    I started with one of your other videos, but this one got me officially up and running (on a 6600XT). Much appreciated!

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Glad it helped and worked without issue (hopefully). :) thank you for watching!

    • @White-yz4kw
      @White-yz4kw 2 месяца назад +1

      What is the generation rate of it/s with zluda? Is the generation faster than with directml? Interested to know before installing, I have a rx6600.

    • @Torva01
      @Torva01 2 месяца назад

      ​@@White-yz4kwsame doubt

    • @ottomanherox
      @ottomanherox 2 месяца назад +2

      @@Torva01 sounds like if you've ❌ on HIP SDK it's about 3 times slower than Linux ROCM, atleast according to one test with 6700 XT.
      Safe to say it'd be memory efficient regardless and I'm tempted to try on 6700 but I've to check if it's useful for something else like DLSS maybe because that speed gain is not worth it alone.

    • @matthewfuller9760
      @matthewfuller9760 2 месяца назад

      same gpu. is it too slow to use with expectations of say an image every 1 or 2 minutes?

  • @krizo96
    @krizo96 2 месяца назад +12

    You're a blessing upon this world.

  • @horrid8024
    @horrid8024 2 месяца назад +3

    OMG! Thank you so much for this one! I tried for so long to get this running... All the text tutorials were just too complicated.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      You are welcome! I’m glad it helped. Thanks for watching!

  • @jovabre
    @jovabre 2 месяца назад +3

    Excellent work. Thanks!!!

  • @bernardy91
    @bernardy91 2 месяца назад +1

    Finally, after days of trying, i found your video...really good explanation, and i was finally able to make it run

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      I’m glad it helped! :) thank you for watching!

  • @terrestrialman
    @terrestrialman 2 месяца назад +2

    thank you so much, this was actually not too bad to set up!

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Yea, it is not exactly straight forward, but it is not that bad either. Thank you for watching and the kind words :)

  • @jk-ze2bo
    @jk-ze2bo Месяц назад

    This was a lifesaver! Fiddled 2 days to get Olive ONNX etc working at at least useable level, and after installing zluda using this tutorial (almost) all works out of box without constant tinkering.
    Inpaint sketch does not work proper (renders whole image instead mask area) but it is prob -directml fork issue

    • @FE-Engineer
      @FE-Engineer  Месяц назад +2

      Overall if users don’t want to go Linux and for real rocm. And until complete rocm is on windows. I think zluda is an excellent compromise that still provides tons of functionality for folks in windows. Thanks for watching!

  • @SvenKloevekorn
    @SvenKloevekorn 2 месяца назад +3

    Very nice work, thanks a lot!

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      You are welcome! Thanks for watching!

  • @MortisDG
    @MortisDG 2 месяца назад +1

    I was really getting frustrated with all that shit.. Thank you so much for this video! Finally I can use SD properly again 🙏

  • @shefu689
    @shefu689 2 месяца назад +2

    THANKS A LOT MATE! This is so awesome. I have played with directML and its settings before like hell. My command webui-user.bat argument lines were almost one A4 page.
    i noticed that you need to restart your PC to get new PATH directions to work on WIN11. Without restart you end up getting "failed to load zluda path automatically" and "use skip-cuda-torch-test" info. Also first install will download cublas64_12 and cusparse64_12 instead of 64_11 without using --use-zluda argument with user.bat.Idk why.
    My 6750XT results:
    1.
    1.5 SD models: txt2img 1024x1024: 3.75s/it /average and 1:05min generation time.
    SDXL models: txt2img 1024x1024: 3.50s/it average and 1:10 minutes.
    NOTE: without zluda this was impossible task because instant memory error. and SDXL models generated over 2 minutes with 512x512 resolution.
    2. Memoryusage is now calibrated. With zluda SD using only 10.2gb/12Gb memory and it will free up memory after generation. 15min 1024x1024 -> 2048 upscaling did not encounter memory error. With directML you cant use more than 1.5x upscale and controlnet. No you dont need a control net with zluda. This is awesome.
    3. ControlNet works just fine
    4. Ultimate Upscaler works normally
    5. Inpaint works normally
    AMD pro drivers are slight faster than adrealine version. There is sligh 5-15s delay with adrealine when press "generate" and no delay with ProDrivers. IDK what cause this.

  • @darthilli
    @darthilli 2 месяца назад +1

    Okay, I finally got it working thank you so much, you’ve earned a sub

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Glad it’s up and running! Thank you for watching! :)

    • @darthilli
      @darthilli 2 месяца назад

      @@FE-Engineerkeep up the good work, so much faster now 😌

  • @FormalPluto
    @FormalPluto 2 месяца назад +13

    Very nice tutorial. I've moved onto the NVidia side, but your tutorials were extremely helpful with setting up SD with Olive when I was still using my RX 7800XT.
    Thank you for making it easier for AMD users stuck in windows who are curious about trying SD.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Thank you :)

    • @matthewfuller9760
      @matthewfuller9760 2 месяца назад

      are you seeing better performance when considering price? This is skewed by location of course and the used gpu market. I guess windows is easier. But wont you get better performance with linux?

    • @f1am3d
      @f1am3d 2 месяца назад

      Very nice, because it doesn’t work?

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      This goes back and forth. About a year ago price / performance was on the side of amd mostly but due to continued improvements now nvidia likely has an edge if you can get a good price for like a 3080 or even maybe a 4070 super.
      With AMD. Yes. Linux will give you better performance 99% of the time because full ROCm.

  • @OfficialJohnnyHartmann
    @OfficialJohnnyHartmann 10 дней назад

    You are the best, sir. I have been struggling with getting my 7900 XTX to work with anything. Only one I got to work with Windows was Amuse, which is very lackluster, and it seems like it's dead at this point, and SD.Next with a workaround, which is not great. With the workaround it lacks the ability to run bigger batches, upscaling, inpainting, the pics look choppy, and a lot more. Not great, tbh. And with Linux, that was just a mess. Most wont open, and the few that works will only work of my cpu.
    But with your help, I can finally generate pictures with all the features.
    All hail the king!

  • @RimZeime
    @RimZeime 2 месяца назад +2

    Got it running atlast all thanks to you!!

  • @danielitsfine9818
    @danielitsfine9818 Месяц назад

    Thank you for this. Using onnx and olive was kind of great, getting faster it/s but not being able to use loras and converting models made it not that enjoyable, but it was still good to learn and practice with.

  • @Whyidk
    @Whyidk 2 месяца назад +1

    this video is a blessing thank you!!

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      You are welcome :) thanks for watching!

  • @konstabelpiksel182
    @konstabelpiksel182 2 месяца назад +2

    the last time i followed your comfyui + windows with directml guide, it worked like a charm for my rx6600 for sd15. wondered if this is any faster. got myself a 4070s now tho 😁

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      I believe this should be a decent bit faster than just directml -- if I am remembering correctly, this might be about double the performance of directml alone.

  • @auchucknorris
    @auchucknorris Месяц назад

    jsut got stable difusion installed, failed cuda test then you poped up, thanks heaps

  • @MegaGranj
    @MegaGranj 2 месяца назад +2

    Great tutorial!
    P.S For my 7900XTX perfect argumatent for SDXL, with minimum crashes(one out of ~500 generations) for 1024x1024 is:
    set COMMANDLINE_ARGS=--use-zluda --disable-nan-check --no-half-vae
    set PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True,max_split_size_mb:512

  • @zygimantastauras
    @zygimantastauras 2 месяца назад +1

    Thank you very much, it generates pictures on AMD 6800 with around 5it/s

  • @bigdeutsch5588
    @bigdeutsch5588 Месяц назад

    Finally one that worked. My iterations/ seconds increased about 500% in speed. Thank you!! I do have a question, does soft inpainting work with this implementation of SD? I have not had success running soft inpainting. Thanks

  • @rtchannel8171
    @rtchannel8171 2 месяца назад +1

    Thank you, Work perfectly on my Rx6800 so fast. Amazing.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Fantastic! I’m glad to hear that. Thank you for watching :)

  • @andresalcaino7570
    @andresalcaino7570 2 месяца назад +1

    It work using a rx 7600 xt, thanks for this amazing tutorial, the only one that really worked for me. Like and sub.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      You are very welcome! Thank you for watching!

  • @tushkan4ik111
    @tushkan4ik111 2 месяца назад +1

    It worked! Thanks!

  • @Klaster_1
    @Klaster_1 2 месяца назад +2

    Thank you for the video, took me a while to figure it out, but I finally managed to get a decent generation improvement on my setup - to about 11 it/s in SD1.5 on 7900XTX. If others read this, try out the "--use-zluda" flag in stable-diffusion-webui-directml and SD.next do the patching for you and install the correct torch version - much easier this way.

    • @matthewfuller9760
      @matthewfuller9760 2 месяца назад

      how does that figure translate to time? I am guessing around an image every 5 or 10 seconds at lowish resolution?

    • @Klaster_1
      @Klaster_1 2 месяца назад

      @@matthewfuller9760 you multiply the it/s to the iteration count. That gives 2s for 20it of SD1.5 512x512 or 12s for SDXL base at 25 its 1024x1024. More if you swap models, i.e. if you run an SDXL refiner, but AFAIK that mostly depends on your SSD speed.

    • @erwins_arm
      @erwins_arm Месяц назад

      how do i install the correct torch version and get it installed into the right folder? complete newbie here and having issues

  • @DanDanceMotion
    @DanDanceMotion 2 месяца назад

    There were a lot of mess errors, but I finally succeeded
    Thank you!!

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Yea. It’s kind of crazy how many things say error and don’t matter. But it only takes one to wreck everything.

  • @LighthouseLeads
    @LighthouseLeads 2 месяца назад +3

    your the best. hope your family is all good

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Thank you so much! Family is getting there. My son has a lot of medical issues. So long road there. But thank you for asking! :)

  • @Mike-ss1ju
    @Mike-ss1ju 2 месяца назад

    Thank you so much for this. 7900xtx is finally worth it. I had to disable intigrated graphics in bios to get this to work. Excellent instructional video. This shit is crazy.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +3

      Ah yes. You could likely set it in windows variables I think it is hip gfx visible devices and then set it to 1 but it works disabling bios as well.

  • @finn9552
    @finn9552 18 дней назад

    Thank you, easy and good guide

  • @PSYCHOPATHiO
    @PSYCHOPATHiO 2 месяца назад +2

    Excuse my language... HOLY SHIT, This is good. I gave up on Windows & been on Linux for a while but now after testing ths on Windows... oooh i love u. I can finally utilize my 7900 XT to its potential. Thank you for the easy tutorial

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      I know right? It’s sooooo good! While it isn’t perfect. And I still want full rocm on windows. This is in my opinion a very reasonable not quite full rocm alternative finally!

    • @PSYCHOPATHiO
      @PSYCHOPATHiO 2 месяца назад

      Having to juggle between windows for gaming and Linux for AI was frustrating, but this just so fast, even more than when I was on Linux. Thanx for the work, as I'm sure I'm saying on behalf of the whole AMD community :)

  • @corneduplessis6337
    @corneduplessis6337 2 месяца назад +3

    I appreciate your content. Its so frustrating that it cant just work for AMD on windows like it does for Nvidia cards. Im hoping that'll change in the near future but for now I use my 3070 for SD and my 7800XT for gaming and I'm good with that

    • @featy2671
      @featy2671 2 месяца назад

      do u know how much it/s i should get with a rx 7800xt if i dont all right?

  • @fmenguy
    @fmenguy 2 месяца назад +1

    Thanks for your tutorials, they are really well explained.
    For others like me who have an old config:
    I tried, even though I knew very well that my gpu wasn't on the list. If you get this message: "rocBLAS error: Cannot read C:\Program Files\AMD\ROCm\5.7\bin\/rocblas/library/TensileLibrary.dat: No such file or directory for GPU" it's dead!

    • @tolly_HD
      @tolly_HD 2 месяца назад

      What exactly do you mean with its dead? I also get this error even tho I have an RX 7900 xtx which is most definitely completely supported

  • @Cessna-172
    @Cessna-172 2 месяца назад +1

    Such a tutorial has been waiting for a long time. Thank you so much for your service to the Amd community, which is so hated by the AI community

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +2

      You are welcome. I’m glad finally on windows something with relatively decent performance that seems to not be seriously lacking in something.

  • @udinmoklet
    @udinmoklet 2 месяца назад

    Thank you so much bro, it's working on RX 6700 XT!
    took 23 mins+ on first generation

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      You are very welcome! Thanks for watching :)

    • @joris2032
      @joris2032 2 месяца назад

      very nicee! can it generate fast now?

    • @udinmoklet
      @udinmoklet 2 месяца назад

      @@joris2032 well kinda fast, under 15 seconds maybe? depends on the resolution

    • @joris2032
      @joris2032 2 месяца назад

      @@udinmoklet sound okeay! I am trying to install it for my 6700xt aswell but de hip sdk isn't working for my card, im now trying an other version. 5.5.1

    • @udinmoklet
      @udinmoklet 2 месяца назад

      @@joris2032 there's extra steps that you have to do, read the documentation

  •  2 месяца назад +2

    Gracias , hasta ahora encuentro un tutorial funcional, funcionando con una RX6650XT . Saludos en español comprendo el ingles pero no tengo buena dicción. Gracias

  • @browse7288
    @browse7288 Месяц назад

    Holy shit it actually worked., big thanks man!

    • @FE-Engineer
      @FE-Engineer  Месяц назад

      😂😂 you are welcome. I’m glad it worked :). Thank you for watching!

  • @harrisonajones
    @harrisonajones 2 месяца назад

    Thank you soo much for this. I found it really helpful, especially considering that I am running on one of the RX 6XXX GPU's. In the end the only thing I found on stack overflow to get over the issue was to delete the venv folder and then run the webui-user.bat file. But after a reboot, it seems to be outputting black or white solid images again. Even after deleting that folder again. Could you think why this might be?

  • @jokinbv5715
    @jokinbv5715 2 месяца назад +1

    Thank you so much.
    10 images at 1024x1536 (Hires fix from 512x768) 7900XT
    With previous directml: 16min
    Now with Zluda: 5min 30s

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Whoah. That’s way better! Nice!

  • @darthilli
    @darthilli 2 месяца назад +7

    [WinError 126] The specified module could not be found. Error loading "C:\Users\___\ZLUDA\stable-diffusion-webui-directml\venv\lib\site-packages\torch\lib\cublas64_11.dll" or one of its dependencies. please help

    • @sujimayne
      @sujimayne 10 дней назад

      Just FYI, you can use a Windows variable %userprofile% to provide an actual full ksth that can be zsed in Windows without exposing your username.

    • @silvermoonk9121
      @silvermoonk9121 5 дней назад

      Make sure u copied the 2 files he mentioned and renamed them correctly.

  • @MugiwaraRuffy
    @MugiwaraRuffy Месяц назад

    Will take a look into it.

  • @phelix88
    @phelix88 2 месяца назад

    Thanks for the video! Got it up and running with barely any issues. Only question I have is about model training. Is that feature still only an Nvidia thing? When I try to create an embedding it seems to create one but the dropdown in the training tab doesn't seem to function. I also see errors referring to embeddings in command prompt.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      That is possible. Zluda seems to correctly translate a decent amount of cuda functionality correctly and accurately. But even as a translation layer it is definitely not a 1:1 map of all cuda functionality. It is more than reasonable to assume as you get into more complex cuda functionality that the translation layer may not function properly or accurately. Both would result in failures or crashes.
      You might try dreambooth. I have not tested. It might work? Likely you will run into the same or even more issues though. Hard to say without trying.

  • @kingyizzus4108
    @kingyizzus4108 3 дня назад +1

    Thank you very much for the detailed tutorial❤, but I have a little problem which is that the Karras type samplers do not appear. Any solution? 😢

  • @fabear4022
    @fabear4022 2 месяца назад

    Noice, works. The only thing different I did from this video is downloaded the latest version of zluda. It's slow though on RX 6700 XT 12GB. I guess my card isn't as good as I thought it was. At least it freaking works.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      I did change to the latest version. Overall I did not honestly see any noticeable difference. But for some it might provide a more noticeable change? Or perhaps it supports more cuda functions?

    • @fabear4022
      @fabear4022 2 месяца назад

      @@FE-Engineer Yes, it's about the functions. Everything I would like appears to work, as previously it would just break. And there definitely is a performance increase.

  • @user-yingshubo
    @user-yingshubo 2 месяца назад

    我一直用directml,看这个真的是太棒了,非常感谢作者,我竟然配置成功了!!!

  • @flooosee8437
    @flooosee8437 Месяц назад

    hey, im tryin to add a faceswap extension, but both i tried arent working. Is it because its a fork or the extensions are the issue ? ((the install seems complete, but nothing show up in the UI)ReActor and FaceSwapLab)

  • @ZeroIQ2
    @ZeroIQ2 2 месяца назад +1

    very cool, thank you

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Glad you liked it, thank you for watching! :)

  • @bigwinboy
    @bigwinboy 2 месяца назад

    Any news on ZLUDA support for stable diffusion webui forge? The latest layerdiffusion seems to only support forge.

  • @assitm5442
    @assitm5442 2 месяца назад

    for generating pics on amd gpu after all steps in your vid i must use optimised models or not?

  • @afilthyweeb8684
    @afilthyweeb8684 25 дней назад

    Damn you were not lying about that first run. I ended up at nearly 30 minutes

  • @jimmyjupanu
    @jimmyjupanu 2 месяца назад +3

    How to uninstall torch-2.2.0+cu121 and install torch-2.2.0+cu112 , i think that is my problem because when i run sd i run with cpu

  • @victorivanov5667
    @victorivanov5667 2 месяца назад

    Hey, thanks for the ongoing amazing videos, worked like a charm the first time, but after the 2nd try I get the skip torch cuda error ; adding the --skip-torch-cuda only results in an error several people in the comments are expieriencing.
    EDIT: Found the solution, had to open cmd in the zluda dir then navigate to the folder with the webui.bat and start it like in the video!

    • @tiago7063
      @tiago7063 2 месяца назад

      For me was that i didin't started zluda.exe or didn't open amd as admin, idk what solved

  • @jozopako
    @jozopako 21 день назад +1

    When installing with running user.bat file, it says error 1/2 no space left on device. I have 437GB free space.

  • @Karambolagemusic
    @Karambolagemusic Месяц назад +1

    RuntimeError: Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check.
    Any clues? Do I need to install another version of pytorch? If so, how? Thanks in advance!

  • @user-er7fe6ol4b
    @user-er7fe6ol4b 20 дней назад

    Hi. I ran your tutorial through ZLUDA 6700xt. are there any optimizing settings for SD mode of ZLUDA?

  • @MRrDoctorWho
    @MRrDoctorWho 27 дней назад

    Can u help???
    What is the problem? I have an RX6750 XT, installed libraries, tried different ways, the error does not go away. Either the Stable Diffusion defines the graphics card on the gfx90c architecture
    "RuntimeError: invalid argument to reset_peak_memory_stats"

  • @humbug1
    @humbug1 26 дней назад

    This tutorial seems to have started in the middle, what's with the command prompt? How do i get to that bit first?

  • @egarciamaz
    @egarciamaz 29 дней назад

    It works very well for me, thanks friend good job.

  • @Rich_Mr
    @Rich_Mr 2 месяца назад

    hey man thanks for all the help u've delivered. One thing, are you planing to use LM studio efficiently using ZLUDA to run LLMs locally?
    That would be great as when I was using linux, it worked fine but now I'm on windows and it doesn't work.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      I was not planning on it. But you are the second person to ask. I will spend some time on it over the weekend and see if I can get it running properly. No promises, it may be using cuda functions that are not supported through zluda.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      As another semi related side note. Supposedly amd has a build that is supposed to work with hip sdk I think for lm studio. I had no luck with getting that to work though. :-/

  • @ChrisBuffam
    @ChrisBuffam Месяц назад

    I'm having an issue during installing. I'm using Automatic1111 for DirectML, the latest version. When running the webui.bat it errors out with RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper_CUDA__index_select). I'm using XFX 7900XTX and Ryzen 7950XT.

  • @bigdeutsch5588
    @bigdeutsch5588 Месяц назад

    I'm very surface level with my understanding, does this remove the option for ONNX? I'm happy with the speeds but am curious about if its possible to optimize these models further for AMD? I've installed the ReActor extension which seems to want to call for the onnxruntime-gpu. Everything functions as it should (including ReActor) but I'm curious about if there is a way to increase speed further?

    • @FE-Engineer
      @FE-Engineer  Месяц назад +1

      Onnx has significant drawbacks.
      Zluda to my knowledge likely will not apply or work for onnx format. I don’t believe that onnx is necessarily unavailable it likely will not use zluda though.
      The onnx drawbacks were that speed is a bit better. Inpainting will not work. Sdxl will not work. And you may have to convert models which can be a little time consuming and has its own issues sometimes.

  • @Gwaboo
    @Gwaboo 2 месяца назад

    do you have a planed video about zluda with comfyui or are you able to give me a short rundown on how to get it working i only got directml working with comfyui i wuld be soo happy if you did this :)

  • @Rich_Mr
    @Rich_Mr 2 месяца назад

    BTW when is your SD next with Zluda video is dropping out? Just curious and waiting for it as I use SD for my social media.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Should be this weekend. Might have two. One for a semi updated guide for this one. It’s not really different just shorter since it now helps you to get the files setup properly. Probably also one on sd.next.
      And I might do one on comfyui. But that is still weird and very manual I believe. :-/

    • @Rich_Mr
      @Rich_Mr 2 месяца назад

      @@FE-Engineer yes personally I hate comfy UI, it's complex to work on for me.

  • @ChuckieChuck3677
    @ChuckieChuck3677 2 месяца назад

    i have a problem here. This method works fine with me UNTILL i restart my PC. And then the good old error "torch can't use this GPU" pop up again. I need to delete all the setup and start fresh again like this in order to use SD. Any solutions ?

  • @baheth3elmy16
    @baheth3elmy16 2 месяца назад

    Great tutorial! Thank you! Obviously my RX570 8GB is not supported at all :(

  • @philjancik4601
    @philjancik4601 Месяц назад

    im assuming for this to work your gpu compatability must be able to support the runtime and the HIP SDK right?

  • @DrivEDrivinginEurope
    @DrivEDrivinginEurope 2 месяца назад +3

    hi, I have this error after launching webui.bat to install everything:
    rocBLAS error: Cannot read C:\Program Files\AMD\ROCm\5.7\bin\/rocblas/library/TensileLibrary.dat: No such file or directory for GPU arch : gfx1036
    rocBLAS error: Could not initialize Tensile host:
    regex_error(error_backref): The expression contained an invalid back reference.
    Press any key to continue . . .
    Any idea what to do? Thanks for your help

    • @banned-user
      @banned-user 2 месяца назад

      same error

    • @banned-user
      @banned-user 2 месяца назад +2

      hey I just fixed it. disable your integrated gpu in device manager and wait a while as it loads and eventually downloads

    • @DrivEDrivinginEurope
      @DrivEDrivinginEurope 2 месяца назад

      @@banned-user thank you, I will try it later. I'm not too sure though how to disable the integrated graphics

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Can do it from bios for one.
      But you can also set it as an export variable for being used. By rocm and tell it to ignore the igpu

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      I can tell you something is wrong. See how slashes go from back slashes to forward slashes? And at one spot there is a backslash next to a forward slash? Look at your env variables and check to see if something is weird.

  • @nightjojobackstabber1833
    @nightjojobackstabber1833 2 месяца назад

    Thanks for the video.
    I'm able to get the webUI running, however, I get "RuntimeError: The NVIDIA driver on your system is too old (found version 8000)" when trying to generate image.
    Tried reinstalling HIP-SDK and python 3.10.6, didn't work.
    Any idea what could be the problem? Thank you very much.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      That is a really odd error. No one else that I have seen has gotten that error.
      Unfortunately I don’t know how to help here. Are you sure you got the zluda files, copied them over. Renamed them. And added everything to ENV properly?
      Try rebooting your computer if you have not.

    • @nightjojobackstabber1833
      @nightjojobackstabber1833 2 месяца назад

      @@FE-EngineerYes I'm pretty sure I followed every step as instructed and rebooted several times after adding those 2 ENV variables.
      Also, I had to add --skip-torch-cuda-test to even get the webui started, which is quite different from your video. Weird...
      Guess I'll have to stick to dual booting Linux for now.
      Nonetheless thank you very much for replying.

  • @canalpan9
    @canalpan9 2 месяца назад +1

    thank you so much

  • @cartoonworld1000
    @cartoonworld1000 2 месяца назад +1

    I just want to say thanks, it seems to be working on my 7900 XTX, I'm just wondering do you think we can use this in InvokeAI, I kind of like the layout of it and would love to use it on my AMD GPU. When you get the chance let me know if you think its possible.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      I can pretty definitively say for right now...on windows...I doubt you will get it to run with zluda.
      I spent multiple hours. cudnn is heavily used in here, and while it may entirely be possible, I have not figured out a good way to disable it entirely, and get it running, it is close, I just can not entirely get cudnn disabled, and it seems to be very woven into this program overall.

    • @cartoonworld1000
      @cartoonworld1000 2 месяца назад

      @@FE-EngineerI guess we'll either have to wait for zluda support or full rocm support on windows, correct?

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      That or if the devs decide to allow it and make a flag that disables cudnn.

  • @artymusoke1352
    @artymusoke1352 2 месяца назад +1

    am getting this runtime error - return torch._c._cuda_memory stats runtimeerror: invalid arguement to memory_allocated. ive left it to render and "nothing is happening" as you initially said. so maybe it will work.
    how do i degrade to torch 118?

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Not sure. I haven’t seen folks in comments getting that. What’s your GPU?

    • @davitmodebadze9707
      @davitmodebadze9707 Месяц назад

      @@FE-Engineer Hey, great video. I have 6900XT and everything works, but I'm also randomly getting "NansException: A tensor with all NaNs was produced in Unet" error. --no-half, --medvram don't seem to help.

    • @Vennnaya
      @Vennnaya 19 дней назад

      @@FE-Engineer Im getting this too, once i finally managed to navigate through all the steps that you skipped over in the video.

  • @Mozkiito
    @Mozkiito 2 месяца назад

    Thanks again for the tutorial!
    Is there any way to get extensions such as controlnet to work using zluda?

  • @agx4035
    @agx4035 25 дней назад +1

    will this hip sdk fuck with my adrenalin driver for gaming ?

  • @TheSnow.
    @TheSnow. 14 дней назад

    as a 7900 xtx owner i was getting so mad that i couldn't do any proper AI generation, bless you for your tutorials man. You are amazing, the true hero of AMD.
    but you should consider telling people about Compatibility with other models on the beginning of the video to be honest.

    • @FE-Engineer
      @FE-Engineer  12 дней назад

      That’s fair. I will try to include something at the beginning about this.

  • @matiosjed
    @matiosjed Месяц назад

    could you try to make AMD gpu get detected/work with Applio RVC (text-to-speech) ? mine program just says that it couldn't find nvdia gpu and starts on cpu. It used to work on AMD

  • @machaoverlord5925
    @machaoverlord5925 2 месяца назад

    where are you when I have amd -.- good job

  • @chaz-e
    @chaz-e 18 дней назад

    How fast is this compared to Olive approach?
    Zluda is not officially supported by AMD but they have partnered with Microsoft for Olive and other improvements.

  • @wedsg
    @wedsg 2 месяца назад

    do upscaler not use zluda? My speed dropped from it/s to s/it when using esrgan 7900xt I mean when high res fix is checked.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      I have not tested. It’s possible that specific upscale might not. That one has always been noticeably slower for me as well.

  • @displaytalk
    @displaytalk 2 месяца назад

    Hey, I'm trying to get a different program working called Flowframes(just to test cuda rife with zluda). When I open the program with zluda on my 5700xt(hip is installed, env variables added, it wouldn't even run the program through zluda without the env variables) it just says no nvidia gpu is detected(at the top it says 5700xt is detected). I go ahead with a cuda interpolation task anyway and when it comes time for interpolation to start it says no cuda gpu detected falling back to cpu(and it indeed falls back to cpu). Sorry this isn't really on topic to your video, I'm just not seeing any other videos demonstrating zluda with ai stuff. Thanks in advance if you have any ideas

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      Is the 5700xt supported by hip sdk?

    • @displaytalk
      @displaytalk 2 месяца назад

      @@FE-Engineer ahh my bad.. I didn't think 5700xt wouldn't be supported. Man, between this and AFMF not coming RDNA1 is effectively discontinued for new features already.

  • @AmosKito
    @AmosKito Месяц назад

    I got it working and ran plenty of models and then it stopped working the next day after PC sleep?? somehow cusparce deleted itself, very strange...

  • @MichiVersoTV
    @MichiVersoTV Месяц назад

    Any tips to reduce consumption with the 7900 XTX?

  • @unitrixbase5221
    @unitrixbase5221 Месяц назад

    Hi, I did everything the same as you, but the CPU still handles the generation of images. the speed is extremely low, the GPU is not used (in my pc rx6800)

  • @churumbel-ob6wb
    @churumbel-ob6wb 11 дней назад

    got it working, but i only have one question, is it normal that in the task manager the gpu its not geting any load?

    • @FE-Engineer
      @FE-Engineer  11 дней назад

      I don’t remember. That doesn’t sound like a good sign. Usually I use gpu-z for monitoring it.

  • @nenadm5747
    @nenadm5747 2 месяца назад

    Thank you for your effort for us Amd people 😁
    Can I just add Zluda to my currrent A1111 installation? I use directml for months, everything works, slow but works.
    Is there a chance to break something?

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      I think if you have a version that is up to date then yes, just add the use-zluda flag

    • @kobusdowney5291
      @kobusdowney5291 2 месяца назад +1

      Perhaps use --reinstall as well

  • @farzinv1053
    @farzinv1053 11 дней назад

    Expecting value: line 1 column 1 (char 0) what do i do for this error????????

  • @grosgato
    @grosgato 2 месяца назад

    I'm a newbie with stable ... but This workflow is ok withe the comfyui installation ? Or it is just for A111 ? Thanks for your help

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +2

      This is just for auto1111 directml. Comfyui is similar but different. I might make a video on that.

    • @grosgato
      @grosgato Месяц назад

      @@FE-Engineer ok great ... I search solutions to run comfyui with my bg amd card. Thank

  • @josephboanas9709
    @josephboanas9709 Месяц назад

    I have a radeon rx 580 series gpu, do you reckon that’s fine to have or would it not be compatible? It’s not listed on the gpu list but i wanna hold out some hope and ask you

    • @FE-Engineer
      @FE-Engineer  Месяц назад +1

      It is not supported by hip sdk. So sadly no.

    • @josephboanas9709
      @josephboanas9709 Месяц назад

      @@FE-Engineer alright, thank you very much

    • @forsensfishbox
      @forsensfishbox Месяц назад

      @@josephboanas9709 you can actually do this, the solution is listed in the bottom of the description

  • @ralphgerber1999
    @ralphgerber1999 2 месяца назад

    I'm getting stuck on this portion "ONNX: selected=CUDAExecutionProvider, available=['AzureExecutionProvider', 'CPUExecutionProvider']". It just doesn't move any further and ramps up CPU usage heavily for a long time. Edit: Actually it eventually moved on to load the Webui. That said the model took 374 seconds to load, with 240 of that being the empty prompt. Any ideas? 7950x with 6800xt and 128gb ddr6 in win 11 if that helps.

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      What kind of hard drives?
      That sounds odd I don’t think onnx should be selected at all.
      Check your env variables reboot. Something seems off to me.

  • @GamingAlliance1
    @GamingAlliance1 2 месяца назад

    I could only get it to run with CPU on my rx580 even followed the non supported gpus, I’ve been stuck with an older version of stable that can generate things but Inpainting is broken.

  • @bojanrajic
    @bojanrajic 2 месяца назад +2

    I can't seem to get more than 1-2it. I have 7900XT and Ryzen 9 5900X, M2, 64GB RAM.
    AMD-Software-PRO-Edition-23.Q4-Win10-Win11-For-HIP,
    Python 3.10.11., Git 2.43.0 windows 1, added all the paths as instructed.
    Is it possible that the difference between 7900XTX and 7900XT is 10x?

    • @Eminic112
      @Eminic112 2 месяца назад

      Can't be true, i've also noticed almost a 3X slowdown compared to running it in linux on my 6700 XT.

    • @bojanrajic
      @bojanrajic 2 месяца назад

      @@Eminic112 Everything is installed but i get 10x less iterations per second. I don't know what i am doing wrong.

    • @Eminic112
      @Eminic112 2 месяца назад +1

      @@bojanrajic It seems to be an issue others are having as well, me included. I honestly couldn't tell you the reason, i've tried so many things and my performance isn't anywhere near where it should be. We might just have to wait for an update.

    • @_TrueDesire_
      @_TrueDesire_ 2 месяца назад

      I thought Python 3.10.6 was the newest we could use? Newer breaks Torch.

    • @Eminic112
      @Eminic112 2 месяца назад

      @@_TrueDesire_ I'm using 3.10.6 exactly, and i'm having the exact same issue, so i don't think that has anything to do with it.

  • @carloscamargo6625
    @carloscamargo6625 2 месяца назад

    My torch folder don't contain this cublas file, even the latest zluda release is different, don't include the .exe file. Any chance to have an updated version of this tutorial with latest versions ??? Thanks for your tutorials, thanks to you im running Automatic1111 using directml with a 7800xt

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      You don’t need the exe file.
      I checked a few days ago and there had not been any updates to zluda that I saw. Even looking at the GitHub from the maker of zluda the latest release is 3. So what is different exactly?
      You are very welcome! Thank you for watching! :)

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Also I remember. If you don’t run we ui.bat first without use zluda on it. It won’t install everything. I think. I did it right in the video. But I do remember the first time you have to let it install everything otherwise as you said you won’t have all the files in the torch lib folder.

    • @carloscamargo6625
      @carloscamargo6625 2 месяца назад

      @@FE-EngineerOk i understand, then i need to start a fresh install for zluda use. Thanks a lot i go to try it asap. I hope we can do something like this with Fooocus. Thanks for share all this knowledge bro.

  • @banned-user
    @banned-user 2 месяца назад +1

    4:48 what did you do here? I git cloned then entered the dir and typed webui.bat then it gives me an error

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +2

      I did not do anything special -- I was using windows command prompt -- if you are getting an error, then I think you might have a python problem...or python is not added to path...

    • @banned-user
      @banned-user 2 месяца назад +1

      @@FE-Engineer Yep fixed up. installed latest python version and had to go to "manage app execution aliases" settings and disable it. Thanks

    • @zacharmento6551
      @zacharmento6551 2 месяца назад +3

      @@banned-userDid that get you past the "RuntimeError: Torch is not able to use GPU;"? That's what I'm getting for running webui.bat

    • @banned-user
      @banned-user 2 месяца назад

      @@zacharmento6551 I'm stuck on that part too after I run .\webui.bat -use-zluda

  • @MegaGranj
    @MegaGranj 2 месяца назад +1

    Do you have any plans to make a similar video for ComfiUI + Zluda ?

    • @FE-Engineer
      @FE-Engineer  2 месяца назад

      Possibly. It seems kinda sketchy as to whether it will work…and how well it might work. But I have been looking into it.

    • @MegaGranj
      @MegaGranj 2 месяца назад

      ​@@FE-Engineer I got ~3tx/s for 1024x1024 images on xl model on 7900xtx. After one day of testing it looks stable. I did about 200 generations and got 0 crashes yet.
      BTW i thought i've added a comment with link on Chinese site where was a video with complete guide how to do it. Probably it was deleted by spam filter, or by you :)

    • @MegaGranj
      @MegaGranj 2 месяца назад

      @@FE-Engineer Let me know If you interested, we can do call in discord, and I'll show how it works now 🙃

  • @avelardoblanco7324
    @avelardoblanco7324 2 месяца назад +1

    Hey I have a problem,
    I have a RX 7900 XT and I have ran through all the steps and am using the skip torch command along with zluda but I get an error saying RuntimeError: No CUDA GPUs are available
    It opens the webui but I cant generate anything because of the error.
    Any help would be appreciated 🙏

    • @jeromeboyer3401
      @jeromeboyer3401 2 месяца назад

      Same error here please help

    • @banned-user
      @banned-user 2 месяца назад +1

      @@jeromeboyer3401 you have not installed ZLUDA properly

    • @FE-Engineer
      @FE-Engineer  2 месяца назад +1

      As the other user mentioned you have missed a step or something.
      Didn’t install hip sdk? Didn’t get zluda setup? Didn’t copy the files? Didn’t change env? Hard to say. But you missed something.

    • @avelardoblanco7324
      @avelardoblanco7324 2 месяца назад +1

      @@jeromeboyer3401
      Hey I think i figured it out. Its currently on the step that takes really long time but i finally got it to get rid of the No CUDA GPUS are avilable. I just had to delete all of the old nvidia programs I had in Control Panel since I upgraded from an old Nvidia card to a new AMD one. Thats probably why it recognized the Nvidia and tried to search for a gpu. Hope this helps.