Midjourney has COMPETITION & it's FREE/Open Source - Deepfloyd IF AI Art Model

Поделиться
HTML-код
  • Опубликовано: 9 ноя 2024

Комментарии • 565

  • @MattVidPro
    @MattVidPro  Год назад +38

    Deepfloyd IF might win against Midjourney. I still want to do further testing. I prefer an image model that incorporates all aspects of the prompt first and foremost, while MJ might be clearer or more aesthetic, it often ignores parts of my prompt entirely.

    • @juandeaton5692
      @juandeaton5692 Год назад

      Yes let's do a compare!

    • @michaspringphul
      @michaspringphul Год назад +1

      if it is open source, why a user has to identify itself for just joining the webpage to download and use it?

    • @jams2u786
      @jams2u786 Год назад +4

      The agreement says "Non-Commercial Use", does this mean generated imagery may NOT be used in anyway for business purposes?

    • @wykydytron
      @wykydytron Год назад

      ​@@jams2u786 yes, probably commercial license will be paid. They do need to make money somehow tho we could argue that ai generated art is not owned by anyone as it was created by tool.

    • @peterbelanger4094
      @peterbelanger4094 Год назад +1

      Except it's a total pain to use. It's not totally local like A1111. What is this crazy 'notebook" stuff? It's all tangled up in Huggingface. It's like half-software, some coding required.

  • @GraveUypo
    @GraveUypo Год назад +192

    i love that open source is not getting left behind

    • @spinninglink
      @spinninglink Год назад +37

      We ALL need to support open source. That's going to be imperative for the "normal people" to have ai as powerful as these corporations.

    • @SubLuminary
      @SubLuminary Год назад +4

      Absolutely

    • @arnowisp6244
      @arnowisp6244 Год назад +4

      And so that the Artist Can't sue it into Collapse.

    • @jd2161
      @jd2161 Год назад +1

      ​@@spinninglink yup

    • @dik9091
      @dik9091 Год назад

      this is fake opensource, tying to attract free programmers that will test their sht for free. Check the license you cannot even publicaly use the images. Check the restrictions of the license. Same as facebook fake open source and openAI fake opensource and microsoft fake open source. Only google offers true unrestricted open source as a giant.

  • @GregMatoga
    @GregMatoga Год назад +70

    I can't believe this just came out after I spent 12 hours in front of a computer trying to keep up with ai tools. Insane progress. Thanks for the video!

    • @ReligionAndMaterialismDebunked
      @ReligionAndMaterialismDebunked Год назад

      :p True, cherry-picked, like what debunked religious and debunked material atheists do. Lol. I use that term pretty often.

    • @ReligionAndMaterialismDebunked
      @ReligionAndMaterialismDebunked Год назад

      :p Awesome! Yes, words have sucked so bad with regular AI, and I've tried several sites, and several apps. Hehe. This'll be fantastic for selling them, and should be good for consistent images that you want similae for children's books (what I have written by AI, but no images for it due to inconsistency), and other things.

    • @davidstar2362
      @davidstar2362 Год назад

      Great Job keep up with the good work.

  • @MrGTAmodsgerman
    @MrGTAmodsgerman Год назад +33

    I just tested it on Huggingface and i already noticed some other major difference, and that is generate images of cars. It does a way better job from the start then SD. It looks way more realistic there to generate a car in terms of design.

    • @wykydytron
      @wykydytron Год назад +3

      SD had very little car pictures fed to it, you can tell it very quickly but if you use correct Lora for model of car you want it works very well.

  • @Arc_Soma2639
    @Arc_Soma2639 Год назад +73

    Absolutely, it's impressive to say the least, a huge milestone for the open source community.

    • @dik9091
      @dik9091 Год назад +1

      no it is not, it is a milestone for the company that made it. Check the license

    • @WeirdSmellyMan
      @WeirdSmellyMan Год назад

      ​@@dik9091 yes it is.

    • @dik9091
      @dik9091 Год назад

      @@WeirdSmellyMan how is it open when you have to sign with name and email? That is by definition not open, how can you see that differently? My email and name have value that I have to trade in for a crappy license, uh no.

    • @WeirdSmellyMan
      @WeirdSmellyMan Год назад +1

      @@dik9091 I didn't have to sign in at all.

    • @vytah
      @vytah Год назад +2

      @@WeirdSmellyMan the license prohibits commercial use, copying and creating derivatives, three core requirements of being open source

  • @sullivan3004
    @sullivan3004 Год назад +13

    Remember your vid on Floyd way back, happy to see it release. Open-source AI is really the way to go!

  • @clouds2593
    @clouds2593 Год назад +53

    Midjourney will just implement Deepfloyd to their code and charge for it.

    • @damien2198
      @damien2198 Год назад +15

      They cannot Deepflyod is not really opensource license, just a shitty one for non-commercial pupose/research with tons of restrictions

    • @finlayfisken9817
      @finlayfisken9817 Год назад +4

      Midjourney uses their own codebase, that's why v5 is so much better than SD

    • @electricz3045
      @electricz3045 Год назад +5

      ​@@damien2198 it's still opensource. The menacing of opensource is, that the sourcode is public and can be read by anybody, the licence used dint chnsge anything in the fact that the code is opensource. Also who controls if you sell the AI images produced by this AI? There isn't any true indicator so if someone ask you, you can just say it's made with stable diffusion and so you are allowed to sell it.

    • @damien2198
      @damien2198 Год назад +1

      @@electricz3045 you can read it but cannot do shit with it unless for research purpose with lot of limitation. not opensource

    • @damien2198
      @damien2198 Год назад

      @@electricz3045 you cannot do anything with it, even run if not for research purposes, and you are limited on what do can even run it for. not opensource.

  • @Blastmaster321
    @Blastmaster321 Год назад +91

    You are such a great source of AI news Matt, thank you

    • @ReligionAndMaterialismDebunked
      @ReligionAndMaterialismDebunked Год назад

      :3 True, cherry-picked, like what debunked religious and debunked material atheists do. Lol. I use that term pretty often.

    • @ReligionAndMaterialismDebunked
      @ReligionAndMaterialismDebunked Год назад

      :3 Awesome! Yes, words have sucked so bad with regular AI, and I've tried several sites, and several apps. Hehe. This'll be fantastic for selling them, and should be good for consistent images that you want similae for children's books (what I have written by AI, but no images for it due to inconsistency), and other things.

    • @katieotoole8784
      @katieotoole8784 Год назад

      Hey Matt, hope you are great. Keep it coming, you put out absolutely awesome videos. I feel like I will never get to where I want to be, before I'm 3/4's in the gave, LOL.

    • @katieotoole8784
      @katieotoole8784 Год назад

      Oh shit, I sent the first comment to ask you why I can't get into DeepFloyd AI? Thank you Matt😅

  • @mattrusingmail
    @mattrusingmail Год назад +46

    Unlike Stable Diffusion this model is only non-commercial use so definitely won’t have the same dev excitement

    • @Dereliction2
      @Dereliction2 Год назад +7

      Oooh, good catch.

    • @seemlessartcreations
      @seemlessartcreations Год назад +8

      Noob here, does this mean that you can't take the pictures and sell them or using them as book covers or on print on demand things?

    • @aarohanyt7374
      @aarohanyt7374 Год назад +1

      @@seemlessartcreations yea

    • @aarohanyt7374
      @aarohanyt7374 Год назад

      @@anonymous49125 no as of now. you can sell your images. Laws are not put into place rn

    • @YoungBlaze
      @YoungBlaze Год назад

      ​@@aarohanyt7374 prove it

  • @nahiddotai
    @nahiddotai Год назад +8

    The text output is definitely very impressive, and some of those output images aren't bad either. I think consistently Midjourney will still provide images that are on average better. Although, these new AI art generators will be better at a specific thing over Midjourney i.e. text, inpainting etc. Very keen to get started with this!

  • @tonyaidinis4396
    @tonyaidinis4396 Год назад +6

    Many many thanks for the heads up on this! I just tried a few prompts on hugging face. Indeed, there is nothing like it in terms of handling text. However, the imagery it produces is way way behind midjourney or SD. In producing text, it often misses letters or misplaces them, but still that is way more than others can do. Let's give it time...

  • @zaiologyy
    @zaiologyy Год назад +9

    I love how AI news is happening so fast that by the time you finished the video there's already new updates to the topic lol Thx for doing what you do, you're the best !!

  • @bgill7475
    @bgill7475 Год назад +24

    The boot image with that text is a reference to Venus in Furs by The Velvet Underground which came out in 1967.

    • @Tony_Baloney_69420
      @Tony_Baloney_69420 Год назад

      Uh, thanks.....

    • @bgill7475
      @bgill7475 Год назад

      @@Tony_Baloney_69420 You’re welcome.

    • @DJWESG1
      @DJWESG1 Год назад

      That image stood put for me too. I suppose it's heavily influenced by floyed throughout.

  • @bofuuu
    @bofuuu Год назад +19

    This is looking amazing. I can’t wait to see what the future holds for AI.

    • @umzino1
      @umzino1 Год назад +1

      World domination 😈

  • @potts995
    @potts995 Год назад +10

    This is great, thanks for keeping us all updated! Hope you’re feeling better!

  • @Janizzary
    @Janizzary Год назад +5

    3:04 They're called meerkats. They're related to mongooses.

  • @IceMetalPunk
    @IceMetalPunk Год назад +14

    I've been playing with the HuggingFaces demo and... I'm not really impressed? It's better at text than other models, for sure, but... maybe there's some settings that aren't right, but the upscaled images are often distorted. Especially of human faces... even with things like "ugly, distorted, monster" in the negative prompt, the resulting faces range from mildly awkward to nightmare fuel...

    • @kaleyjanenigh
      @kaleyjanenigh Год назад +1

      ..."with things like 'ugly, distorted, monster' in the negaqtive prompt" 😂😂😂 That was fking hilarious--and I totally agree. I'm playing with it for creating mockups for my design business (yes, I know, no commercial use yet, I'm just testing), and my prompt was "a 24 year old girl with blonde hair wearing a Bella Canvas tshirt with a boho background". It was still kind of impressive, but those god damned eyes will eat away at my soul.

  • @theodoregorgovelt2963
    @theodoregorgovelt2963 Год назад +12

    This is IMPRESSIVE, this can literally help us makes T-shirts, Logos, and much much much more! Such a great news from you, matt! TYSM

    • @Fontgod
      @Fontgod Год назад +1

      For non commercial use only by the looks of things. I wonder how that might apply to making a logo/tshirt design for your own business?

    • @LivingTithe
      @LivingTithe Год назад

      ​@@Fontgod wondering how they could actually inforce that since images can't be copyrighted?

  • @void2258
    @void2258 Год назад +4

    This is cool but even more of a let down for the general user that the vram is so high when Nvidia (your only option for windows users) refuses to give any reasonable amount for non-ludicrous prices.

  • @CreativePunk5555
    @CreativePunk5555 Год назад +2

    Claiming it beats MJ is a massive stretch. I tried it and honestly it feels like MJ version 1 or 2 at best. But again, it's just the beginning and I don't expect anyone to launch something out of the gate that can compete with MJ at this time.

    • @Storygospel533
      @Storygospel533 Год назад +3

      You can tell he barely has a clue about MJ. That or he's getting kickbacks with all those affiliate links, clear embellishment, and out of place enthusiasm.
      With that being said, I hope this tool improves down the road because I love that it's open-source. In the meantime it's a candle to a flame

  • @Ege_E
    @Ege_E Год назад +1

    I like how AI made an accidental sarcasm there by putting a gun inside the cover of heart shaped box even it's not specified on that short prompt in 13:54

  • @karenreddy
    @karenreddy Год назад +6

    It can spell, but MJ's quality and flexibility with V5 is pretty amazing.
    Maybe Dreambooth us what takes DF there

    • @jopansmark
      @jopansmark Год назад

      Wait, MJ has flexibility? It literally has no inpainting, outpainting, training, embeddings and many more basic features

    • @karenreddy
      @karenreddy Год назад

      @@jopansmark That is true, but it's model is far more capable.
      For example, try to go make a well designed and very aesthetically pleasing horror monster in Stable Diffusion. It will take you quite a long time, if at all, unless someone comes up with a new trained model which contains relevant information.
      MJ's V5 can handle a lot of these monsters fairly easily, and is flexible enough to mix it well with other concepts.
      Inpainting, embeddings and other tools are powerful extensions of the base model, but are still limited to the initial training.

  • @wielandsmith
    @wielandsmith Год назад +2

    Thanks Matt! I can’t wait to check this out!

  • @mawungeteye657
    @mawungeteye657 Год назад +3

    I'm willing to bet controlnet is the solution for near perfect text generation like almost on every prompt
    Cant wait to see someone apply the nvidia video ai method with this

  • @zendao7967
    @zendao7967 Год назад +3

    Ok now I'm just waiting for someone to incorporate it into A1111

  • @DeGameBox_SRBT
    @DeGameBox_SRBT Год назад +2

    I am so glad that some open source program has defeated a big company program that takes a lot of money for its project

  • @drew5564
    @drew5564 Год назад +1

    I came here expecting something similar to bluewillow AI, but you have blown my mind out the waters again matt.

  • @chariots8x230
    @chariots8x230 Год назад +10

    This is very nice. But I really want an AI that will allow me to work with my own original characters and pose them together. I wish I could use it to make a comic. So, consistent characters & backgrounds, as well as posing, are all very important features for an AI to have.

    • @rabanal_josh64
      @rabanal_josh64 Год назад +2

      Maybe we could see that in the upcoming months. But yeah, consistency is something creators want

    • @chariots8x230
      @chariots8x230 Год назад +2

      @@rabanal_josh64 Hopefully, we will see this feature soon. Consistent characters, as well as posing of our consistent characters, would be a very important feature.

    • @williamwoghiren2441
      @williamwoghiren2441 Год назад +2

      You can actually use the hyperlink to a preferred image in midjourney as a reference for any prompts you afterwards, essentially making a character you previously generated as the referenced character in future prompts, u just need to include the hyperlink in prompt and - - the image stock number

  • @Brismo7
    @Brismo7 Год назад

    3:25 but the main thing, is the hands and how well done they are.

  • @chyrek.ambient
    @chyrek.ambient Год назад +3

    DeepFloyd IF generated a real Lamborghini for me, brought my ex wife back to life and cured my cancer. Thank you DeepFloyd 🎉

  • @chrislannon
    @chrislannon Год назад +10

    I'm excited AF for DeepFloyd IF.

  • @michaelbone6894
    @michaelbone6894 Год назад +12

    Darn, that is really good for a base model. Mostly solved text. The other major weakness of these models is getting multiple characters/things to interact with each other in a believable way. Is this model any better at that than stable diffusion?

    • @Morbuto
      @Morbuto Год назад +2

      Better, yes, by quite a bit. Probably better at that than MJ.

    • @Athari-P
      @Athari-P Год назад +4

      There's node editor for StableDiffusion which allows manual composition using multiple generations, Z-buffer, vector poses and the rest of crazy tooling. But if you wantt something super complex from just one prompt, then no, it doesn't exist yet. I suspect it may be possible to train a model to create setups for this node editor, but I don't know whether anyone has tried.

    • @tetsuooshima832
      @tetsuooshima832 Год назад

      @@Athari-P You're talking about ComfyUI

  • @emmasnow29
    @emmasnow29 Год назад +2

    Cool, especially with text. Steep VRAM requirements though.

  • @BestCosmologist
    @BestCosmologist Год назад +1

    We're going to be able upscale old videos to 4k on the fly very soon. All videos will be high resolution.

    • @Athari-P
      @Athari-P Год назад

      Topaz Video Enhance AI seems to remain the best video upscaler at the moment, and I haven't heard about Topaz making a good progress for quite some time. And Diffussion models are too slow and inconsistent for video, so the current progress in generative models doesn't improve the state of video upscaling.

  • @spooky4655
    @spooky4655 Год назад

    This is literally the case of the first stable diffusion model, its expensive to run, but look at it now. thanks to open source!

  • @carlkenner4581
    @carlkenner4581 Год назад +4

    Amazing!
    But can it draw hands?

  • @MONTY-YTNOM
    @MONTY-YTNOM Год назад +9

    This BEATS Midjourney ? we can't tell if we can't use it :)

    • @MattVidPro
      @MattVidPro  Год назад +7

      You CAN use it! Link below!

  • @WhiskeyBlack777
    @WhiskeyBlack777 Год назад +1

    OMG. The freaking Doc boot with the Venus In Furs lyrics #PunksNotDead lol

  • @jopansmark
    @jopansmark Год назад

    Peak of science! That's beautiful! Imagine paying 30$/month when people get these images for free(almost, at least without this subscription based business model(I hate it))

  • @XRedDemon27
    @XRedDemon27 Год назад +2

    First to do text well, but midjourneys image quality and creativity is better imo

  • @iamgroot4063
    @iamgroot4063 Год назад +2

    Never say never lol I have heard that Midjourney is working on lettering. DF is pretty incredible but I still give MJ the crown as far as quality. Based on what I have seen of DF so far.

  • @juliodiaz7778
    @juliodiaz7778 Год назад +3

    I honestly believe that its clear the ai padora box has been opened and cant be closed. We better get on as a community and build our ai or its gonna be over for use. We got two future ahead of us, under the boot of corporate ai's, or we balance the field and live in megaman battle network world. Shiiieeet, imma start working on my megaman.exe

  • @renarddubois940
    @renarddubois940 Год назад +3

    My favorite is still Bing create, it has the best colors, best composition, best ability to understand prompts, I think it's the best

    • @sunlight8299
      @sunlight8299 Год назад +2

      It has a LOT of restrictions and limited usage

  • @renesysval
    @renesysval Год назад

    Love your channel, I tell others about it. One tip though, can you warm up your video portions of yourself? It’s so bright on our TV’s, it washes you out.

  • @CryptobdSchool
    @CryptobdSchool Год назад +2

    I personally think BlueWillow is far better than others free AI. BW is 100% free tool. Though BlueWillow is so early stage. I experimented with BW, and I'm really amazed by the results

    • @cosmicaudio4589
      @cosmicaudio4589 Год назад

      I use BW a fair bit and for free it is excellent. I would just hope they include more features like Midjourney. If Midjourney opened up for free it would runaway with the lead out of the two, I have just cancelled my subscription to MJ because there are other AI doing a too similar job for free, Midjourney will die behind a paywall! I think of Blue Willow and Midjourney as a sort of VHS - Betamax battle, I just not sure which one is which right now!

    • @jopansmark
      @jopansmark Год назад +1

      Nah, it's not open source so I don't trust it. Also, is that bots? Like I seen two exact same comment about Blue Willow from two accounts. Really shady

  • @Cl0udEater
    @Cl0udEater Год назад

    "No I don't have a gun" is a line from "Come as you are'' by Nirvana. Misha and I are both fans, it seems!

  • @themanual5619
    @themanual5619 Год назад +2

    So far, it definitely beats it when it comes to cohesiveness, but I don't believe DeepFloyd can beat Midjourney when it comes to making characters, yet.

    • @jopansmark
      @jopansmark Год назад

      It beats it when you need to generate Xi Jinping, lol

  • @MerinLightbringer
    @MerinLightbringer Год назад +18

    bro, you are pumping out video after video after video, you are probably an ai yourself ;D love your videos, love your enthusiasm, love your excitiment and your energy. Do you mind me asking if you use dark mode on Twitter or everywhere possible? Your videos are so bright^^

    • @MattVidPro
      @MattVidPro  Год назад +11

      I will have to try dark mode… thanks for the kind words!

  • @mikeyc8139
    @mikeyc8139 Год назад +2

    Either I'm doing something wrong or I'm expecting something different. It does well at words but that's about it. People are severely distorted with messed up faces, arms coming out of the middle of their chest, etc. Typing any prompt with a known person like "Walter White" produces half of the results matching some random person and the ones that look like WW are distorted. Other than text, this looks like a very early AI text-to-image generator. Like in the realm of the old craiyon.

    • @flameshana9
      @flameshana9 Год назад

      It's to be expected. Videos like this are always baseless hype. "I've used it before and it's amazing." Okay where's the proof? And where's the prompts to go with it? They always lie and show cherry picked results.
      Easy way to know if a video is clickbait: check if it has Midjourney in the title.

  • @HardstyleCastle
    @HardstyleCastle Год назад

    The quality is incredible. I am going to use it for future thumbnails on my channel!

  • @spookybuk
    @spookybuk Год назад +1

    From what I could understand of their license agreement, you can't do any commercial use with anything made in it. That means, can't use it as a book or album cover, etc. That sucks :(

  • @vi6ddarkking
    @vi6ddarkking Год назад +2

    I honestly am interested in the possibility of incorporating this new gen image generation models with the exist A1111 and comfyUI tools.

  • @artofficialintelligence6663
    @artofficialintelligence6663 Год назад +3

    Do you know how many times I've heard "this beats midjourney" in the last couple of months?

    • @MattVidPro
      @MattVidPro  Год назад +4

      I’m serious about this. I don’t toss that around

    • @Athari-P
      @Athari-P Год назад +3

      Well, DeepFloyd definitely beats Midjourney is some aspects and in some prompts. The same is true for StableDiffusion, especially specialized fine-tunes. The only problem is that Midjourney still looks better in 80% of cases. :) And doesn't look like DeepFloyd would be able to beat Midjourney with its heavy reliance on upscaling (4x + 4x), as textures are ruined. But it's "modular", so the IF-I-XL is a huge progress either way.

  • @jeffwads
    @jeffwads Год назад

    The most impressive thing aside from the text is that image of the 5 meerkats with different colored sweaters.

  • @rproctor83
    @rproctor83 Год назад

    Finally I can ask the AI for some photscans of the uncensored Roswell documents.

  • @fnorgen
    @fnorgen Год назад +4

    Oh no! If the community really gets into this I will have to buy a 4090. My eyes have grown hypersensitive to the typical SD coherence problems that have never been properly solved, and I just don't find it very fun unless I can run my models locally. But man! I am super impressed that something this advanced can run on consumer hardware at all. It looks extremely promising!

    • @UltraK420
      @UltraK420 Год назад

      I happen to already have a 4090 in my PC, so if that's what it takes to run AI models like this locally with good performance then I may as well go ahead and do it. How convenient.

    • @razoraz
      @razoraz Год назад

      Or you could buy a Mac with Apple Silicon with the requisite amount of ram. The Video ram is shared with regular ram so much bigger than you'd get on Intel video cards by default. M1&M2 also have ML cores that contribute to FAST rendering for some models. I've got an M1 Pro with 16gb ram which can do many open source calculations. The M2 Max would be the one to consider today if 24+GB video ram is required.

  • @Vartazian360
    @Vartazian360 Год назад +6

    This looks like a really good model but it looks like it has inconsistent lighting across some of the generated images. Looks kinda like photoshop jobs with the ability to add text where as midjourney csnnot do text but still looks like it is producing more consistency and realism in its generation. Very powerful tool but so far ill stick to midjourney v5. Also missing aspect ratio options and such

  • @BlackPenguinStudios1972
    @BlackPenguinStudios1972 Год назад +1

    Thank you for all the great information, thanks for keeping us all updated

  • @Mulnader
    @Mulnader Год назад +1

    Looks like proper prompt generations but with a poor aesthetics. It should be great as a starting point for text2img in PF and then move it in to SD or MJ for img2img

  • @ReligionAndMaterialismDebunked
    @ReligionAndMaterialismDebunked Год назад +1

    Awesome! Yes, words have sucked so bad with regular AI, and I've tried several sites, and several apps. Hehe. This'll be fantastic for selling them, and should be good for consistent images that you want similae for children's books (what I have written by AI, but no images for it due to inconsistency), and other things.

  • @erikprestonTV
    @erikprestonTV Год назад

    Of all the image generators I have used. I think Bing Image Creator generates the best images so far.

  • @MuralidharanJayaram
    @MuralidharanJayaram Год назад +1

    Can the images be used for promoting Social media content? I heard that you cannot use the images for commercial use.

  • @IPutFishInAWashingMachine
    @IPutFishInAWashingMachine Год назад +6

    FINALLY!!! IVE WAITED MONTHS FOR THIS!!!

    • @MrGTAmodsgerman
      @MrGTAmodsgerman Год назад

      You could have used an Automatic1111 extension for SD Text generation

  • @brazilforreal1
    @brazilforreal1 Год назад +3

    Wow! This almost looks too good to be true !

  • @digidope
    @digidope Год назад +1

    My RTX 3090 has 24gb VRAM. Let's get back to this when it's integrated to A1111

  • @mattr9613
    @mattr9613 Год назад

    From what I understand from the agreement is that all there images are subject to copywrite and your not allowed to change them either. sounds like a lot of future litigations.

  • @ReflinWulf
    @ReflinWulf Год назад

    I have been waiting for this for so long that I was seriously considering that it might never come out

  • @garnishstudio3567
    @garnishstudio3567 Год назад +1

    This looks very exciting! Is Deepfloyd able to be run on a Mac as well as PC with dedicated graphics card? Technically the M1 Max should be capable enough, but has anyone tried setting this up on a Mac yet?

  • @erichunter1737
    @erichunter1737 Год назад +1

    Looks great but I wouldn’t switch until this is more user friendly then I’d try it out.

  • @RSV9
    @RSV9 Год назад +2

    Can it be downloaded to the computer to use it with A1111 or do you need another webui ? And if so, which file should be downloaded ?

  • @burkhardstackelberg1203
    @burkhardstackelberg1203 Год назад

    I wait for the day strong open source multimodal models become bidirectional 😍 - so you can input text, getting a rendering of your prompt (including written text), and input an image, getting a description (including the text written in the image)...

  • @mrburns366
    @mrburns366 Год назад +1

    the AI powered NPCs are the thing I'm most excited about. it's really going to change gaming dramatically

    • @wykydytron
      @wykydytron Год назад

      Nah, it will be used mainly to add voiceovers to games that had none, that's about it. Modern games will probably not make use of it aside from maybe randomly generating chatter that will be then censored/curated by devs and placed in game as they do right now but with more variety. If you think anyone is going to put something like chatgtp in game so you can talk with npcs it's not gonna happen with current hardware not mention it had to be very limited to topics only related to that game world resulting in pretty much same thing we have right now, tree of few choices. Tech demos and projects made for fun do not reflect realities of commercial products.

  • @philhartten6273
    @philhartten6273 Год назад

    Yoo Matt.....thank you so MUCH for all this cool incredible info...you da best!!! Phil

  • @unironicallydel7527
    @unironicallydel7527 Год назад +2

    The king in my book is and will continue to be PixAI. Its free, uncensored, has a ton of img gen models, and they're still updating the site. with the release of AnythingV5, I dont think anything can top it. These other img gen model devs heavily censor their shit to the point they arent worth using imo.

    • @friendlyvimana
      @friendlyvimana Год назад

      Is this one censored?(which includes not generating famous people photos)

    • @deathorb
      @deathorb Год назад

      thanks for that!!!
      Not quite free really but good

    • @unironicallydel7527
      @unironicallydel7527 Год назад +3

      @@deathorb Its entirely free. You have to mess with it, but it is free. Make sure to have it on Low priority and only generating 1 img at a time. Mess around with sampling sizes, etc. The Credits are ONLY for generating faster. You cant buy them, and you get 10k free daily.

    • @ShawnFumo
      @ShawnFumo Год назад

      Just keep in mind it is unlikely to stay free long-term, just because of the high costs involved in running any of these sites. But we're definitely getting a lot of options lately. Mage is another that is pretty open about what can be generated, though it does cost to use anything but the default SD models.

    • @unironicallydel7527
      @unironicallydel7527 Год назад +2

      @@ShawnFumo Thats why I have a $25 NovelAI sub as a backup. Its not near as good for images, but still uncensored and you can tune it to produce some amazing images, just not as consistently. And for the price, its very much worth it. Not only do you get unlimited img generation, but also unlimited story generation, and the chatbot they use is fairly good. Im surprised PixAI has been around this long while being free, right now it has no monetization model. But, given how their updating it, we might see one sometime soon.

  • @chrislloyd1734
    @chrislloyd1734 Год назад +3

    Super excited until you mention 16GB GPU. Can it reduce down to 8GB using half accuracy?

  • @dwainmorris7854
    @dwainmorris7854 Год назад +1

    Yes but can it do accurate celebrity likenesses or consistant character design unlike Mid journey

  • @spinninglink
    @spinninglink Год назад +1

    If i woulda known this when getting a video card, i would have opted for one with more vram lol

  • @MarcoCholo-iz9js
    @MarcoCholo-iz9js Год назад

    All these websites will pale in comparison with what Adobe has up its sleeve. I'm talking about Adobe Firefly. It will do to AI art generation, what Photoshop did for the digital art industry back in the day. Can hardly wait

  • @spiritofgivings
    @spiritofgivings Год назад

    Hey Matt, thanks for this but unfortunately using huggingface only produces small very blurry pictures...the whole page doesn't look at all like what is presented in your video. Is there a different link that you forgot to include? I followed what you put the description. I wish I could show you a screenshot of wat I see.

  • @billgrey
    @billgrey Год назад

    Amazing! This is going to be great. I appreciate your finding and showing all this. Slow down on the scrolling past everything, though! We're trying to see the prompts, too! 🙂

  • @sunlight8299
    @sunlight8299 Год назад +1

    I am v.excited about an open version of Midjourney that will quickly become superior. I'm waiting for all AI/AGI to run on mobile phones but I am considering getting a laptop 💻 just so that i can play with AI art. Any suggestions of something affordable are welcome 😅😊

    • @flameshana9
      @flameshana9 Год назад

      You need a decent gpu to run AI locally, so a laptop isn't going to cut it. Better to just get a cheap used computer and plop in a gpu like the 3060.

    • @razoraz
      @razoraz Год назад

      A recent model of iPhone or iPad will run "Draw Things" which is based on the open source Stable Diffusion model. It runs VERY fast on these handheld powerhouses due to Apple's machine learning cores built into recent years of Apple Silicon. PC fanboys have no idea how equal these tiny platforms are to their very expensive Nvidia GPUs in comparison.
      ~!

    • @flameshana9
      @flameshana9 Год назад

      @@razoraz So you're saying iPhones and iPads have a reputation for being affordable?

  • @sythekkkk
    @sythekkkk Год назад

    first ai image generator that can generate words without mistakes

  • @Darksagan
    @Darksagan Год назад +1

    I am super interested in the upscaling. But confused asf on how to do it.

  • @dubshaman
    @dubshaman Год назад

    Excited!

  • @joseluispcr
    @joseluispcr Год назад

    unfortunutly you can use the tool for reasearch use only, no comercial use allowed

  • @Tarheb
    @Tarheb Год назад +1

    bro, midjourney is light years ahead of this...

  • @typingcat
    @typingcat Год назад +2

    Once again, VRAM is the barrier. I started to feel the limitation of VRAM when I started using Blender and how GPU companies are basically robbing consumers with VRAM. But that was when there were only few consumers who needed a lot of VRAM. I nope this growing A.I. image popularity will make more consumers force GPU companies to add more VRAM by boycotting GPU's with smaller VRAM.

    • @flameshana9
      @flameshana9 Год назад

      If only there was a company who said "Hmm, maybe if we make cheap GPUs with tons of vram we'll sell a lot!"
      I'd buy a gpu with 24gb of ram that can't game in a heartbeat. Same with AV1 encoding. They don't sell these things on their own for a reason though. They want you to pony up for a high end gpu instead.

  • @analogtransmissions
    @analogtransmissions Год назад +2

    If it is open source Midjourney will soon implement it just like they did stable diffusion.

    • @Cola-42
      @Cola-42 Год назад

      A wise man once said👆

    • @vytah
      @vytah Год назад

      It's not open source

  • @Leto2ndAtreides
    @Leto2ndAtreides Год назад +5

    Deep Floyd has better accuracy - isn't necessarily as nice looking as MJ5... Since MJ5 uses human feedback to get closer to what humans will think is cool.
    Since it's opensource, and since Midjourney has its own datasets, they may well be able to finetune this to produce MJ style images.

  • @slowcreep6978
    @slowcreep6978 Год назад

    The Venus in Furs reference. Nice!

  • @DivineMisterAdVentures
    @DivineMisterAdVentures Год назад

    AWESOMEY - except if you see the letters IF and it's not I.F. - it's probably "if" not I-F.

  • @hugoruix_yt995
    @hugoruix_yt995 Год назад +2

    Looks really cool! Will it beat sd though?... The darth vader prompt looked really promising

  • @markusbiewer9153
    @markusbiewer9153 Год назад +3

    The meme potential.

  • @cryptosanity361
    @cryptosanity361 Год назад +2

    I can’t get an image to load . Just a blank square with an X in it

  • @carlt.8266
    @carlt.8266 Год назад

    3:30 it‘s more about being able to draw hands and even hands drawing hands, than someone drawing a drawing, I‘d say.

  • @klaustrussel
    @klaustrussel Год назад

    YES!! This is going to be extremely useful 🧙‍♂

  • @Pepsiisgood
    @Pepsiisgood Год назад +1

    Please make a video about the Vits AI (where you can make songs using artists voice 👀)

  • @DivineMisterAdVentures
    @DivineMisterAdVentures Год назад

    "IF" by Rudyard Kipling? "If you can keep your head, when all around you are losing theirs...."

  • @ventonthorn3455
    @ventonthorn3455 Год назад

    Very cool, but will it remove unwanted text?
    MJ gives me a headache when it throws garbage text all over an image where it doesn't belong.

    • @MistaRopa-
      @MistaRopa- Год назад

      Try clip drop by Stability Ai

  • @aidenlinz8422
    @aidenlinz8422 Год назад

    DALL·E 2 is the sapling. The sapling grew.