Stable Video AI Just Got Supercharged! - For Free!

  • Published: Feb 17, 2024
  • ❤️ Check out Lambda here and sign up for their GPU Cloud: lambdalabs.com/papers
    📝 The paper "MotionCtrl: A Unified and Flexible Motion Controller for Video Generation" is available here:
    wzhouxiff.github.io/projects/...
    Try it out: huggingface.co/spaces/Tencent...
    huggingface.co/spaces/Tencent...
    It is also open source - run it locally:
    github.com/TencentARC/MotionCtrl
    📝 My latest paper on simulations that look almost like reality is available for free here:
    rdcu.be/cWPfD
    Or this is the orig. Nature Physics link with clickable citations:
    www.nature.com/articles/s4156...
    🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
    Alex Balfanz, Alex Haro, B Shang, Benji Rabhan, Bret Brizzee, Gaston Ingaramo, Gordon Child, Jace O'Brien, John Le, Kyle Davis, Lukas Biewald, Martin, Michael Albrecht, Michael Tedder, Owen Skarpness, Richard Putra Iskandar, Richard Sundvall, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Tybie Fitzhugh, Ueli Gallizzi.
    If you wish to appear here or pick up other perks, click here: / twominutepapers
    Thumbnail background design: Felícia Zsolnai-Fehér - felicia.hu
    Károly Zsolnai-Fehér's research works: cg.tuwien.ac.at/~zsolnai/
    Twitter: / twominutepapers
  • Science

Comments • 431

  • @TwoMinutePapers 3 months ago +512

    Nothing is as good as Sora, however, this is something that we can all try right now. So cool!

    • @nox5555 3 months ago +50

      Well, we don't really know how good Sora is because it's not public.

    • @ivanleon6164 3 months ago +8

      Sora has tons of power behind it; it's not that much ahead of the rest.

    • @DavidSaintloth 3 months ago +33

      @@nox5555, nonsense. The live prompting examples that Sam gave on Twitter provided all the proof one needs to conclude that it is state of the art by far.
      The examples demonstrated leading temporal consistency, sequence length, resolution and preservation of fine details, as well as more accurate physical modeling (fewer hallucinated fingers & hands, better physics between light & material). As a researcher in the space, I see it demonstrating a clear state of the art in several dimensions without needing to be available to the public.

    • @manofsan 3 months ago +2

      How do we try this stuff? Does Stability AI have any code samples, or something we can download?

    • @alexdoan273 3 months ago +2

      @@nox5555 on the other hand, it's not public because it's almost identical to real video. They need to put restrictions in.

  • @jdchannelviewer 3 months ago +924

    Sora jumped about 4 papers. Everyone else is going to have to release anything they've been holding back and triple their efforts for further breakthroughs.

    • @chanpasadopolska 3 months ago +144

      Yeah, but Sora is owned by a private company. Stable Diffusion, on the other hand, is open source, which means it benefits everyone, not only one corporation and its clients.

    • @dvl973 3 months ago +33

      @@chanpasadopolska and if it can't keep up it will be left behind.

    • @devrimarslan5053 3 months ago +5

      @@chanpasadopolska How can I get educated about Stable Diffusion? Are there any beginner-friendly courses?

    • @Ghettofinger 3 months ago +14

      @@chanpasadopolska I only care about results. This is good I guess to make things that are censored by companies, but otherwise, I only care about what gives me what I want, not whether it's open-source.

    • @jdchannelviewer 3 months ago

      @@chanpasadopolska yet it's way ahead.

  • @aidencoder 3 months ago +116

    I love that the AI thinks shutterstock watermarks are part of our world

    • @doyourownresearch7297 3 months ago +6

      those content copyright games and stock images. My god, that is exactly why I love AI.

    • @mvmlego1212 3 months ago +2

      It's hands-down proof of Elon Musk's claim that these companies have violated copyright laws to train their models.

    • @gabrielv.4358 3 months ago

      @@mvmlego1212 I don't care

    • @user-my3sp4oi4r 3 months ago

      @@mvmlego1212 Who cares

    • @mvmlego1212 3 months ago

      @@user-my3sp4oi4r -- ...presumably, the visual artists who will never work again because a company stole their artwork to create a contraption that will put them out of business.

  • @Siranoxz 3 months ago +278

    It's very encouraging to see other AI models being improved despite the Sora breakthrough.

    • @Iswimandrun 3 months ago +24

      Sora works but is being kept to a limited customer base to protect against misuse. Open source will get there eventually, but better networking stacks for training need to be adopted to scale to this level of problem.

    • @ykwtfgo 3 months ago +24

      @@Iswimandrun they’re not only limiting it to prevent misuse, it’s for $ too

    • @Metarig 3 months ago

      @@Iswimandrun
      In history, open-source products rarely achieve the same level of success as commercial, closed-source products. This is because open-source often means it's created by people who don't make money from it. And in this world, you need money to live. It's money that unlocks your full potential.

    • @bahshas 3 months ago +7

      @@ykwtfgo tbf the government would shut them down if they didn't do their will

    • @JackCrossSama 3 months ago +15

      Don't worry, open source takes a while to catch up, but it will.

  • @ClayMann 3 months ago +183

    I'm all for supporting the competition. We need a vibrant range of companies all competing. No one wants a Sora monopoly and the dire consequences that come with that over time. Come on, little AIs, you can do it!

    • @OhioNPC911 3 months ago +1

      Mf do something with yr life, try to create art yourself

    • @chillsoft 3 months ago +16

      I don't think you comprehend what Sora is. It is racks upon racks of H100s; if you had that horsepower you could have Sora at home rn. But no one does, so only ClosedAI has it for now.

    • @adrianmunevar654 3 months ago +8

      Sora is boring, look at all those restrictions. When they release it, well, dumb people will be excited, but what kind of interesting things will they do with such a restricted model? 🥱
      Stability AI has lots of money, tons. They're already figuring out their next move. As Emad said, they're cooking something...

    • @OhioNPC911 3 months ago

      Where is my comment?

    • @jerbear7952 3 months ago

      @@OhioNPC911 YouTube and everyone you know is out to get you

  • @PHIplaytesting 3 months ago +38

    This paper is more about the amount of control that is able to be expressed in the output rather than simply the "quality" of the output (which Sora clearly exceeds). It's a great demonstration of the new types of things we'll be able to do with this technology as it develops.

    • @adrianfiedler3520 3 months ago +3

      Exactly, in SVD1 you had no control over any movements and it was trial and error. Now there is much more control over what should happen in the video. I'm sure quality will also improve significantly in the future.

    • @moritz584 3 months ago +3

      Yes. Sora has different capabilities.

    • @moritz584 3 months ago +1

      @@adrianfiedler3520 can you imagine what we’ll be able to control just two more papers down the line?

  • @117translyrics 3 months ago +26

    people forget that Sora isn't available right now and was revealed early to counter Gemini 1.5, and that Stable Video is completely open-source. people also forget that OpenAI has Microsoft's financial backing and that Stability AI is a start-up company. this is incredibly promising news because we don't need to run prompts/queries through OpenAI's API to get something like what is in the video.

    • @jacobnunya808 3 months ago +2

      All these smaller AI companies will probably be eventually gobbled up by the bigger ones. The smaller ones won't be able to keep up and the bigger ones will want more talent.

    • @moritz584 3 months ago +9

      Well said. It’s also amazing to see promising competition because we do not want a Microsoft monopoly on AI. Also worth noting, as Károly mentioned in another comment: what’s new here is something that Sora can’t do, which is fine controllability of the objects and the camera in a video.

    • @117translyrics 3 months ago +1

      @@jacobnunya808 unlikely for Stability AI, as their vision directly clashes with OpenAI's. together they would probably make something fantastic, but i don't think current leadership at Stability would stand for it

    • @117translyrics 3 months ago +1

      @@moritz584 exactly. it flew under the radar, but 6 days ago stable diffusion cascade was released. it brings it up to par to midjourney, which costs USD per month pre-tax if you dont want people to see what you are doing, with multiple features not found in openAI's DALL-E. stable video and cascade are both MASSIVE for people who do not want a monopoly from corporations that want to endear ONLY to the mass public for profits

  • @albertsitoe7340 3 months ago +44

    It’s very impressive what they’ve done, but the shamelessness of the Shutterstock watermark is insane 😂

  • @vanjavicko20 3 months ago +14

    Seeing the Shutterstock logo is funny because I remember when older video AIs also did that, like that one where Will Smith ate spaghetti.

  • @Dp-dx3zu 3 months ago +73

    I remember when ai interpolation for higher fps was groundbreaking

    • @jacobnunya808 3 months ago +4

      I mean 3x higher fps was pretty cool. Made ray tracing practical.

    • @dnsjtoh 3 months ago +4

      It kinda is groundbreaking. But it also kinda sucks. You can notice the latency, especially in some games. I don’t use it in The Finals, because it’s awful

    • @MisterPerson-fk1tx 3 months ago +4

      I remember when AI had to cheat to beat you in games.

  • @ZeroControl 3 months ago +10

    All this shit is about to fuck us all up.

    • @jacobnunya808 3 months ago +1

      Will save companies a lot of money with special effects.

  • @TheCynicalNihilist 3 months ago +6

    This channel is the Nostradamus of the tech world. I've been watching for years, and everything that has been shown always comes to fruition: in games, video, and AI. Obviously these aren't predictions but current research that gets used eventually. I just don't know any other channel that accurately shows the future of tech like Two Minute Papers.

    • @Anttisinstrumentals 3 months ago

      What if I told you there is no doctor Károly Zsolnai-Fehér. It was a clever name AI chose.

  • @errorhostnotfound1165 3 months ago +9

    4:07 funny how the generated image has the shutterstock watermark :P
    I guess the people who made the ai didn't want to pay for a bunch of stock images

  • @gabrielv.4358 3 months ago

    THANK YOU for making this video. I was hopeless in trying to find a free, updated version of AI text-to-video.

  • @Acehalo2 3 months ago +2

    This may not be as technically impressive as "Open"AI Sora, but for one, it's still early days, and two (more importantly) it's freely accessible and here now! I am unimpressed with Sora solely because it's going to be "cool kids club only" material where we uneducated peasant classes will never get access to tech like that. It might as well be movie hologram technology in my mind. Sora gives me a "Huh. Cool. I guess..." feeling, quite honestly.
    I'm glad the open source community is making leaps and bounds ahead for this technology. :) I wish them nothing but success in bringing technology to the everyman! Thank you for covering this!

  • @multiverse-republic 3 months ago

    actually very valuable video. We all scrolled through Sora and forgot about the other projects. Thanks bro ❤

  • @fynnjackson2298 3 months ago +5

    Open source will inevitably be as good as private. As AI steps into ChatGPT 7-8, it will be used to develop open-source clones and open-source video models.

    • @dr.emmettbrown7183 3 months ago +2

      That is not necessarily true if enormous "open-source" computing power is not available.

  • @mithrillis 3 months ago

    This is great. I think having direct camera and object control is more important than trying to understand the same command in text. For people seriously trying to get a video scene they need, knowing the model will nearly deterministically follow your order is much better than "suggesting" the model to do the same and hoping it works.

  • @ickaruus4909 3 months ago +1

    It's so good that it's open source. Big companies having a monopoly on these incredible world-changing technologies would be an even bigger problem than it already is.

  • @_spartan11796 3 months ago +118

    When we gonna be able to revive old cancelled animated shows with this tech?

    • @pandoraeeris7860 3 months ago +42

      2025.

    • @soulsmith4787 3 months ago +56

      "Hey machine, please generate Firefly season 2. Thank you."

    • @JohnKerrashVirgo 3 months ago +2

      Never, the corps will paywall it

    • @kinsley7777 3 months ago +4

      @@soulsmith4787
      I’m with you …
      no idea why it didn’t last longer …

    • @incription 3 months ago +30

      @@JohnKerrashVirgo how they gonna paywall open source? lmao

  • @swordofkings128 3 months ago +2

    1:40 actually I believe the correct terms for some of those camera motions are pedestal up/down and truck left/right.

  • @TroyRubert 3 months ago +75

    It feels like the singularity got significantly closer.

    • @Scratchfan321 3 months ago +10

      we have mere seconds

    • @mito._ 3 months ago +4

      Can't wait 🎉

    • @21EC 3 months ago +7

      🤣 I also believe a mini AI singularity is taking place now. It's so crazy that just a few hours later Stable Video AI is releasing this more advanced model of theirs. It feels like this AI revolution is getting out of control, getting faster, crazier and more advanced with each day/hour that passes.

    • @hydrohasspoken6227 3 months ago +6

      not even close.

    • @spooderderg4077 3 months ago +1

      Singularity: The single infinitesimal point of mass of a black hole where not even light can escape where time is effectively frozen by the sheer force of gravity (also would kill people who touched it).
      AI bros: this sounds like the word I want to use.

  • @torarinvik4920 3 months ago

    The accent and enthusiasm of Dr. Feher make the videos 3 times better! I held on to my papers!

  • @Amin2k 3 months ago +13

    The speed at which this is developing is scary

  • @iBerry420 3 months ago

    Such an incredibly fast race between all the AI projects! It's exciting and scary. Wow.

  • @HCforLife1 3 months ago +2

    Text-to-video at the moment is where we were with DALL-E 2 and Midjourney v1-2. Wait a year or two...

  • @boltvanderhuge8711 3 months ago

    It's all about accurate and highly granular segmentation, which luckily is one of those things that can use its own output to improve itself

  • @ethzero 3 months ago +1

    As I've said many a time, this'll all be just a forgotten about part of a Holodeck one day, but how cool is it that we get to see this technology emerge *today* 😊

  • @prunabluepepper 3 months ago +11

    Noooooo, your video is only 21 minutes old and the Hugging Face webpage is already too busy 😭

  • @oshapermadi 3 months ago +54

    Was this video recorded before Sora? You don't mention Sora at all, doctor.

    • @TwoMinutePapers 3 months ago +144

      You are indeed right, my apologies! Right as I was done making this one, Sora appeared and I could not believe my eyes. Luckily, this innovates in a different direction (controllability) and is free so I think it is a fantastic value proposition to show it to you Fellow Scholars now.

    • @oshapermadi 3 months ago +22

      @@TwoMinutePapers That's completely fine. I was just wondering why you didn't mention Sora at all. You're right, this paper has its own innovation. Thank you for delivering this paper to us fellow scholars 😁

    • @volkerengels5298 3 months ago

      Europe likes to have their own A-Bomb security. "What a time to be alive" @@TwoMinutePapers

  • @RandomGuy-hi2jm 3 months ago +25

    What a time to be alive

  • @chanpasadopolska 3 months ago +3

    How to have it locally on Mac? Is there something like DiffusionBee for image generating?

  • @channelname7859 3 months ago +1

    To be fair, finetuning 1.5 (and now SDXL) by the community led to insane improvements in image diffusion, so I assume the same can be said for video diffusion.

  • @albertstarfield 3 months ago

    Yes! What a time to be alive

  • @hotrodhunk7389 3 months ago +17

    If you showed me this last week I'd be so impressed. But after seeing Sora...

    • @bifrostbeberast3246 3 months ago

      Well, how many ppl currently have access to Sora? And how many people have access to Stable Diffusion?

  • @jakekeltoncrafts 3 months ago

    The walls are a massive upgrade. Samwise was wise to hire you for a redecorating!
    I love lore stuff like the reflective pool. If only we had half slabs of glass that you could walk on but put stuff like end rods and sculk under. We need more blocks, Minecraft!
    Maybe the ender city needs a temple to give chorus fruit sacrifices to their moon goddess?

  • @mikosoft 3 months ago

    The cats and zebras walk by cloning their legs tho :D

  • @Quick_VFX 3 months ago +1

    From my understanding, Sora generates Unreal Engine scripts that then generate the images and video, hence no weird warping etc.

  • @danlivas 3 months ago

    Thanks Ren

  • @Awesomlypossom 3 months ago

    Imagine giving a whole comic book to this ai and having it animate it. Cool

  • @CeapaCoolOfficial 3 months ago +1

    AI is advancing so fast this technology got surpassed before this video was even posted

  • @DIProgan 3 months ago

    It's funny to think of how valuable this channel will be as a historic document of AI

  • @galenspring8019 3 months ago

    What is your linguistic origin? Such a unique and consistent rhythm and cadence

  • @Konanan 3 months ago +22

    Soon you'll be able to feed a novel into a prompt and ask it to make a feature movie out of it. Imagine that.

    • @MrMsschwing 3 months ago +4

      put the bible as prompt! ...will be pegi18 for sure ^^

    • @jerbear7952 3 months ago +2

      Are you a kid?

    • @aylameridian 3 months ago +4

      So no one gets to enjoy the process of actually making the film? Sounds incredibly boring and depressing to me... I really hope that's not our future...

    • @blacknoir2404 3 months ago +1

      What I really want is to have brand new episodes of a TV series that is no longer made

    • @MrMsschwing 3 months ago +2

      @@aylameridian that's not true. Whoever wants to film in traditional ways can still do so. It's just an additional way of creation.

  • @miroaja1951 3 months ago

    The Shutterstock logo on the outputs kills me lol

  • @Monstah7 3 months ago

    What a time to be alive..👍

  • @ramlozz8368 3 months ago +2

    Sora is on another level; the way it’s able to create simulations of the real world is 🤯 I think OpenAI is using a totally different approach to training their new models. I wouldn’t be surprised if they are using Unreal Engine to teach the model an understanding of 3D and light. They just need to teach the model cause and effect and it will be perfect 😅

    • @user-hl7lr8ld2i 3 months ago

      you can read their paper on Sora

    • @jopansmark 3 months ago

      The difference between Sora and Tencent SVD is that Tencent SVD actually exists and is not a scam from a dying startup

  • @user-lm4nk1zk9y 3 months ago +1

    Two (or) more papers down the line we will have video output from generated high-detailed 3D worlds

  • @lobabobloblaw 3 months ago

    I think the trick to SORA is that it has an autonomous GPT agent governing the diffusion process on a minute scale.

  • @a.thiago3842 3 months ago +1

    Now one thing came to mind. In the old times, whenever something new came to the market, it would cost a liver to have at home. But now, I can just download it and use it the way I want to. I just need to wait a few seconds. If that's not amazing, nothing else can be.
    We just need to be wary of technology bombardment, because the more we see things, the less strange and less amazing they might get. And I don't want to feel this way. It's like if we had a teleportation machine: after a few months or years, it wouldn't amaze you the same way anymore.

  • @DanFrederiksen 3 months ago

    It's an interesting question whether AI should generate into a traditional Euclidean 3D CAD space and render, or whether it should stay in a pure 'live' neural space. I think I have the answer, actually.

  • @GoelWCS 3 months ago

    We are entering the era of quantum papers: both the last one and 2 papers behind the last one! This is going so fast!

  • @smetljesm2276 3 months ago

    Controllability engineer = cameraman of the future

  • @eyal.herlin 3 months ago

    Two Minute Papers bringing Slashdotting back into fashion.

  • @odw32 3 months ago +2

    I think there's a huge need for open models, or at the very least "open weight" self-hostable models.
    While it's incredibly cool what OpenAI (and Midjourney, Google, etc) are doing -- We need products which work in a datacenter of your own choosing, or even locally on your own consumer graphics cards. Especially when you want to combine image, video and LLMs with potentially sensitive customer data, it is essential that we can take security measures appropriate for the use case.

    • @jerbear7952 3 months ago

      Are you even following along with what's going on with local models

    • @flingyourself 3 months ago

      @@jerbear7952 what’s going on?

  • @mbadpa 3 months ago

    I can imagine a future where we put in a prompt, and out comes a complete world that we can explore using the camera.

    • @jerbear7952 3 months ago +1

      That didn't take your entire imagination did it?

  • @SaintMatthieuSimard 3 months ago

    The application I am looking for is to enhance the realism of 3D scenes that I make myself without creating anything new but only giving a perfect color grading and perfect shades. Could that work?

  • @Zanroff 3 months ago +21

    "Pan Up, Pan Down" kills me as a camera man.

    • @Tyrone-Ward 3 months ago

      What is it then?

    • @moritz584 3 months ago +2

      @@Tyrone-Ward panning would be changing the angle, I think; what this is doing is moving linearly along one axis. I guess you’d call that move up/down

    • @bendichter4116 3 months ago +6

      @@Tyrone-Ward In film lingo you "pan" left/right and "tilt" up/down

    • @Zanroff 3 months ago +4

      @@Tyrone-Ward Tilt up, Tilt down

    • @john_hunter_ 3 months ago +3

      But you're a camera man. They can't die.

  • @Mark73 3 months ago

    I can't wait to see this used to make a Bad Apple video.

  • @GraveUypo 3 months ago

    This is what i want. Free models i can run on my computer.

  • @KillerMZE 3 months ago +1

    That shutterstock watermark is an easy loss in court

  • @Greenthum6 3 months ago +1

    SVD is for research only, so it is not the same as free. Since you cannot monetize it, its use is fairly limited. Hopefully Stability AI will bring us a commercial license for video soon.

  • @tauheedulali2652 3 months ago +1

    It's great these tools exist, but there needs to be a new file format specifically for AI generated video which forms the entire video using a new type of encoded pixel called AI pixels or an AI based vector file format for video or images. That would make it clear when any piece of video content is created or derived from AI generated content as these tools become widely adopted. Since each pixel is an AI generated pixel type, it would not be possible to remove the indicator that this was an AI generated file because each pixel is indicated as computer generated.

  • @lobabobloblaw 3 months ago

    Well, doc, it appears we’re too late already; the demo is definitely functioning like the ticket sales portal for a David Bowie resurrection tour.

  • @zerosiii 3 months ago

    Seems you made this video before the Sora one :D

  • @MineAnimator 3 months ago

    I'm usually impressed by what is presented here, but since I've already seen Sora, the interesting thing about these papers is that they are accessible

  • @poldiderbus3330 3 months ago +1

    From my point of view, it's just insane what's happening right now. It's happening so quickly that people who thought they had found a new income and could build a business with software for a feature find themselves a month later in a situation where everything is obsolete. Not to mention the large number of people who haven't even got rid of the habits from the Stone Age. It's fascinating, yes, but I think we're heading into a time that's even darker than we previously thought. I almost wish that an independent super AGI would take over as soon as possible...🙈

  • @2001DavidBowman 3 months ago +1

    Damn, imagine being an artist

  • @yoverale 3 months ago +3

    Shutterstock won’t like it

  • @joaodecarvalho7012 3 months ago

    This acceleration looks like the proximity of the singularity.

  • @dr.emmettbrown7183 3 months ago +1

    With SORA out there this seems like news from a year ago.

  • @Parasmunt 3 months ago +1

    This technology is working out like VR, miles of potential but unrealised or inaccessible.

  • @lemonke8132 3 months ago +3

    To be honest all of these videos about ai video on this channel feel the same. Can't even tell the difference any more.

  • @boriswilsoncreations 3 months ago

    First Sora and then this. I really want to become a professional animator someday. I hope AI doesn't take away the career of my dreams from me, otherwise I don't know what I would do with my life. It's so impressive and depressing at the same time.

  • @vectoralphaAI 3 months ago +2

    What this tells me is just how far advanced OpenAI's Sora truly is. This video basically shows a new state-of-the-art paper, but here comes Sora on a literal 'nother level than this.

  • @devxsadik 3 months ago +2

    At this rate, Unsplash, Shutterstock and other stock image, video sites are gonna go bankrupt 😂😂😂😂

  • @allensmith9062 3 months ago

    I'm waiting for the day I can upload an entire book of my choice and then generate an entire movie.

  • @makeitraindom1634 3 months ago +5

    Do you speak like that because you are the first AI that made a YouTube channel on its own?
    Or because you translate and then read?
    (I'm 100% respectful and serious)

    • @NutrejaSFD 3 months ago +6

      It's his accent, he's not an AI.

    • @johndank2209 3 months ago +1

      @@NutrejaSFD LMAO

    • @TheUltraMinebox 3 months ago +3

      He's been on the platform since long before ChatGPT got announced, he's legit

    • @makeitraindom1634 3 months ago +1

      @@NutrejaSFD no it's not just the accent it's also the way he speaks like he says every sentence the first time in his life, smartass

    • @BlackoutGootraxian 3 months ago +1

      @@makeitraindom1634 The Hungarian accent is like that, and his is quite strong. I am Hungarian myself, so I know how it sounds. He is not an AI.

  • @jayaybe1 3 months ago +1

    What a time to be alive! 😀

    • @aegisgfx 3 months ago

      I suspect you won't be saying that 3 years from now and nobody has any work. The entire film industry is about to lay off everybody, the entire gaming industry is already in the process of laying off everybody as is tech sector laying off hundreds of thousands of people. Can anyone explain to me why this is a good thing??

    • @jayaybe1 3 months ago

      @@aegisgfx I was just humorously referencing the uploader's catchphrase; it wasn't meant to be a treatise on the future of human civilisation.

    • @aegisgfx 3 months ago

      @@jayaybe1 I'm aware of that. Regardless, we will all be starving in a few years while openai will be worth 80 trillion dollars. All of this makes no sense

    • @DeceptiveRealities 3 months ago

      @@aegisgfx I think you are being somewhat over the top, but yes, there are some serious problems coming. Software engineers will be first to go as the code output is already fantastic (I am using GPT-4 on a project right now - sure, it makes mistakes, but it is correct 8 times out of 10). Then it will be the turn of the creative industries - first writers and photographers, then film makers, followed by actors and singers. There is going to have to be a massive shift in how we think of work and whether we need a universal basic income implemented. So, not quite the disaster you seem to suggest, but we are in for a rough and scary ride.

    • @jayaybe1
      @jayaybe1 3 months ago

      @@aegisgfx Seriously, I do share your concerns. Governments cannot and will not let that happen. They'd be strung up from lampposts. I'm no communist but the wealth will have to come from somewhere to pay for the 80% unemployed or whatever it is.
      Maybe governments will seize control of AI companies citing national security and redistribute the wealth through universal basic income. Who knows?
      I don't know if you follow David Shapiro's channel but he is excellent. Not just bringing AI news but also looking at the philosophical and practical outcomes for civilisation. Please check him out.
      All the best 🙂

  • @xyzero1682
    @xyzero1682 3 months ago +1

    That Shutterstock watermark is gonna get this killed.

  • @MikkoRantalainen
    @MikkoRantalainen 3 months ago

    Just yesterday I watched the video about Sora, and it underlines how much better Sora is right now. The difference, however, is that Sora is locked in a lab and we can actually use this one.

  • @IndyStry
    @IndyStry 3 months ago

    lol all those Shutterstock watermarks in there. :D

  • @Plafintarr
    @Plafintarr 3 months ago

    Commence the scholarly stampede!

  • @apoage
    @apoage 3 months ago

    Holy s**t, that escalated fast

  • @cogitoergocogito5032
    @cogitoergocogito5032 3 months ago

    Did anyone get this to work? The local version is not working because of dependency errors (some modules are not even available), and with the API I get code errors.

  • @nodelayfordays8083
    @nodelayfordays8083 3 months ago

    Just a few more problems and puzzle pieces to fit together and we have an AI rendering engine

  • @UtraVioletDreams
    @UtraVioletDreams 3 months ago

    Good progress in their paper. However, Sora seems superior! Are there any benefits to using this technique over Sora?

    • @favesongslist
      @favesongslist 3 months ago +3

      It is available now for free.

    • @tuxxyy1
      @tuxxyy1 3 months ago +2

      In the comments it says he produced this video before Sora was announced, so that's why there's no comparison

  • @DigitalXrisXros
    @DigitalXrisXros 3 months ago

    Hello 👋 I just saw that the Amazon Fire Stick has some kind of AI ambient background generator

  • @SamuelHauptmannvanDam
    @SamuelHauptmannvanDam 3 months ago

    But when can I give it a video and have it give its version of it? That's the real mind blower. Sora showed it can do that.

  • @The_CGA
    @The_CGA 3 months ago

    It’s with no small irony that these are not zoom-ins; they are “dolly in” or “dolly out” in camera-movement speak

  • @andywillers87
    @andywillers87 3 months ago +1

    Woop!

  • @SurferXGhost1293
    @SurferXGhost1293 3 months ago

    What is a paper???

  • @d34d10ck
    @d34d10ck 3 months ago

    You know technology is moving fast when current models are already way better than the last video you produced on the subject.

  • @Thatmemeguylols
    @Thatmemeguylols 3 months ago +3

    Sora's leap forward means we're all stepping up our game - time to unleash our potential and push even harder for groundbreaking discoveries!

  • @NalonB
    @NalonB 3 months ago

    Anyone got a link to run this kind of stuff?

  • @good_deeds_always_get_punished
    @good_deeds_always_get_punished 3 months ago +1

    Now more stock images and videos in company presentations.

    • @tuxxyy1
      @tuxxyy1 3 months ago

      This'll be my biggest use. My corporate blogs are gonna start looking awesome lol

  • @SpahGaming
    @SpahGaming 3 months ago +1

    cool

  • @Jay-rr6me
    @Jay-rr6me 3 months ago

    At this point I am convinced AGI will happen in about 5 years

  • @inbox0000
    @inbox0000 3 months ago

    "scholarly stampede" 😁

  • @luke.rayman
    @luke.rayman 3 months ago

    Was it all trained on Shutterstock images? 😄