OpenAI's Sora: Text-to-Video AI is a World Simulator?!

Поделиться
HTML-код
  • Опубликовано: 2 фев 2025

Комментарии • 129

  • @madushandissanayake96
    @madushandissanayake96 11 месяцев назад +71

    Imagine we are living in a simulation created by Advanced AI nearly 14 billion years ago using a text prompt.

    • @Alice_Fumo
      @Alice_Fumo 11 месяцев назад +19

      And it's all part of an AI research project to enhance capabilities and the only reason our instance hasn't been pruned is because it shows promise in developing more advanced hardware designs than what this universe runs on.

    • @markmuller7962
      @markmuller7962 11 месяцев назад +9

      That time could be 2 seconds or something

    • @FaultyTwo
      @FaultyTwo 11 месяцев назад +18

      "generate me a planet called earth, inhabited by evolved monkeys, warmongers, sentient, milky way, green scenario, future dystopia."
      "... This does not seem to compute."
      "just do it"
      "Fine."

    • @whyflovnes9565
      @whyflovnes9565 11 месяцев назад +11

      and its just someone's term paper

    • @thevalarauka101
      @thevalarauka101 11 месяцев назад +3

      basically Voltaire's short story Plato's Dream

  • @technicalmaster4054
    @technicalmaster4054 11 месяцев назад +34

    The most interesting thing to me is that it actually keeps getting better with more compute. Imagine what a future model with much more compute will be able to simulate with this level of progress. If this continues, we might soon be simulating chemical reactions, cellular processes, rigid and soft body dynamics and so much more.

    • @martiddy
      @martiddy 11 месяцев назад +8

      I mean, Alpha Fold already can simulate proteins more accurate than any previous methods and also hundreds of times faster too. This will eventually spread to other science areas too

    • @anak_kucing101
      @anak_kucing101 11 месяцев назад

      ​@@martiddy So it means no more testing in animals? 😮

    • @martiddy
      @martiddy 11 месяцев назад +5

      @@anak_kucing101 Unless we can simulate the whole human body down to the molecular level, I'm afraid we still need animals for testing for a long time.

    • @erickmarin6147
      @erickmarin6147 11 месяцев назад +2

      ​@@anak_kucing101 in some specific cases yes

  • @SweetHyunho
    @SweetHyunho 11 месяцев назад +43

    Those video artifacts are like free VFX

  • @oowaz
    @oowaz 11 месяцев назад +49

    some of the animal videos are terrifying there's something about incorrect anatomy with that level of detail, also the turtle eating jewelry 💀

  • @Droid3455
    @Droid3455 10 месяцев назад +1

    Those failed results really make it look like dreams, where most of the time things are constantly changing and don't make any sense

  • @yaelm631
    @yaelm631 11 месяцев назад +40

    0:45 Wow, I thought it generated nerfs and used assets, which then another AI would beatify the result.
    The fact that it's that much 3D consistent (enough for SFM) is an emerging capability is insane to me.
    We are going to get good 3D generated scenes in no time soon

    • @wpelfeta
      @wpelfeta 11 месяцев назад +5

      What's crazy is that it seems like they could improve the simulation just by throwing more compute at it, so this could still get better.

    • @numb0t
      @numb0t 11 месяцев назад

      @@wpelfeta to the point where we question reality

  • @BrianMosleyUK
    @BrianMosleyUK 11 месяцев назад +13

    This is so reminiscent of lucid dreaming... and also the concept that we are always dreaming, it's just that our waking dreams are framed by the physical world.
    Future generative models will be guided by a 3d physics engine of some sort. We're so very close!
    As 2 minute papers would say... Just another couple of papers down the line... and what a time to be alive!

  • @gnollio
    @gnollio 11 месяцев назад +23

    Bling Zoo needs more footage. I've got to see what that monkey king is up to.

  • @joelface
    @joelface 11 месяцев назад +1

    Love that someone turned some of these simulations into G-Splats. SO much potential by simply prompting the model for a 3D rotation of an item/person/etc. If it can do that consistently, it can make some amazing 3D models that can then be rigged and animated, or simply viewed in holographic space, or explored in 6DOF, etc.

  • @sneedtube
    @sneedtube 11 месяцев назад +1

    By far the best video I saw on the subject

  • @almundtan
    @almundtan 11 месяцев назад +14

    this makes us closer to having Star Trek holo deck simulators

    • @LucasVisage
      @LucasVisage 11 месяцев назад +2

      There is already several physically real 3D hologram devices. Light Field Labs is one that comes to mind. There's only like 2-3 videos showcasing their technology though.

  • @черепахаестклубничку
    @черепахаестклубничку 11 месяцев назад +103

    As a cinematographer i was shitting my pant's seeing Will Smith eating spaghetti. Year later, at this point i really don't care. The industry is doomed and we won't do anything about it. I think at the moment it starts to collapse, there will be more things collapsing, so that would be the least of our problems

    • @AC-zv3fx
      @AC-zv3fx 11 месяцев назад +16

      I don't quite understand. Are you standalone cinematographer working for yourself or are you for hire? Because if you are standalone, then it is just a perfect instrument for you to get anything you want that you can't film irl without need to pay a lot of money to get good CGI.

    • @tyler.walker
      @tyler.walker 11 месяцев назад +26

      As someone who also works professionally in television and video, I agree. Anyone who thinks AI won't eventually become better than every human at nearly every job just doesn't realize how exponential progress is. In the 1950s, the RAMAC 305 was released with 5 Megabytes of storage. The cost of storage has decreased dramatically from about $10,000 per megabyte for the RAMAC 305 to approximately $0.0001 per megabyte for modern SSDs ($0.10 per gigabyte). That's a reduction in cost by a factor of about 100 million times in the last 70 years, and the speeds are about 6,000 times faster.
      As we perfect the chip-stamping process as opposed to lithography for the incredibly delicate >3nm chips (Sam Altman's recent $7 Trillion business ventures may supercharge this), we're going to scale up compute power so much, so quickly. Frankly, I'm slightly worried that the government will consider a legal limit on "consumer compute", because of what will be possible for each and every individual. It sounds like actual sci-fi now, but imagine being able to comfortably run hundreds of LLMs as NPCs in a video game world that was being live-crafted by a Sora-like model, all of them with perfect sounding, emotive voices generated in realtime. Whatever game mechanics, art style, music, characters, story, that you want. Imagine that generating around you in a cheaper, lighter future-version of the Apple Vision Pro, hot damn. The future is gonna be so cool for stuff like this, but our jobs as creatives are so, so in trouble.

    • @ashdang23
      @ashdang23 11 месяцев назад +10

      It’s like you’re gonna completely ignore that society changes and new technology/inventions replace things. this isn’t new my guy and has been happening since the start of humanity
      people always have the same reaction every single time. Complain new technology has arrived and then suck it up and move on in life.

    • @AC-zv3fx
      @AC-zv3fx 11 месяцев назад +10

      And yeah, any disruption in entertainment industry must be the least of our concern compared to cascade effect and beginning of post truth era.

    • @черепахаестклубничку
      @черепахаестклубничку 11 месяцев назад +2

      @@AC-zv3fx im working as a part of production team. Im using a lot of AI tools in editing and pre-production, and trying to learn generative AI as Stable diffusion. But firstly, i mean that all my dreams since high school about becoming high budget movie director are drowning now

  • @itssoaztek4592
    @itssoaztek4592 11 месяцев назад

    Always a pleasure to hear your opinion together with some good explanations of important technical intricacies. Thank you!

  • @NIkolla13
    @NIkolla13 11 месяцев назад +1

    one particular detail that caught my eye is them saying they are using synthetic data to train the models, this may be a clever way of solving both copyright issues and it can be used to imprint a recognizable aesthetic on generated content.

  • @Mobay18
    @Mobay18 11 месяцев назад +29

    I just hope somebody trains this on the most abundant and fastests growing data source for videos, that involve human interactions. ;-)

    • @ps3guy22
      @ps3guy22 11 месяцев назад +3

      😏

    • @starsnoireart
      @starsnoireart 11 месяцев назад +4

      Touch grass.

    • @ironman8257
      @ironman8257 11 месяцев назад +6

      @@starsnoireart cope better

    • @xviii5780
      @xviii5780 11 месяцев назад +8

      @@starsnoireart * touches you * 😳

    • @hitstoythy24
      @hitstoythy24 11 месяцев назад

      @@starsnoireart m ma touch u tonight lil bro

  • @Y0UT0PIA
    @Y0UT0PIA 11 месяцев назад +5

    I thought this would take at least a few more years.
    I want to get off Mr. Bones wild ride.

  • @zedo2512
    @zedo2512 11 месяцев назад

    These guys are moving so fast that my brain cant even keeep up with them 🤯🤯

  • @MemesnShet
    @MemesnShet 11 месяцев назад +2

    Nobody is talking about the leapfrog SORA has made over Dalle3 for single image generation:SORA IMAGES ARE INDISTINGUISHABLE FROM REAL ONES
    Can't wait to try it

  • @lulboiking5806
    @lulboiking5806 11 месяцев назад +3

    this is unreal!!🤩

  • @Allplussomeminus
    @Allplussomeminus 11 месяцев назад +2

    5:02... That looks like something out of a dream... All those previous examples really.

  • @chrisrosenkreuz23
    @chrisrosenkreuz23 11 месяцев назад +1

    This is AMAZING

  • @techwitheds
    @techwitheds 11 месяцев назад

    So nice. This is a game changer

  • @keenheat3335
    @keenheat3335 11 месяцев назад +3

    this current trend of synethic data remind me of an issue that was brought up during tesla FSD development. They were asked why don't use more simulated driving data like waymo or cruise. Tesla respond that you only push the problem from solving self driving to solving perfect simulation of reality. Which is a lot harder problem. Then they show a collection of weird road condition you wouldnt think it would exist. IE: an old man "shepherd" a group of washing machines chain together on a highway, weird shadow pattern that made the road looks like it split into two roads, intersection with 50 plus traffic lights, etc. Reality is a lot weirder than simulation by order of magnitude. So to capture edge case, you still have go observe reality.
    I get the feeling synthetic data will have the same issue of "how close is the synthetic data to reality". These data probably don't capture reality too well. But I guess that's okay since image generation is lot less mission critical than self driving.

  • @Veptis
    @Veptis 11 месяцев назад +5

    So, Sora was ready since March 2023 and they spend a year cherry picking the blog post. Only to drop it once Google and meta had big announcements.
    What else are they holding back on?

  • @dev_ression
    @dev_ression 11 месяцев назад +3

    I just made a video on Sora too and still can’t believe how far we’ve come!

    • @kv4648
      @kv4648 11 месяцев назад

      Is it available?

    • @dev_ression
      @dev_ression 11 месяцев назад

      @@kv4648 not to the general public

    • @kv4648
      @kv4648 11 месяцев назад

      @@dev_ression are you part of team red/beta testers ( I forgot what they're called).
      If you are, are there key limitations that aren't as popularly known yet? I noticed a lot of video game footage or cinematography but no animations, is there something with animation?
      Do you have access without the filters and do you know how or if they work?
      Is there a limit to how long they can extend videos before something happens?

  • @telebijeon3109
    @telebijeon3109 11 месяцев назад

    5:15 rip innocent pedestrians 🤧

  • @metakron
    @metakron 11 месяцев назад

    With this tool, the animation industry goes further than live action series and films

  • @PainfullySubjective
    @PainfullySubjective 11 месяцев назад

    very nice video. thank you

  • @justsomeonepassingby3838
    @justsomeonepassingby3838 11 месяцев назад +1

    6:45 this has to be a jojo reference.
    Dojaaaaan

  • @21EC
    @21EC 11 месяцев назад

    Cool :) I'm exactly the 1000th like

  • @Mad-v3d
    @Mad-v3d 11 месяцев назад +2

    Brings a whole new meaning of God "speaking" the world into existence. The implications will be more clear in a few years when we are able to literally speak into existence entire simulated universes where each entity is operated by its own AI. Pandoras box will be opened, and it will not be able to be closed.

  • @Quack_34
    @Quack_34 11 месяцев назад +3

    mass surveillance chapter begins here... #fubigcorps

    • @boukimagash2083
      @boukimagash2083 11 месяцев назад

      Explain please

    • @Quack_34
      @Quack_34 11 месяцев назад

      @@boukimagash2083holy shit , this RUclips is keeps on deleting my comment

    • @Quack_34
      @Quack_34 11 месяцев назад +1

      RUclips is keeps on deleting my comments, even though they don't have any external links..

    • @kleyyer
      @kleyyer 11 месяцев назад

      bots

  • @hamman_samuel
    @hamman_samuel 11 месяцев назад

    Google playing catch up and losing every time feels good

  • @logbia7k608
    @logbia7k608 11 месяцев назад

    Imagine having AI Holograms, we have a whole new world of NPC's

  • @mrrespected5948
    @mrrespected5948 11 месяцев назад

    Nice

  • @mrsigmaboy-sigma
    @mrsigmaboy-sigma 11 месяцев назад

    7:08 Bling Zoo🤑🤑🤑🤑🤑🗣🗣🗣🗣🗣🗣🥶🥶🥶🥶🥶🥶🔥🔥🔥🔥🔥

  • @garciajoshuagabriels.442
    @garciajoshuagabriels.442 11 месяцев назад +3

    holy sht

  • @leocoyne-xk8gq
    @leocoyne-xk8gq 11 месяцев назад

    If it’s truly understanding physics then what happens when you put in a prompt that deviates from its understanding of the physics ? Like “a man flying through the sky”

  • @yngmeka
    @yngmeka 11 месяцев назад

    Open AI adding watermarks and a marker in what I’m guessing is the photo/video’s metadata is a very good thing but couldn’t that be easy circumvented by photoshoping the watermark out and cleaning any imperfections with another AI model and then screen recording/ screenshotting the video/photo to get rid of the metadata or am I off in my understanding of how this works?

  • @ElSolRacNauj
    @ElSolRacNauj 11 месяцев назад

    6:45 that's oddly satisfying

  • @vaisakh_km
    @vaisakh_km 11 месяцев назад

    I always wanted to be a film director without any skllls... turns out only thing i need to do is wait...

  • @Miguel-vb4xz
    @Miguel-vb4xz 11 месяцев назад +1

    I guess we'll soon be getting our first "that wasn't me that was ai" criminal trial.

  • @davischance-e5h
    @davischance-e5h 11 месяцев назад

    Tha matrix is getting real😂

  • @haveacigar5291
    @haveacigar5291 11 месяцев назад

    and we cant use it and their is still no ai image or video generator that installs as easily as a video game.

    • @maslaxali8826
      @maslaxali8826 11 месяцев назад

      Look into SD or any other huggingface models. They are easy to install as a video game I promise

  • @RhumpleOriginal
    @RhumpleOriginal 11 месяцев назад

    3 years. Robots can build and can build more of themselves.
    Infrastructure set up becomes a joke
    Manual work comes into question in 10 years or less
    Then came the next explosion, that changed the world as we know it

  • @diadetediotedio6918
    @diadetediotedio6918 11 месяцев назад

    I bet it still cannot generate a horse riding a man.

  • @carkawalakhatulistiwa
    @carkawalakhatulistiwa 11 месяцев назад

    First 🎉😊

  • @MyFedora
    @MyFedora 10 месяцев назад

    I think people tend to overestimate how much higher budget movies will use AI. My man, these IMAX camera don't even have auto focus. Even if the camera they use has auto focus, they flat out don't use it in production, period. They can't afford to lose a shot because an AI struggles with a certain scene, or lacks the manual controls to get the desired result. Some even wreck an IMAX camera or four to get a shot they'd only have one opportunity to take. Reliability is one of the most important aspects in higher budget movie productions, and AI just isn't reliable enough.

  • @TheRajmoney
    @TheRajmoney 11 месяцев назад

    Its dreaming... before being born

  • @Emir-Değercan
    @Emir-Değercan 11 месяцев назад

    lol 😂😂😂 3:07

  • @ipu7819
    @ipu7819 11 месяцев назад

    This depresses me on so many levels.

  • @tlpenguin3758
    @tlpenguin3758 11 месяцев назад +1

    i hope they really control what people could do with it, imagine what would happen if people start misusing it

    • @LucasVisage
      @LucasVisage 11 месяцев назад

      The Western Govs will try to control AI on their continents but only the smaller companies will particularly be restrained. China and Russia will have zero restraints. Terminator the movie is soon to be made real.

    • @transsexual_computer_faery
      @transsexual_computer_faery 11 месяцев назад +2

      yeah and people misuse knives all the time and yet we still use knives daily for carpentry and cooking

  • @attilao
    @attilao 11 месяцев назад

    Not a peacock though.

  • @Hamburgernice
    @Hamburgernice 11 месяцев назад

    20 seconds early 💀

  • @Adrians_Lost_and_Found_Visions
    @Adrians_Lost_and_Found_Visions 11 месяцев назад

    Great, early retirement for everyone! :)
    The sooner the AI and robots take over all of the jobs - the better.
    Poverty will be solved. Also 40-80% of people don't like their jobs.
    We all need a few Optimus robots per person and it will be the end of human labor forever.
    Unless you want to work and create something of course. :)
    The transition period - the next 15 years - that might be tough though.
    Hope we will find a solution as fast as possible.
    Deflation in prices of goods and services is one of the best options for starters. Things should be cheaper if humans are not making them.

  • @flareonspotify
    @flareonspotify 11 месяцев назад

    👁👅👁

  • @Lancer95_305
    @Lancer95_305 11 месяцев назад

    I hate it when I'm right 😒😒😒

  • @ProjectMoff
    @ProjectMoff 11 месяцев назад

    I am sick of this small minded narrative of people losing their jobs… The issue isn’t people losing their jobs, the issue is the system.

  • @marcosmwb8444
    @marcosmwb8444 10 месяцев назад

    Just one step further and we will have real time world generation with interactive inputs... game developers like myself should probably find a second job to work until we start a cybercomunism, machines being the working class and humans the state... for those accourding to their needs(humans) from those according to its capabilities(AI/robots).
    The only way communism might work...
    This expectation coming from an extreme rightwing libertarian.

  • @Donttouchher-x1l
    @Donttouchher-x1l 11 месяцев назад

    The Open AI company is miserable, and China is ready to steal technology across the board😂

  • @sahebbeshra7659
    @sahebbeshra7659 11 месяцев назад

    AI takeover is coming 😅

  • @numb0t
    @numb0t 11 месяцев назад

    Late, this was yesterdays news

    • @greenhound
      @greenhound 11 месяцев назад +10

      god forbid somebody takes time making a video properly instead of rushing out a video spouting incomplete thoughts as soon as they see the news

    • @numb0t
      @numb0t 11 месяцев назад

      @@greenhound right... If you're intellectually incapable...

    • @tydiego
      @tydiego 11 месяцев назад +1

      which has relevance today

  • @ondrazposukie
    @ondrazposukie 11 месяцев назад

    Gemini 1.5 is much bigger news than Sora

    • @joelface
      @joelface 11 месяцев назад +1

      Definitely not, but it's bigger news than it's getting credit for, for sure.

    • @ondrazposukie
      @ondrazposukie 11 месяцев назад

      @@joelface videos are just art, they are for fun, answering questions is much important for humanity

    • @joelface
      @joelface 11 месяцев назад

      @@ondrazposukie I think a multi-modal model trained on text and video and more will be needed to truly solve humanity's burning questions. So I actually think Sora represents a really big step towards building a model with a much deeper understanding of reality.