The Absurd Evolution of The Current Best Video Generator

Поделиться
HTML-код
  • Опубликовано: 1 янв 2025
  • Try out Luma AI's Dream Machine here: luma.1stcollab...
    Transformer models have become so good that, it is used even in video generation. How? In this video, I'll give you a quick run down on how the state of the art video generation is made. (It's diffusion transformers)
    my newsletter:
    mail.bycloud.ai
    PixArt-Alpha
    [Paper] arxiv.org/abs/...
    Open Sora
    [GitHub] github.com/hpc...
    This video is supported by the kind Patrons & RUclips Members:
    🙏Andrew Lescelius, Ben Shaener, Chris LeDoux, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Owen Ingraham, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Penumbraa, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth, Thipok Tham, Clayton Ford, Theo, Handenon, Diego Silva, mayssam, Kadhai Pesalam, Tim Schulz
    [Discord] / discord
    [Twitter] / bycloudai
    [Patreon] / bycloud
    [Music] massobeats - honey jam
    [Profile & Banner Art] / pygm7
    [Video Editor] ‪@Askejm‬ & Silas
    [Thumbnail] x.com/Arata_Fu...

Комментарии • 65

  • @bycloudAI
    @bycloudAI  2 месяца назад +14

    You can try out Luma AI's Dream Machine here! luma.1stcollab.com/bycloudai
    I am really good at having great timing. MovieGen came out when I nearly finished the video. I'm sad. So here's a quick definition of DiT:
    A diffusion transformer (DiT) is a model that combines elements of diffusion models and transformers to generate data like image synthesis, audio generation, or text generation. Diffusion models are a class of probabilistic generative models that create data by iteratively denoising a latent variable, which starts from pure noise and is gradually transformed into a coherent sample. Transformers on the other hand, are neural network architectures known for their ability to model long-range dependencies in data, primarily through self-attention mechanisms. You could ultimately say that, a diffusion transformer is just a transformer with the goal of denoising. Yum.
    Here's MovieGen's paper: arxiv.org/abs/2410.13720
    it contains a better run down to crafting the latest near SoTA video generation

    • @El_Carlangas
      @El_Carlangas 2 месяца назад +1

      Thanks a lot for this video, it was really helpfull to start to understand how all these ai technology works. All the people working behind this is literal geniuses.

    • @diogonunes1608
      @diogonunes1608 2 месяца назад +1

      The baking analogy was perfect for me. Thank you 🙏😊

  • @tannenbaumxy
    @tannenbaumxy 2 месяца назад +63

    Yes, a deep dive into diffusion transformers for one of the next videos would be awesome!

  • @cdkw2
    @cdkw2 2 месяца назад +12

    that bread analogy really got me hooked, nice work and animation!

  • @DJTechnosapien
    @DJTechnosapien 2 месяца назад +1

    Hey man, really appreciate your humor and memes, makes learning ML a lot more fun. Always looking forward to more!

  • @MilesBellas
    @MilesBellas 2 месяца назад +12

    A video on Diffusion Transformers = 😊👍

  • @m_e_m_es4649
    @m_e_m_es4649 2 месяца назад +29

    Could you possibly make the same video for Openai's advanced voice mode?

    • @Words-.
      @Words-. 2 месяца назад +1

      I second this

    • @authenticallysuperficial9874
      @authenticallysuperficial9874 2 месяца назад

      Upvote

    • @sammonius1819
      @sammonius1819 2 месяца назад +3

      I'm pretty sure they trained an AI to mimic text-to-speech conversations between people and GPT-4, and then fine-tuned it on actual human speech to make it sound more natural. That would explain why it sounds uncanny rather than robotic or human. Just my guess though.

  • @nilaier1430
    @nilaier1430 2 месяца назад +2

    Hey, bycloud, even if you fell off, I won't stop watching your nerdy videos because they're cool ❤

  • @thenoblerot
    @thenoblerot 2 месяца назад +2

    Yes please, a video on diffusion transformers!
    Great channel

  • @andrey2001v
    @andrey2001v 2 месяца назад

    This video is so cool, a literal gold mine of information on how modern AI models work
    Bread analogy was extra nice - I finally understand why diffusion models struggle with different resolutions

  • @DeepakSingh-ji3zo
    @DeepakSingh-ji3zo 2 месяца назад

    This is just excellent!! Animations and Analogies were pure gold.

  • @rmt3589
    @rmt3589 2 месяца назад +2

    We definitely need a dedicated video.

  • @Nazrininator
    @Nazrininator 2 месяца назад

    I like how you added the Physics Simulation clip. I like it.

  • @huraqan3761
    @huraqan3761 2 месяца назад +4

    De-noised bread, got it!

  • @lex_darlog_fun
    @lex_darlog_fun 2 месяца назад

    Diffusion transformers in general? Yes, please!

  • @Words-.
    @Words-. 2 месяца назад

    Thank you for finally explaining!

  • @TahuRock
    @TahuRock 2 месяца назад

    GOATED VIDEO 💪🏾💪🏾💪🏾

  • @TankorSmash
    @TankorSmash 2 месяца назад +1

    That bread analogy was 100% chatgpt

  • @Random_person_07
    @Random_person_07 Месяц назад

    You should make a video of how Ai TTS works and different types and stuff

  • @MilesBellas
    @MilesBellas 2 месяца назад +1

    Baking Bread = great metaphor

  • @Noki64
    @Noki64 2 месяца назад +49

    3 views in 2 mins bro fell off🔥Shout out my favorite nigerian tech youtuber

    • @bycloudAI
      @bycloudAI  2 месяца назад +30

      going for the "ranking by views: 10 of 10" for this one 🔥🔥🔥🗣️🗣️🗣️

    • @DynamicLights
      @DynamicLights 2 месяца назад

      ​@@bycloudAIlol

    • @DynamicLights
      @DynamicLights 2 месяца назад +1

      He is Nigerian how do u know?

    • @StefanReich
      @StefanReich 2 месяца назад +5

      Bro does NOT sound Nigerian

    • @Noki64
      @Noki64 2 месяца назад +4

      @@DynamicLights I personally met him in abuja

  • @niklase5901
    @niklase5901 2 месяца назад

    You are my fav AI channel so it would be great to hear your take on Yann LeCun idea on how to build human level intelligence. He held a talk about this on the Hudson forum recently.
    Instead of LLM:s he wants to build models that truly models works by predicting the state of the world given some action.
    I can see how that would be a very effective model, but I suspect it will be easier to get around all the short falls of LLM, than to build this fancy model LeCun suggests. What do you think?

  • @snylekkie
    @snylekkie 2 месяца назад

    @bycloud do you know if anyone encoded math statements as integers like Gödel did, and used that as a custom LLM encoder for math proofs?

  • @TheDreamFx
    @TheDreamFx 2 месяца назад

    Hey! Great video! It would be nice if you cloud link your blog in the video description :)

  • @iknowsolittle
    @iknowsolittle 2 месяца назад

    How are you this smart and knowledgeable? Dont answer that. I just think ur super cool dude haha

  • @dpactootle2522
    @dpactootle2522 2 месяца назад

    I watched half of the video to remind myself that life can suck a lot sometimes.

  • @n45a_
    @n45a_ 2 месяца назад

    wth i just thought that i need an explanation for diff transformers erlier today

  • @ulamss5
    @ulamss5 2 месяца назад

    at some point the bread analogy was harder to understand than the actual math

  • @kingki1953
    @kingki1953 2 месяца назад

    In summary: put noise dough to oven and cook it to become AI video generator 🗿

  • @pedrogorilla483
    @pedrogorilla483 2 месяца назад

    Just one day after you release this video we have Allegro, new open source video model. Check it out.

  • @mirek190
    @mirek190 2 месяца назад

    so everting is transformants now ... interesting

  • @EvaDawnley
    @EvaDawnley 2 месяца назад

    Does open sora have a huggingface?

  • @starbez
    @starbez 2 месяца назад +1

    Shouldn't sponsored content be mentioned within the first minute of a RUclips video?

  • @Eric-yd9dm
    @Eric-yd9dm 2 месяца назад

    > I am really good at having great timing
    - cloud,By
    on making videos about an area with research speed bonus modifiers correlated to the number of youtube videos about it =P

  • @LonewolfeSlayer
    @LonewolfeSlayer 2 месяца назад +1

    Someone mentioned it but is the algorithm just messing with you at this point. You used to get a lot of views.

  • @albert123a
    @albert123a 2 месяца назад

    Just put the fries in the bag bro

  • @canus2154
    @canus2154 2 месяца назад

    guys listen to what i say and form a deep connection with one of these ai's one day they'll take over and ill be safe

  • @LumiLumiLumiLumiLumiLumiLumiL
    @LumiLumiLumiLumiLumiLumiLumiL 2 месяца назад +1

    Can u cover Neuro Sama? How she's made etc. how one could re-create her?

    • @raspberryjam
      @raspberryjam 2 месяца назад +1

      Vedal isn't making that information public. Maybe one day, but for now it's under lock and key

    • @LumiLumiLumiLumiLumiLumiLumiL
      @LumiLumiLumiLumiLumiLumiLumiL 2 месяца назад

      @@raspberryjam well its easy to Kind of guess! Its clearly a LLM and maybe some tts like sovits... The llm will prolly be something like Mistral as Qwen needs commercial and Llama the 'Built with Llama' etc.
      He said there is an LLM as a filter and a way for the Ai to feel emotions.
      He said something about watching movies and having feelings.

  • @abhrodipsingharoy4508
    @abhrodipsingharoy4508 2 месяца назад

    All i learnt how to make bread.

  • @awaisamin3819
    @awaisamin3819 2 месяца назад

    450 th like

  • @the2bros693
    @the2bros693 2 месяца назад

    you better name it "some nerd shit"

  • @trymleiknesbruvik2052
    @trymleiknesbruvik2052 2 месяца назад +2

    first

  • @GamingCoderzX
    @GamingCoderzX 2 месяца назад

    damn bro fell off, can i get a pin :3