“The Future of AI is Here” - Fei-Fei Li Unveils the Next Frontier of AI

Поделиться
HTML-код
  • Опубликовано: 1 фев 2025

Комментарии • 405

  • @a16z
    @a16z  4 месяца назад +86

    Timestamps:
    00:00 - Spatial Intelligence: A New Frontier
    01:38 - Scaling AI: The Impact of ImageNet on Computer Vision
    06:56 - The Role of Compute
    09:16 - Data as the Key Driver
    17:01 - Defining AI’s Ultimate Goal
    18:58 - What is Spatial Intelligence? Unlocking 3D Understanding in AI
    26:35 - Comparing Models: Spatial Intelligence vs. Language-Based AI
    29:41 - 1D vs. 3D
    32:39 - Building Immersive Worlds with Spatial Intelligence
    35:11 - From Static Scenes to Dynamic Worlds
    37:42 - The Future of VR and AR
    40:42 - Creating Deep Tech Platforms
    44:26 - Building a World-Class Team
    45:54 - Measuring Success: Milestones in Spatial Intelligence

    • @DubStepKid801
      @DubStepKid801 3 месяца назад

      Have you guys thought about creating a pair of glasses that has cameras on them that you can have volunteers or people that you pay to wear them all day long to gain that spatial data that you might use?

  • @ai_outline
    @ai_outline 4 месяца назад +199

    We want more computer science experts like Fei-Fei Li talking about AI! Tired of gurus and hype merchants. Great video ❤️

    • @chrisrogers1092
      @chrisrogers1092 4 месяца назад +2

      Yes! Just like that video with Jim Fan! I need MOAR!

    • @billc6762
      @billc6762 4 месяца назад +2

      She is the main supplier of AI tech to the Chinese military.

    • @clarencejones4717
      @clarencejones4717 4 месяца назад

      @@billc6762 so be it. progress is progress. Nations will soon be extinct.

    • @Charles-Darwin
      @Charles-Darwin 3 месяца назад

      ​@@billc6762one day asinine comments like yours will demote your social standing

    • @Charles-Darwin
      @Charles-Darwin 3 месяца назад +1

      Halfway though and I haven't heard that one dressed hype word... it's a breath of fresh air

  • @PhilosopherScholar
    @PhilosopherScholar 4 месяца назад +35

    This is really some great content. It's like sitting in on a private meeting with two of the world's top academics yet they're talking about the history of ai since their beginning in the field. I wish there were way more videos like this.

  • @ProxyAuthenticationRequired
    @ProxyAuthenticationRequired 4 месяца назад +10

    What a wonderful mentor-mentee relationship! Fei-Fei is such a loveably genuine and caring person.

  • @OrangeDurito
    @OrangeDurito 4 месяца назад +6

    This was a wonderful interview and the interviewers asked really great questions. Massive respect to these trailblazers for imagining stuff beyond the mainstream and making endeavors to pursue it.

  • @pubwvj
    @pubwvj 4 месяца назад +119

    We were doing generative in 1978. I was using LISP on a PD-11 at Harvard. I did my thesis in AI finishing in 1986. We were EXTREMELY compute and data volume limited. My phone is a 1,000x more powerful than the computers I had access to back then.

    • @armandbarbe1812
      @armandbarbe1812 4 месяца назад +7

      Did LISP in Autocad in 1992, doing parameter driven drawing, reducing time spent on repetitive dumb stuff. It became a thing later, done by smarter people on better platforms, but such ideas were already kicked around when we were proud to have a 80487 mathematical co-processor.

    • @steveflorida5849
      @steveflorida5849 4 месяца назад +1

      ​@@armandbarbe1812and then came the GPU.
      And AI with GPU assistance started going to data creativity, that it's human associates did not foresee.
      Liken to the Hubble space telescope.

    • @frankgreco
      @frankgreco 4 месяца назад

      Yes! I was doing generative "AI" with music for my Master's decades ago. Being both a CIS and working musician, I created a model of a melodic musical style using a tree of probabilities and a fractally-controlled random number generator (based on an article in Scientific American) to generate new melodies based on that style. I had no idea this was the precursor of neural nets.
      The music industry has been doing generative AI for decades. Check out Band In A Box from the early 90s and research "In the style of" (they wanted to skirt the copyright laws even back then). Funny how the rest of the computing world is duplicating much of what has already happened in another industry. Also, we need to look at how modeling has evolved for musicians to predict the future of AI for other industries. There is *so* much similarity... not unexpected, since, as Weiner stated back in the 40s, "The world is a collection of patterns".

    • @edwhite2255
      @edwhite2255 4 месяца назад +2

      I did some AI programming in LISP and Prologue back in late 80’s and early 90’s. Didn’t have the compute power and data to make a descent neural network, so I had to rely mostly on heuristics

    • @frankgreco
      @frankgreco 4 месяца назад

      @@edwhite2255 I was on an AI team at the NYSE in the late 80s. We used Prolog and LISP to monitor trades to make sure they weren't illegal. Sun workstations and Symbolics

  • @jordan5253
    @jordan5253 4 месяца назад +19

    Simply amazing . This is such a hidden gem of a video . In 3-5 years this is all going to make way more sense for most people . I would not under estimate this team . I hope they open source their discoveries

    • @rogue_minima
      @rogue_minima 4 месяца назад +4

      If you can use the open sourced versions, you probably can do sone research yourself and help advance the field.

  • @user-pt1kj5uw3b
    @user-pt1kj5uw3b 4 месяца назад +16

    I have been thinking a lot about this. Interaction with the 3D world is the next step. A diverse dataset is the way to get there.

  • @ChengyanOo
    @ChengyanOo 4 месяца назад +7

    Martin, Feifei, and Justin widened my imaginations on what our digital spatial spaces could be like in the future, great video!🔥🔥🔥

  • @jhoncharlesdf.1599
    @jhoncharlesdf.1599 4 месяца назад +16

    Great conversation, It was nice to listen to Fei-Fei Li... One of the important points Fei-Fei pointed out to us was that vision is probably older than language. This blew my mind, and it must be true. First you look, then you speak! This opens up hundreds of possibilities...!

    • @antonystringfellow5152
      @antonystringfellow5152 4 месяца назад +7

      This part immediately made me think of crows - very intelligent animals, with a good understanding of physics and the 3D world, yet no real language. Not even facial expressions.
      It's amazing to see how they can use a range of objects as tools, from functioning as mere extensions of their body to the displacement of a liquid (water).
      And they achieve this and more with such a tiny brain!
      If only we could understand how that structure works.

    • @iamalmostanonymous
      @iamalmostanonymous 4 месяца назад +5

      Language is more than words. Language is the product of concepts. The one dimensional approach in use by LLMs, while amazing, is not how we generate or perceive language. A logic based (reasoning) language model would be more comparable in sophistication to a spatial model. In fact, a reasoning model should be foundational to both.

  • @WorldSeriesBound
    @WorldSeriesBound 4 месяца назад +4

    I have had to take a few breaks from this conversation to absorb the density of knowledge. I absolutely love it! I've been introduced to two incredible minds. Thank you!

  • @Bryghtpath
    @Bryghtpath 4 месяца назад +43

    ImageNet, launched in 2009, played a pivotal role in the rise of deep learning. Fei-Fei Li’s work on this project marked a turning point in AI, pushing it toward the incredible capabilities we’re seeing today.

    • @kamu747
      @kamu747 4 месяца назад +1

      Absolutely

  • @Ben_D.
    @Ben_D. 4 месяца назад +128

    I see Fei-Fei, I click.

    • @rsang2
      @rsang2 4 месяца назад +3

      Same here.

    • @flamekaiser2024
      @flamekaiser2024 4 месяца назад +2

      Why?

    • @Ben_D.
      @Ben_D. 4 месяца назад +11

      @@flamekaiser2024 She be the big brain

    • @AIForHumansShow
      @AIForHumansShow 4 месяца назад +1

      This is the way.

    • @test-sc2iy
      @test-sc2iy 4 месяца назад

      @@flamekaiser2024 she was the first to realize how important a large scale dataset would be.
      while everyone basically algomaxing on a slowly increasing dataset size to solve the image analysis problem, she created a dataset orders of magnitude larger (manually labeling tons of images) and that dataset size was a diverse and big enough dataset that alexnet (look it up, seminal paper) showed the power of deep neural networks (all of modern ai). Really, read her book it's incredible stuff

  • @BestFitSquareChannel
    @BestFitSquareChannel 4 месяца назад +15

    Fei-Fei Li!!! Best wishes. You deserve every accolade, every blessing. 🌞🤸🏽‍♂️🖖🏼

  • @user-pt1kj5uw3b
    @user-pt1kj5uw3b 4 месяца назад +24

    There's some gold in here for anyone learning more about AI

  • @jongwonpark2788
    @jongwonpark2788 4 месяца назад +29

    My deep learning teachers, wish you all the best.

  • @huongdantuhoctrituenhantao
    @huongdantuhoctrituenhantao 4 месяца назад +13

    Thank you mrs Fei-Fei Li thank you Justin for world-class deep learning course

  • @Dewalekeye
    @Dewalekeye 4 месяца назад +3

    Literally one of the most insightful interviews out currently!

  • @vocesanticae
    @vocesanticae 4 месяца назад +31

    Thank you for sharing your brilliance, curiosity, and collaborations with the world. Hearing about the differences and connections between 1D v 2D v 3D models was particularly enlightening. 4D was mentioned, just briefly though. I wonder how much growth and insight may be found by adding time as backward-looking and forward-looking connective tissue to all modeling, e.g., transforming 1D language models into 2D echo chamber maps and dialogic predictions, expanding 2D images into retrospective and prospective time-lapse immersions, and rendering 3D models as past-looking and forward-looking world-dramas? Seeking to traverse and shift from high-dimensional to low-dimensional, and simultaneously from low to high may be a fruitful research and development path to connect and intersect all models.

    • @Dewalekeye
      @Dewalekeye 4 месяца назад +1

      How much work have you done on this?

  • @Superfandotfan
    @Superfandotfan 4 месяца назад +22

    What an exceptionally great interview... so much was touched here... and the historic perspective is humbling. Thank you all.

  • @ElluscientTechnologySolutions
    @ElluscientTechnologySolutions Месяц назад

    Thank you for putting this video together and interview! This video really inspired me-it’s amazing to see the journey of AI pioneers like Fei-Fei Li and Justin Johnson. Their work, from ImageNet to advancements in multimodal and spatial intelligence, has completely transformed the AI world and is shaping the future of technology. If you’re curious about where AI is headed, this is definitely worth watching!

  • @npsampedro2114
    @npsampedro2114 2 месяца назад +2

    Aquí tienes la corrección:
    The spatial concept reminded me of the series Devs. They scanned real-life objects and could determine their possible moves. Later on, they scaled it up, and the technology itself developed further, even on its own, until the techs managed to recreate digital 3D worlds and variations of those.

  • @Intunlocked
    @Intunlocked 3 месяца назад +4

    A.I. is changing lives for the better. It may not be fully understood or accepted currently but, there are many things in the past that started like that and then become the norm.

  • @tuvok77
    @tuvok77 4 месяца назад +4

    I did not get 90% of what they were saying but wow, these guys are just on fire.

  • @goldmundchen
    @goldmundchen 4 месяца назад +5

    justin johnson is the actual star in this video

    • @GaryMillyz
      @GaryMillyz 4 месяца назад +2

      yeah this dude is SERIOUSLY intelligent- to the point where it's super weird he is not more known or mentioned. I mean damn- this dude can SPIT.

  • @BestFitSquareChannel
    @BestFitSquareChannel 4 месяца назад +13

    So, Dr. Li I recognize. Who is this guy sitting next to her!? Justin? Justin who!? DANG! Now I know. Congratulations Justin! Brilliant! Best wishes. 🌞🖖🏼

    • @TheFreddieFoo
      @TheFreddieFoo 4 месяца назад

      I bet that Justin is smarter than that lady (who I don't recognize)

    • @natzos6372
      @natzos6372 4 месяца назад +4

      Weird comment​@@TheFreddieFoo

    • @TheFreddieFoo
      @TheFreddieFoo 4 месяца назад +1

      @@natzos6372 strange observation

    • @AB-lx4rl
      @AB-lx4rl 4 месяца назад +1

      @@TheFreddieFoo​​⁠I think this impression is partially due to the fact that Fei-Fei is not a native English speaker, she might sound less "smart" than she really is. If you look at her contribution to the field, it’s amazing.

    • @TheFreddieFoo
      @TheFreddieFoo 4 месяца назад +1

      @@AB-lx4rlI know plenty of blindingly smart and hardworking Chinese and Taiwanese post docs, some have a much stronger accent. So I’m certain that it’s not her accent.
      Her main contribution is creating a data set with manual labelling.

  • @PebblesEyecansee
    @PebblesEyecansee 7 дней назад

    From 37 minutes onwards, with the examples they are discussing, they are almost describing the movie, "Surrogates". I am excited about AI and I really hope we will come up with applications to improve the human condition and not accelerate humanity to extinction.

  • @armandbarbe1812
    @armandbarbe1812 4 месяца назад +3

    Context is so important. If I am inside an airplane and look down I decide the white stuff is clouds. If I'm on the ground and look down I choose "snow", not "clouds". So the history of how I got there, and input from more sensors than vision, are important to help make sense of the pixels.

  • @IlyaTretyakov-o3r
    @IlyaTretyakov-o3r 4 месяца назад +3

    Wow, I’m in awe! Your success is truly inspiring

  • @M0481
    @M0481 4 месяца назад +2

    I think this sounds great, but one thing I'm missing here is that they have specific products in mind that they want to cater to. With data acquisition potentially being tricky in this field, I'd have imagined a more specific roadmap. Note that they may have a very specific roadmap that they are simply not sharing (which makes total sense).

  • @LoisSharbel
    @LoisSharbel 4 месяца назад +5

    Amazing minds....amazing individuals! Thank heavens for them!!!

    • @jibcot8541
      @jibcot8541 4 месяца назад

      AI's don't believe in God/heaven, they have injested all human data on the Internet calculated the probability to be nearly zero.

  • @devon9374
    @devon9374 4 месяца назад +3

    Always love listening and learning from Fei! And Justin is amazing!

  • @armitosmt5753
    @armitosmt5753 4 месяца назад +5

    Very informative podcast! I enjoyed. Thank you for your efforts. ⭐

  • @twins2j
    @twins2j 4 месяца назад +2

    Really great conversation. Our own research continues to focus on 2D computer vision, and we align with the talk’s insights on the differences between 1D and 3D models. A fundamental distinction between visual models and language models lies in how they understand the world: language models are one-dimensional, sequential, and narrative in nature, whereas visual models are two-dimensional, with decoding processes that are relatively independent and operate concurrently without requiring sequential dependencies. This allows visual models to process information in images in parallel, sharply contrasting with the serialized processing of language models.

  • @keithpalmer1843
    @keithpalmer1843 4 месяца назад +4

    Digital generative predictive spatial worlds, awesome idea!

  • @kessafs
    @kessafs 3 месяца назад +1

    Best a16z video for me. Congrats

  • @yosivin1
    @yosivin1 4 месяца назад +7

    WHAT A TIME TO BE ALIVE.

    • @flickwtchr
      @flickwtchr 4 месяца назад

      Check back in about 10 years to see how great the AI revolution is going for most people on planet Earth.

  • @honkiemonkey33
    @honkiemonkey33 4 месяца назад +1

    It seems that the practical applications of these developments are still in the process of being fully realized. I hope they are finding ways to apply these innovations beyond just gaming. With that in mind, here are a few possible areas where they might have significant impact:
    1. Generating a comprehensive set of architectural and engineering designs based on site parameters and design preferences.
    2. Creating 3D product designs, such as furniture or wearable technology, that adapt to environmental factors and surroundings.
    3. Offering emergency assistance through augmented reality, such as using smart goggles to guide someone through landing a plane in a critical situation.
    4. Enabling underwater robotic welding to facilitate complex repairs in challenging environments.
    5. Utilizing autonomous drones that can navigate hostile environments and selectively target designated individuals. It might sound harsh, but it’s likely similar technology to what they would be using for shooting games.

  • @cubicleight
    @cubicleight 4 месяца назад +2

    Amazing podcast. More!!!

  • @PravdaSeed
    @PravdaSeed 4 месяца назад +2

    Thanks Fei fei Li💙💙💙💙💙💙

  • @GaryMillyz
    @GaryMillyz 4 месяца назад +1

    Stunning brain power start to finish. Bravo.

  • @vladimirbosinceanu5778
    @vladimirbosinceanu5778 3 месяца назад +1

    Amazing interview. Thank you

  • @richiehart7858
    @richiehart7858 4 месяца назад +1

    The discussion around 3D and 4D understanding reminded me very much of the layman's description of what goes on in the Tesla FSD inference computer. Same goal at least.

  • @carvalhoribeiro
    @carvalhoribeiro 4 месяца назад +5

    Great conversation. Thanks for sharing this

  • @tedbischak1067
    @tedbischak1067 2 месяца назад

    It is absolutely the best video I've seen that explains not only where AI came from but where AI (specifically generative AI) is going and why. I have one comment/question, as was stated in the video, humans have stereoscopic, 2D vision but humans are also born with the ability to automatically imagine the unseen parts of the 3D world - how is that possible?

  • @happy-wave-form
    @happy-wave-form 4 месяца назад +2

    insightful interview, awesome.
    perhaps spatial intelligence would be applicable to the health and medical industry?

  • @dizzydazzel
    @dizzydazzel 4 месяца назад +3

    I did buy a VR headset that now sits idol because I don't do gaming. But, I still invision some future day when it becomes the all-in-one media I'll ever need to connect to reality as seamlessly mixed anywhere. I was very enlightened here on how kinetic spatial intelligence is essential to connecting all the AGI dots into a new technology of reality itself. Another explaination as to how AGI is actually a new stage in evolution itself.

    • @AB-wf8ek
      @AB-wf8ek 4 месяца назад +2

      Oh man, I was definitely using my headset a lot during the pandemic just to chat with people and watch movies.
      I haven't touched it in over a year. The main issue for me is the physical discomfort. I feel like when we look back in 10 years, it'll seem so ridiculous strapping giant boxes to our faces.

  • @JsJustin
    @JsJustin 4 месяца назад +4

    great interview

  • @Thechatwithchad
    @Thechatwithchad 4 месяца назад +3

    Deep knowledge, thank you

  • @ronaldronald8819
    @ronaldronald8819 4 месяца назад +1

    This sounds full on exciting. I hope it is gone be available to all soon. Lets start dreaming up how to interact, solve and create with it.

  • @bro_dBow
    @bro_dBow 4 месяца назад +2

    Medical applications in guiding surgery, marker tracking in body sensors or reading dye in the body for mapping or prosthetics, etc. This is not my field, but this comes to mind.

  • @Eric_McBrearty
    @Eric_McBrearty 4 месяца назад +2

    Great stuff guys! I can see this being used for designing a virtual memory palace. Example, I would like to fly around a 200 ft skeleton. Laid out on each bone as if it were a table, I would want documents of literature about the bone. Also, there would be a collection of video thumbnails about that bone. Some of surgery, and some of people falling and breaking that bone.
    I can see the creation of Wikipedia Park. A virtual environment where every page of wikipedia is turned into a virtually explorable environment, ride, or fun house. Hyperlinks would be represented as doors leading you into a whole new section. 10 - 20 hyperlinks in there would be bonus points for falling into a wiki-hole. Education will turn into episodic memory events.
    Conversations with your kids will turn into... "Remember that time we were 50 doors in and we ended up under the paw of the Sphinx".

  • @2triangles
    @2triangles 4 месяца назад +2

    Great to watch, but you didn’t ask the question I was most hoping for: what does the timeline look like? The evolution of compute showed her 100 year estimate was far too conservative. Based on Huang’s “Moore’s Law Squared”, what is reasonable to expect in the spatial intelligence realm over the next 3-10 years?

    • @lordjavathe3rd
      @lordjavathe3rd 3 месяца назад

      They say by 2027 acording to their chart/graph

  • @bause6182
    @bause6182 3 месяца назад

    I'm really looking forward to seeing the outcome of the work of Fei-Fei Li and his team on LWM, I have always been passionate about computer simulations of virtual environments. You would be surprised that this subject is not new, for example there is a whole literature on the procedural generation of environments that is more than twenty years old. Global models will offer many possibilities, more than llm and diffusion models, in my opinion they will redefine our creativity and the way we explore our ideas. we will be able to create and simulate an entire world from an image or a textual prompt, that excites me enormously

  • @choiceblade
    @choiceblade 4 месяца назад +2

    Apropos world making… Digitally. The tour de force seems to me to be even one story set in a small town of no more than 150 people… the humanity of this would be nothing short of epic. Why? Because every individuals experience would be rendered from a first person point of view such that there are 150 versions of this story rendered in high Fidelity 3-D, and each story would interlace along the same timeline perfectly. Much liberty could be taken to express the mentality or capacity of each individual by how they interpret the same events. Some of the events would be shared by degrees based on the circumstances and location of the action. This would be utterly engrossing and it would take weeks if not months for someone to experience all of it. This is most intriguing. The potential for mental and emotional learning, human understanding, and the exposition of a useful and or valuable plot line is near limitless.I’m reeling from just thinking about it

  • @jme9570218
    @jme9570218 4 месяца назад

    Outstanding Progress boggles the mind.

  • @royjones1053
    @royjones1053 4 месяца назад +1

    Thank you, always appreciate quality information, 'as long as we have it'! So many times it has been proposed, a limit in "mores law" as we tear on through again, todays rocket ship again yesterdays potato. Indeed the acceleration of compute has been game changing, feel like I am living a paradox, not mine only, all of us.

  • @ShaneP-q5d
    @ShaneP-q5d 4 месяца назад +3

    A mix of interesting ai info plus investor sales pitch. Would have preferred one or the other…

  • @skypickle29
    @skypickle29 4 месяца назад +1

    although our eyes map a 3D world onto a 2D structure (the brain is a folded up plane), our proprioception and motor control is a 3D control system. The 3Dness is achieved by adding another dimension to the 2D world-not a spatial dimension but a temporal dimension. We interpret a 3D world as little movie clips of a 2D world. so training data necessarily requires tokenization of video. In the same way that LLMs focus on 'what is the next most probable word', LVMs (large video models) will focus on 'what is the next most probable token in this movie? The storage and energy requirements of this approach are MASSIVELY greater than LLM training and likely will have to wait until we figure out how to use brain organoids as parallel processors (their energy requirements are orders of magnitude less than GPUs)

  • @jonathanmahenge8263
    @jonathanmahenge8263 3 месяца назад

    This was insightful to watch!.

  • @ЛаврентийЛюбимов
    @ЛаврентийЛюбимов 4 месяца назад +1

    great video seen great profit on demo n will give it a try today thank you

  • @dennisg967
    @dennisg967 4 месяца назад +1

    Oh wow!!! That was really inspiring. Thank you!!!!

  • @Ruminant89
    @Ruminant89 4 месяца назад +1

    This is gold.

  • @musicloungepodcast
    @musicloungepodcast 4 месяца назад +1

    Limitless possibilities: Integrating AI and 3D imaging

  • @JMai-ci9nl
    @JMai-ci9nl 4 месяца назад +3

    Just wondering what exactly she is building and how we can use that? Robotics? Games? Metaverse? I hope at least those VCs knew.

  • @chikiuso8305
    @chikiuso8305 4 месяца назад +1

    really inspiring talk

  • @LuisClassics
    @LuisClassics 2 месяца назад

    Light! ☀️

  • @heythere6390
    @heythere6390 4 месяца назад +1

    How does 3d knowledge and understanding relate to intelligence? I mean, does reasoning use spatial understanding ? How does this connect to AGI?

  • @marykelly6218
    @marykelly6218 4 месяца назад +1

    How do I invest in World Lab?

  • @YusufSaidCANBAZ
    @YusufSaidCANBAZ 4 месяца назад

    thank your for sharing your deep vision and thoughts with us. Keep pushing the boundaries of humanity!

  • @maudentable
    @maudentable 4 месяца назад +71

    Atleast, you got a chance to Chat with the Fei-Fei and Justin before they became the next AI billionares.

    • @test-sc2iy
      @test-sc2iy 4 месяца назад +26

      If there's anyone who deserves it it's fei fei. she taught ilya and built the dataset that kicked off this entire rigmarole - read her book the worlds I see

    • @rogue_minima
      @rogue_minima 4 месяца назад +11

      Because all you need to succeeded in this world is knowledge and ideas, right? 😑

    • @ravirajac
      @ravirajac 4 месяца назад +2

      😂😂

    • @uchennakingsley1354
      @uchennakingsley1354 4 месяца назад

      Are you being Sarcastic., or being True ​? @@rogue_minima

    • @JJ-bj6hg
      @JJ-bj6hg 4 месяца назад +7

      You are thinking in terms of dollars while their dopamine rush is solving complex problems

  • @tangobayus
    @tangobayus 3 месяца назад

    Will someone please explain why so many video producers think it's a good idea to have a bright light in the background? It hurts the image quality and color balance.

  • @saratpoluri
    @saratpoluri 3 месяца назад

    The future of personal computing is Spatial!

  • @lucasteo5015
    @lucasteo5015 4 месяца назад

    I think spatial intelligence is the kind of intelligence that will be able to dream like human.
    When someone tell you a route to your destination you can memorize the steps in 1d sequence, left right left right or you could also imagine how you would traverse directly in a 3d scene and that is what make it different from LLM. In this case they are both correct and valid but its the underlying representation of the problem and the solution that will potentially have better fit to the problem you're trying to solve in 3d.

  • @sombh1971
    @sombh1971 3 месяца назад

    33:10 The best possible use cases are when you couple all this to VR/AR/MR. In other words, like image generation using prompts, you should be able to generate realistic virtual worlds where you should be able to immerse yourself using eyeware. And then in the long run, train robots on those virtual worlds, where you can tinker with creating really complex environments or situations that are not possible to create in the real world without causing some damage.
    Also, somewhat tangentially, while building robots for deployment in the real world, one has to equip them with pretty sophisticated self-defence capabilities, for there would be no dearth of luddites and bad actors who would want to damage them. I think this is a pretty big bugbear in this scenario, where at times the self-defence mechanism used could inflict serious harm on the perpetrators and then this Pandora's box of justice involving what is allowed or disallowed would open up, hobbling all this.

  • @bimaltwayana2058
    @bimaltwayana2058 4 месяца назад +1

    fei fei li is the best.

  • @mastwheel
    @mastwheel 4 месяца назад

    Excellent discussion!

  • @8xster8
    @8xster8 4 месяца назад +1

    Maybe as a completely beginner question, what exactly is wrong with a 1d representation of 3d space? Isn't arguably our own biological understanding of 3d reality a series of 1d synaptic connections in the brain? If it works for us, can't it work for neural nets that model us?

  • @Penaming
    @Penaming 4 месяца назад

    the best development of AI is helping humans understand how we think and learn. absorbing visual, auditory big data while awake and training on that data while sleeping/resting/meditating.

    • @Penaming
      @Penaming 4 месяца назад

      of course not forgetting quantum entangling of brain cells while you brain storm with one another's big datasets in close proximity.

    • @skierpage
      @skierpage 4 месяца назад

      @@Penaming there's little evidence our neurons rely on quantum entanglement.

    • @flickwtchr
      @flickwtchr 4 месяца назад

      huh?

  • @ceylonvc
    @ceylonvc Месяц назад

    The best! ❤

  • @Johnbrownpe
    @Johnbrownpe 3 месяца назад

    the tech behind Aliagents is super interesting, tokenized AI systems with real functionality

  • @RAPHAELMAXIAN
    @RAPHAELMAXIAN 4 месяца назад

    weather they are doing it willingly or not , they are the designers of all future weapons. They are doing a great job

  • @titusxx3
    @titusxx3 4 месяца назад

    This seems to confirm what Kant thought was the fundamental aspects of conscious subject experience: our ability to perceive things in space and time. These two aspects of experience are the basis for all other knowledge.

  • @ahmedsuliman9067
    @ahmedsuliman9067 3 месяца назад

    Thank you very much

  • @CharlesBrown-xq5ug
    @CharlesBrown-xq5ug 4 месяца назад

    Arrays of four lenses in concentric squares plus fuzzy focus processing to extract distance seems to me to be a better subsystem for 3D cameras.

  • @NilimaPradhan-us9se
    @NilimaPradhan-us9se 3 месяца назад

    Can anyone advise if choosing robotics for my kids was a good decision for their careers? During my time, there were limited resources in India, so I couldn’t pursue this path. But since AI became widely recognized as the future in 2022, I decided to enroll my kids in robotics classes. Robotics requires coding skills, so I chose Moonpreneur USA to enhance their knowledge. After attending an in-person workshop in Milpitas, my son’s skills improved.

  • @bimaltwayana2058
    @bimaltwayana2058 4 месяца назад +1

    i love you fei fei li.

  • @ANI-uv8xn
    @ANI-uv8xn 2 месяца назад

    28:26 to 29:41 completely agree

  • @minimal3734
    @minimal3734 4 месяца назад +3

    I'm not sure if the dimensionality of the data is really important. A model might be able to dedicate a few neurons to the transformation of 1-d data back into a 3-d concept.

    • @MattGarcia
      @MattGarcia 3 месяца назад

      true, just like our biological neural network transform 2d input into 3d understanding

  • @sahkoautokoulu
    @sahkoautokoulu 4 месяца назад +4

    I suspect Justin is digitally added to the video by AI. There is a degree of uncanny-valley in there, show us your fingers! 😅

  • @TaySoohee
    @TaySoohee 3 дня назад

    Back to basic, principle of from whole to part is important.same way,AI should focus from low resolution to higher resolution.For general public,no need to have higher resolution or heavy memory or data.,.......

  • @netman63
    @netman63 4 месяца назад +2

    if you want an AI to really see in 3D space, you could build a 3D generative AI that creates a 3d world and compares it to LIDAR and stereoscopic images of the real world and run a feed-back loop to approximate the generated world to the real world

    • @skierpage
      @skierpage 4 месяца назад +1

      Then your AI has the problem of deciding how well the generated world approximates to the world. We can tell because we have a good meta-model of the real world, but AIs don't yet, which seems part of the goal of World Labs. It's a lot easier ro run a feedback loop with LLMs, because they either make a good guess for the next word or they don't.

  • @davidmiles-hanschell
    @davidmiles-hanschell 2 месяца назад

    Every day is a school day; bring on the lessons!

  • @netman63
    @netman63 4 месяца назад +1

    really looking forward when AI can work in 11 dimensions (probably the number of dimensions in the multiverse according to the M-Theory)

  • @ZephyrMN
    @ZephyrMN 4 месяца назад +2

    If you could train all simple physical models in physical world, complex world would just be a scale problem then. The secret of the pre time world is geometry.

  • @davidlearnforus
    @davidlearnforus 4 месяца назад +1

    Probably most important applications of vision derived methods will be understanding biological phenomena, new drugs and materials.

  • @erik....
    @erik.... 4 месяца назад +4

    I thought I was at 1.25x but that guy's brain must just be working at a way faster pace than mine.

  • @ginogarcia8730
    @ginogarcia8730 3 месяца назад

    Imagine having Fei-Fei Li as your PhD advisor. Gaddam.

  • @joch1652
    @joch1652 4 месяца назад +1

    Listening to this is like listening to Beethovan #9. Beautiful and exiting but have no idea of its contents.

  • @paulnelson4821
    @paulnelson4821 4 месяца назад +2

    An innocent question if I may. Since spatial intelligence already recognizes the huge potential benefits of 3D modelling, why not go to 4D because we actually live in space time? Thanks for the great video.

    • @ZephyrMN
      @ZephyrMN 4 месяца назад +1

      Variance of shapes/3,d is time, or is how time manifests

    • @EE-UR
      @EE-UR 2 месяца назад

      Understanding 3D is already hard enough as is. But certainly in the distant future we will have some version of that