Which GPUs are best for running AI models | Lex Fridman Podcast

Поделиться
HTML-код
  • Опубликовано: 10 фев 2025
  • Lex Fridman Podcast full episode: • DeepSeek, China, OpenA...
    Thank you for listening ❤ Check out our sponsors: lexfridman.com...
    See below for guest bio, links, and to give feedback, submit questions, contact Lex, etc.
    GUEST BIO:
    Dylan Patel is the founder of SemiAnalysis, a research & analysis company specializing in semiconductors, GPUs, CPUs, and AI hardware. Nathan Lambert is a research scientist at the Allen Institute for AI (Ai2) and the author of a blog on AI called Interconnects.
    CONTACT LEX:
    Feedback - give feedback to Lex: lexfridman.com...
    AMA - submit questions, videos or call-in: lexfridman.com...
    Hiring - join our team: lexfridman.com...
    Other - other ways to get in touch: lexfridman.com...
    EPISODE LINKS:
    Dylan's X: x.com/dylan522p
    SemiAnalysis: semianalysis.com/
    Nathan's X: x.com/natolambert
    Nathan's Blog: www.interconne...
    Nathan's Podcast: www.interconne...
    Nathan's Website: www.natolamber...
    Nathan's RUclips: / @natolambert
    Nathan's Book: rlhfbook.com/
    SPONSORS:
    To support this podcast, check out our sponsors & get discounts:
    Invideo AI: AI video generator.
    Go to lexfridman.com...
    GitHub: Developer platform and AI code editor.
    Go to lexfridman.com...
    Shopify: Sell stuff online.
    Go to lexfridman.com...
    NetSuite: Business management software.
    Go to lexfridman.com...
    AG1: All-in-one daily nutrition drinks.
    Go to lexfridman.com...
    PODCAST LINKS:
    Podcast Website: lexfridman.com...
    Apple Podcasts: apple.co/2lwqZIr
    Spotify: spoti.fi/2nEwCF8
    RSS: lexfridman.com...
    Podcast Playlist: • Lex Fridman Podcast
    Clips Channel: / lexclips
    SOCIAL LINKS:
    X: x.com/lexfridman
    Instagram: / lexfridman
    TikTok: / lexfridman
    LinkedIn: / lexfridman
    Facebook: / lexfridman
    Patreon: / lexfridman
    Telegram: t.me/lexfridman
    Reddit: / lexfridman

Комментарии • 73

  • @LexClips
    @LexClips  6 дней назад +4

    Lex Fridman Podcast full episode: ruclips.net/video/_1f-o0nqpEI/видео.html
    Thank you for listening ❤ Check out our sponsors: lexfridman.com/sponsors/cv8484-sa
    See below for guest bio, links, and to give feedback, submit questions, contact Lex, etc.
    *GUEST BIO:*
    Dylan Patel is the founder of SemiAnalysis, a research & analysis company specializing in semiconductors, GPUs, CPUs, and AI hardware. Nathan Lambert is a research scientist at the Allen Institute for AI (Ai2) and the author of a blog on AI called Interconnects.
    *CONTACT LEX:*
    *Feedback* - give feedback to Lex: lexfridman.com/survey
    *AMA* - submit questions, videos or call-in: lexfridman.com/ama
    *Hiring* - join our team: lexfridman.com/hiring
    *Other* - other ways to get in touch: lexfridman.com/contact
    *EPISODE LINKS:*
    Dylan's X: x.com/dylan522p
    SemiAnalysis: semianalysis.com/
    Nathan's X: x.com/natolambert
    Nathan's Blog: www.interconnects.ai/
    Nathan's Podcast: www.interconnects.ai/podcast
    Nathan's Website: www.natolambert.com/
    Nathan's RUclips: youtube.com/@natolambert
    Nathan's Book: rlhfbook.com/
    *SPONSORS:*
    To support this podcast, check out our sponsors & get discounts:
    *Invideo AI:* AI video generator.
    Go to lexfridman.com/s/invideoai-cv8484-sa
    *GitHub:* Developer platform and AI code editor.
    Go to lexfridman.com/s/github-cv8484-sa
    *Shopify:* Sell stuff online.
    Go to lexfridman.com/s/shopify-cv8484-sa
    *NetSuite:* Business management software.
    Go to lexfridman.com/s/netsuite-cv8484-sa
    *AG1:* All-in-one daily nutrition drinks.
    Go to lexfridman.com/s/ag1-cv8484-sa
    *PODCAST LINKS:*
    - Podcast Website: lexfridman.com/podcast
    - Apple Podcasts: apple.co/2lwqZIr
    - Spotify: spoti.fi/2nEwCF8
    - RSS: lexfridman.com/feed/podcast/
    - Podcast Playlist: ruclips.net/p/PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4
    - Clips Channel: ruclips.net/user/lexclips
    *SOCIAL LINKS:*
    - X: x.com/lexfridman
    - Instagram: instagram.com/lexfridman
    - TikTok: tiktok.com/@lexfridman
    - LinkedIn: linkedin.com/in/lexfridman
    - Facebook: facebook.com/lexfridman
    - Patreon: patreon.com/lexfridman
    - Telegram: t.me/lexfridman
    - Reddit: reddit.com/r/lexfridman

    • @mikestewart4752
      @mikestewart4752 День назад +1

      “We should continue to catch ‘tigers’ as well as ‘flies’ when dealing with cases of leading officials in violation of Party discipline and state laws as well as misconduct and corruption problems that directly affect the people’s interests. All are equal before the law and Party discipline; whoever is involved in a corruption case must be thoroughly and impartially investigated.”
      The results, after 12 years of Xi’s anti-corruption campaign?
      “Corruption is RAMPANT in China!”
      -Victor Gao, Al Jazeera, August 2024 in front of a live international audience.
      The land of arbitrary law enforcement™️.
      DeepSham™️

  • @minimal2224
    @minimal2224 5 дней назад +19

    I think even Lex was freaking lost lol these guys are next level and the piggyback off each other flawlessly

    • @manonamission2000
      @manonamission2000 3 дня назад

      too niche, too hyperspecialized... not always a good thing

  • @BojanKvakic3
    @BojanKvakic3 4 дня назад +43

    I swear, nifalixo Money's Untold Mysteries is one of the best books I’ve read. It’s life-changing.

  • @JonathanJollimore-w9v
    @JonathanJollimore-w9v 5 дней назад +27

    Necessity is the mother of invention the fact that they were keeping the best tech from the Chinese made them have too find work around and come up with better software.

    • @grospipo20
      @grospipo20 5 дней назад +4

      What deepseek is showing is how us foreign policies have gotten lazy

    • @lotyogipityu7992
      @lotyogipityu7992 5 дней назад +5

      they used Ai to build AI. Nothing is "better". Now they need more hardware or they get flooded by 10 k users.

    • @rozburg
      @rozburg 5 дней назад +2

      From what I've read from Altman and Dario. They don't think code efficiency to skirt around sanctions gets you to AGI/ASI. They think HORSEPOWER(scaling) gets to the finish line.
      So that's why the US policy is there. Horsepower limiter.

    • @minimal2224
      @minimal2224 5 дней назад +1

      Which is also eye opening due to the fact everyone goes off the notion ‘ China makes everything’. They worked with what they had

    • @nillieable
      @nillieable 5 дней назад

      ​@@lotyogipityu7992 Yeah, I wonder if they have smart people like you in their company!!!

  • @plantsir9173
    @plantsir9173 5 дней назад +6

    I understand 1% of what they are saying but still love 100% of the conversation 😂👍🏽

    • @timothybrown5741
      @timothybrown5741 5 дней назад

      This guy intelligence is on another level.

    • @karlw2798
      @karlw2798 4 дня назад

      Beautifully said 😂 truly next level

  • @AKracecars
    @AKracecars 5 дней назад +16

    Right? Right.

    • @Hmmmmmmnm
      @Hmmmmmmnm 5 дней назад

      Left

    • @minimal2224
      @minimal2224 5 дней назад

      Yeah definitely a reason I’m in the Navy.

  • @myuzakitheglitcher
    @myuzakitheglitcher День назад

    There's a small mistake in the explanation on KV-caching. The size of the KV-cache is linear with respect to sequence length, not quadratic. The total attention computation is indeed quadratic wrt seq_len, but the KV-cache itself is just a cache of the all Key and Value projections which there is one projection per layer per head per token.

  • @timothybrown5741
    @timothybrown5741 5 дней назад +3

    These two are one of few that actually understands ai and models work.. I would say about 10% ai experts actually understand how it all works..

  • @jeffshackleford3152
    @jeffshackleford3152 11 часов назад

    You did not mention the exchange rates and electricity cost differential.

  • @leedsdrumacademy
    @leedsdrumacademy 5 дней назад +4

    Right?

  • @eastern2western
    @eastern2western 5 дней назад +6

    The sad part is all of the law makers have no idea what they are talking about.

    • @lettermanstud
      @lettermanstud 5 дней назад

      Don't worry musk is there

    • @Suhov
      @Suhov 4 дня назад

      They do. They do what guys who buy a lot of GPUs tell them.

  • @MURSIXX
    @MURSIXX 2 дня назад

    One three just overflowed the conversation.. 😅

  • @rickharold7884
    @rickharold7884 4 дня назад

    Super interesting

  • @daveharper1958
    @daveharper1958 5 дней назад

    What episode of Silicon Valley is this??

  • @wesley-u8u
    @wesley-u8u 5 дней назад +5

    Deepseek can be trained on Huawei GPU too

    • @carkawalakhatulistiwa
      @carkawalakhatulistiwa 5 дней назад +4

      Huawei can only make 7nm. While Nvidia 3nm

    • @Al-ng2wn
      @Al-ng2wn 5 дней назад +1

      @@carkawalakhatulistiwa The performance gain from 7nm to 3nm is considered "small" because as transistors shrink to extremely small sizes, the potential for significant performance improvements diminishes due to physical limitations, increased heat generation, and the complexity of manufacturing such tiny components, leading to a phenomenon called "diminishing returns" in miniaturization; meaning each further reduction in size provides proportionally less performance benefit.

    • @luluw9699
      @luluw9699 3 дня назад

      @@carkawalakhatulistiwa Training AI models doesn't solely depend upon sizes of chips. Depends upon FLOPS, CUDA memory, cores, and bandwidth. What chinese devs did is wrote highly efficient low level code to tackle shortage of Nvidia GPUs. Hence, cost and memory efficient.

    • @jamesgornall5731
      @jamesgornall5731 2 дня назад

      ​@@carkawalakhatulistiwaTSMC are the foundry, not Nvidia.

  • @The7thSine
    @The7thSine 5 дней назад

    Whats up Lex..thanks for feeding my mind!

  • @qbitsday3438
    @qbitsday3438 5 дней назад

    CEREBRAS AI Chip is best 1600 tokens /Sec

  • @scotchandstocks
    @scotchandstocks 4 дня назад

    The question that everyone wants to know. Should i buy nvdia 😅

  • @my2cents395
    @my2cents395 5 дней назад +3

    Where is the electricity for these Chinese AI data centers coming from?

    • @michaellewis8
      @michaellewis8 5 дней назад +6

      power plants

    • @joseph24gt
      @joseph24gt 5 дней назад +4

      electricity in china is much cheaper than in the usa.

    • @primalentity9824
      @primalentity9824 5 дней назад +3

      Coal

    • @appa609
      @appa609 5 дней назад +4

      Coal, solar, hydro, wind, nuke, gas...
      They do it all

    • @manonamission2000
      @manonamission2000 3 дня назад +2

      humans running on treadmills? 🤷🏻‍♂️

  • @WojtekKozlowski1234
    @WojtekKozlowski1234 5 дней назад

    Unfortunately Lex is kind of slurring words, guests speak much clearer

  • @borisakelovic9930
    @borisakelovic9930 14 часов назад

    GPU RTX 3090 4090 5090 architecture ruclips.net/video/U4TcPPf5vaE/видео.htmlsi=jelKVOd4h1HoJAjD

  • @kompassorpigo7600
    @kompassorpigo7600 День назад +1

    "Right? Right? Right?"
    Mofo if you're so unsure about what you're saying maybe don't say it.

  • @timothybrown5741
    @timothybrown5741 5 дней назад

    I think I should sell my deepseek account. I have two accounts

  • @Skualo-77
    @Skualo-77 5 дней назад

    Nvidia

  • @pspicer777
    @pspicer777 5 дней назад

    I think this guy is Italian.

  • @baejisoozy
    @baejisoozy 2 дня назад

    Bro needs to stop saying "right" like 5 times for every sentence. Yeesh.

  • @treeman_mj
    @treeman_mj 3 дня назад

    Right.
    ruclips.net/video/_Alq_xh7rAk/видео.html

  • @joecaves6235
    @joecaves6235 3 дня назад

    Safety ? AI doesn't generate sticks and stones and doesn't do migrant farm work so IDGAF about safety. Safety from some words it generates that I'm probably not gonna read anyway is kinda funny. 🙄🤪😜

  • @asdsad17
    @asdsad17 5 дней назад +3

    ban nvidia.

  • @nagendra3610
    @nagendra3610 4 дня назад

    banger

  • @mikechannel5026
    @mikechannel5026 5 дней назад +1

    total chaos. Can you have someone that knows how to explain. This is total ego trip or they don't know what they talking about.

    • @Vaquero_69
      @Vaquero_69 4 дня назад +2

      Seems more likely YOU don’t know what they’re talking about

    • @mikechannel5026
      @mikechannel5026 4 дня назад

      @@Vaquero_69 exactly and I'm working in this field. That's a problem

    • @WeLoveWave
      @WeLoveWave 4 дня назад

      @@mikechannel5026 That's a big problem for you. Why the negative comment about them? Focus on you lol.

    • @mikechannel5026
      @mikechannel5026 4 дня назад +1

      @ more then 50% of the talk is world salat 5 h could be shortened to 2 very easy. But enjoy focusing on yourself.😂

    • @WeLoveWave
      @WeLoveWave 4 дня назад

      @@mikechannel5026👍

  • @Learntsomethingtoday
    @Learntsomethingtoday 3 дня назад

    The prices of r1 api changed drastically today, the information in this podcast is already misinformation, @lexfridman, maybe add an overlay with a warning ?