this is the fastest AI chip in the world: Groq explained

  • Published: 21 Feb 2024
  • Try Groq: groq.com/
    SimTheory with Groq: simtheory.ai
    In this video I explain what Groq is, demonstrate low latency with Groq and show how incredibly fast Groq is. Groq will open up whole new possibilities for AI with 25x faster speeds and 20x less cost. I'm excited!
    plz sub, like, comment etc. xox

Comments • 37

  • @RobertFletcherOBE
    @RobertFletcherOBE 19 days ago +1

    1:37 That "Silicon Wafer" looks suspiciously like a circle of corrugated cardboard covered in tinfoil...

  • @sophiekandul6737
    @sophiekandul6737 3 months ago +10

    Groq was before Grok

  • @kylebehrend
    @kylebehrend 3 months ago +6

    Quick is an understatement. I wonder if we'll get to a point where the systems actually record our inputs before we press enter, e.g. they work in the background as we type, so that when we press enter the response may be instantaneous?

  • @baheth3elmy16
    @baheth3elmy16 3 months ago +8

    Thanks for the video! So is Groq going to be sold as a chip sometime? Like, will it be installed on a motherboard like a GPU or RAM or NVMe?

    • @razoraz
      @razoraz 2 months ago

      I was thinking about this also - maybe as an add-on PCIe card, for the crowd that believes that, like "not your keys, not your coins", it's "not your cloud, not your data". I could see Apple as a candidate for buying them to turn their already-fast Neural Engine on the motherboard into something that puts OpenAI out of business.

  • @jesuswithragdoll
    @jesuswithragdoll 3 months ago +4

    The current Groq configuration comes with limited SRAM storage, so it would not be sufficient for training, only for inference. It's comparable to Tesla's Dojo but with less computing power. Potential buyers would have to ponder the limitations before making a purchase decision, though: the layers of KV cache and the batch size would make it suitable only for medium-size computation. Not to mention that the complexity of the software should be taken into consideration.

    • @waterbot
      @waterbot 3 months ago +1

      I feel like this first chip is more a proof of concept or a marketing effort from Groq than anything else; this video is a perfect example of that.

    • @morethisdayinai
      @morethisdayinai 2 months ago

      gotta make stuff for us hype boiz

    • @QH96
      @QH96 1 month ago

      Hopefully their next chip fixes these shortcomings.

  • @remelin75
    @remelin75 3 months ago

    Really good video. Looks very professional and the information was very easily understood. Great work!

  • @dearlove88
    @dearlove88 3 months ago +4

    It’s crazy though that it will cost about $2 million worth of cards to run a 70B model.

  • @P__114
    @P__114 3 months ago +10

    The branding confusion with Grok is going to be tough. One needs a rename.

    • @undergroundxp
      @undergroundxp 3 months ago

      Nah, it's funnier this way.

    • @razoraz
      @razoraz 2 months ago +1

      @undergroundxp What would be even funnier, and differentiating, is if they decided to tell everyone to pronounce this one "groque", like "baroque" 😅

  • @UselessHumanBeing
    @UselessHumanBeing 3 months ago +3

    Obligatory comment. Hope to keep seeing more videos from your channel 🙏

  • @byrnemeister2008
    @byrnemeister2008 3 months ago +1

    So there is a one-off gain in putting the algorithm / transformer into the chip. In a perfect world that one-off gain should be in the range of 50 to 100x. But this costs a lot of money and is a big gamble. If that algorithm is tweaked then you need new chips. So it’s a balance between flexibility and the resulting risk and cost. All the major service providers and semi companies are looking at this. Groq have a time-to-market advantage but it’s not going to be that big: 6-12 months.
    Big message. Short Nvidia stock.

  • @backacheache
    @backacheache 3 months ago

    I wonder if it is better with energy efficiency too?

  • @iritesh
    @iritesh 3 months ago +1

    Didn't explain how it works but 👍

  • @codediporpal
    @codediporpal 3 months ago

    I imagine Nvidia will have an inference-only LPU offering soon.

  • @frankforrester42
    @frankforrester42 3 days ago

    It costs the price of a new car.

  • @user-ni3ti9cz5f
    @user-ni3ti9cz5f 2 months ago

    Good content but I want to know who I’m getting my info from. Are you an engineer?

  • @toulasantha
    @toulasantha 3 months ago

    Chamath is the Goat ❤

  • @ngothianhquy7561
    @ngothianhquy7561 1 month ago

    Does anyone know how to invest in this company?

  • @g0d182
    @g0d182 3 months ago

    Barely any difference between the GPT-3.5 and Groq response times at 0:20 and 1:00.

  • @johnpope1473
    @johnpope1473 3 months ago

    (Can you guys stop mentioning SimTheory on the podcast? I've been on the waitlist for 4 months. I hit the page each week in hot anticipation, only to be disappointed that things are still closed... sigh... Fix this please.)

    • @morethisdayinai
      @morethisdayinai 3 months ago

      Join the discord and send us a message with your email and we can let you in straight away: discord.gg/Rf7v6SZB

  • @WJ1043
    @WJ1043 1 month ago +1

    Groq was never explained. You just quoted the obvious time and time again. The title is just click bait.

  • @Veneer22
    @Veneer22 3 months ago

    Damn bro! Can see all the whites of your eyes! You on that high end addy or what?

  • @edwincloudusa
    @edwincloudusa 3 months ago

    Why would you have that annoying background music that makes it impossible to listen to what you are saying? I don't get it...

  • @richard_d_bird
    @richard_d_bird 2 months ago

    I can relate to the difficulty of getting a wash for one's pig.