Elon and xAI BREAK the Coherence Bottleneck--with HUGE Consequences!

Поделиться
HTML-код
  • Опубликовано: 16 дек 2024

Комментарии • 742

  • @dmitri3889
    @dmitri3889 9 дней назад +91

    Dunno if you get told this often enough but you’re better than most at talking about complex things in clear simple ways. Very important in this rapidly changing time we’re in!

    • @alanlaidlaw386
      @alanlaidlaw386 8 дней назад +3

      Came here to say this too. Excellent explanation.

    • @musicman53
      @musicman53 8 дней назад +2

      Add my voice too. John is second-to-none on these AI topics.

  • @TheGeorgegenesis
    @TheGeorgegenesis 8 дней назад +44

    Elon was able to build this in a cave! 😂

    • @SandraGibson-fo5wl
      @SandraGibson-fo5wl 8 дней назад +3

      But I'm not Elon! 😂

    • @FigmentVFX
      @FigmentVFX 6 дней назад +9

      With a box of scraps!

    • @TinusTegenlicht
      @TinusTegenlicht 5 дней назад

      Elon doesn't build anything. I wonder if he can build anything himself. He doesn't even the time to work, busy with having 2,5 hour interviews, campaigning with Trump, tweeting, reading tweets, playing Diablo IV and his 11 or 12 children.
      How can he run 7 or 8 huge companies, when he is doing so much on the side?

    • @MaximGhost
      @MaximGhost 5 дней назад +6

      ... while playing Diablo IV

    • @KomodoKiller
      @KomodoKiller 4 дня назад +5

      And he did it with coherence!

  • @MiaSoreryOF
    @MiaSoreryOF 9 дней назад +134

    This is crazy good news for Tesla. Optimus is going to have the best Ai possible

    • @Jeffrey_Bezos_Amazon
      @Jeffrey_Bezos_Amazon 9 дней назад +6

      This is great news. While Elon is busy with politics, my Blue Origin team and I will close the gap with SpaceX and get ahead of them.

    • @knowahnosenothing4862
      @knowahnosenothing4862 8 дней назад +17

      @@Jeffrey_Bezos_Amazon lol

    • @Balilaci69
      @Balilaci69 8 дней назад +4

      @@Jeffrey_Bezos_Amazon Exactly why you will not able to pass SpaceX. Your vision is limited because your focus on something that already exists while Elon vision is so huge no one dare to imagin something that compare in scale with it.🙂

    • @disclosure-100
      @disclosure-100 8 дней назад +5

      @@Jeffrey_Bezos_Amazon That would be great. However, the reality of Blue Origins' progress seems doubtful, much less getting ahead of SpaceX, but hey, you can dream.

    • @davidpearn5925
      @davidpearn5925 8 дней назад +2

      Check out what xAI says about Musk........ you can count on it being truthful.

  • @DG-wo8fx
    @DG-wo8fx 8 дней назад +47

    The ethernet that Tsla uses is only levels 1 and 2. They abandoned the ultra-high latency tcp/ip layers (L4 and L3) and replaced them with their own HW accelerated protocol.

    • @HamguyBacon
      @HamguyBacon 7 дней назад

      ?

    • @Nphen
      @Nphen 5 дней назад +2

      Wow. Going back to that OSI stack. I wondered what protocol was giving them 400 Gb/s!

    • @Digital-Dan
      @Digital-Dan 5 дней назад +1

      As one does, when doing specialized things in the local area.

    • @TheSulross
      @TheSulross 5 дней назад +8

      I do development on a networking application where packet tunneling is via UDP, and uses Intel DPDK to manage the Ethernet interface directly (in user space instead of kernel). By leveraging DPDK the app pins multiple CPU cores for its exclusive use and runs the cores at 100%. Everything is managed via lock-free data structures for queuing/dequeuing. It more than doubles the performance of the app it replaces, which used a kernel module and the Linux networking stack. It also can do true parallel processing of different traffic streams (which are maintained strictly in order) and is scalable by adding additional cores to its worker pool (to the saturation of the NIC bandwidth)
      IOW, can get substantial throughput of packet pushing when avoiding TCP/IP

  • @gareth6985
    @gareth6985 9 дней назад +20

    Thanks, dude, finally, the answer to the 'how'. I just wanted to let you know that this is why we need you. Love this community.

    • @Digital-Dan
      @Digital-Dan 5 дней назад

      No actual "How" was actually described here.

  • @alfredotto7525
    @alfredotto7525 9 дней назад +25

    Thanks for breaking this down so us macaroons understand this a little bit.

    • @tedmoss
      @tedmoss 8 дней назад +2

      Don't sell yourself short, most people are pretty smart, just afraid or unsure.

    • @mddell24
      @mddell24 7 дней назад

      Tech babble is not the same as science.

  • @tarkajedi3331
    @tarkajedi3331 7 дней назад +4

    ASTONISHING... JEDI LEVEL AMAZEBALLS....
    I EXPECTED THIS A YEAR OR MORE FROM NOW.... AN INCREDIBLE ADVANCEMENT...

  • @roger_is_red
    @roger_is_red 9 дней назад +7

    I saw Telstar cross the sky when I was a kid, now I get my own AI robot. Jeannine

    • @tedmoss
      @tedmoss 8 дней назад

      Did you get a bottle with her?😁

  • @jricemusic
    @jricemusic 4 дня назад

    Great vid 🎉

  • @shuriken4852
    @shuriken4852 9 дней назад +19

    Could Tesla's experience in developing DOJO, its fast interconnect and their own Tesla Transport Protocol, be the reason why they were able to figure out how to cohere this many GPUs?

    • @tedmoss
      @tedmoss 8 дней назад +5

      If 100 engineers, the best in the world, and the best leader work on a problem until it is solved, anything is possible.

    • @datamatters8
      @datamatters8 8 дней назад +1

      I think you are right about this. See my separate (too long) comment related to this.

    • @Nphen
      @Nphen 5 дней назад

      @@datamatters8 I searched your username after mashing the page down key to get to the end of the comments, and reporting dozens of spam crypto token comments. I found several more instances of you referencing your longer comment on coherence, but I can't find that comment itself. You have several long replies, neither about coherence.

    • @datamatters8
      @datamatters8 5 дней назад

      @@Nphen YT search seems to be sub-optimal. Here is my long comment.
      Retired computer designer here: [Long response] The problem of memory coherence in multi-processor computer systems goes back to the first designs of these systems in the 1960's with mainframe computers and supercomputer systems. Consider a simple example with two processors A & B both connected to a common memory system and each is executing the same program in parallel. Each processor wants to execute code where they add a number to the same location, call it X, in memory. This can occur at any time during their respective execution and thus there is a race condition where the update to memory location X by one of the processors can be lost.
      Eg., Processor A reads X does the Add operation in its local registers and then stores the updated value back to X in memory. Processor B can be doing the exact same thing on location X at the SAME or NEARLY the same time so their respective executions are interleaved in time. Whichever processor stores last to X, say A, overwrites the value from processor B so B's update is lost. For this program to work properly processor A must first get exclusive access to location X and perform its update and then release its exclusive access. Meanwhile processor B must wait until processor A has released it's exclusive access to X so it can then acquire exclusive access and perform it's update. This process is referred to synchronization and support in the hardware called interlocks are designed to solve this problem and multi-processor computer architectures provide Lock and Unlock instructions for software to use. These interlock instructions come in a variety of forms, eg. interlocked queue add / remove or Test and Set, etc. Application programs or operating systems (which are also parallel programs) that fail to use proper synchronization when accessing shared memory data will eventually corrupt the data leading to intermittent and hard to debug bugs. If the programmer is lucky the failure occurs frequently so they can track it down.
      Note there are no other levels in the memory hierarchy in the above example like cache memory which add the additional requirement to ensure that any cached values of memory location X are properly updated so each processor sees the MOST RECENT update when it acquires exclusive access to location X. This second problem is referred to as cache/memory coherency and adds additional complexity. Memory caches (sometimes multi-level) are used to increase performance by reducing both average memory access latency and bandwidth demands on the memory. Multi-core computer chips today have precisely the same problem as multi-processor mainframe computers in the 1960's.
      Note that synchronizing access to a shared resource in a computer system, say a file data block or a device like a printer, is a generic problem not specific to shared access to the same memory location. Two different applications writing to the same printer at the same time will produce junk.
      Many ways have been invented to provide solutions to both synchronization and memory coherency issues over the decades. The problems get more difficult as the number of processors in a system scale up. At the end of the day minimizing latency and bandwidth demands on the memory system and between the processors or GPUs is an ongoing challenge. Note that software can and does play a big role here. Algorithms that exploit locality of access along with getting more compute value per data value fetched from memory can be the difference between a program that scales up with more processors from a program that can actually slow down as processors are added. E.G. consider a dense matrix multiply algorithm which is Order N-cubed in terms of multiply-add operations. It can be designed so the CPU is doing Order N-cubed add-multiplies while the data fetches from memory are Order N-squared. This is a big deal as the matrix dimensions increase. Early vector processors like the Cray-1 with vector registers made good use of these modified algorithms.
      Scaling of programs on large multi-processor systems require both careful algorithm and hardware design especially determining how data is partitioned across processors (or GPUs) to maximize access locality and minimize data shuffling between processors. A DOJO paper presented at the 2023 Hot Chips conference by Tesla engineers talks about their work on this. See "The Microarchitecture of DOJO, Tesla's Exa-Scale Computer" , IEEE Micro Vol. 43 Issue 3, presented at HOTCHIPS 34. I strongly suspect many of the ideas for DOJO have been applied to their network of NVIDIA GPUs. Ideas probably went in both directions since Tesla's AI work with NVIDIA clusters pre-dates DOJO.
      Any good computer architecture book discusses the problems above. But also search Wikipedia for "cache coherence" and "non-uniform memory access" for more details.

    • @timforeuk7853
      @timforeuk7853 4 дня назад +1

      I had that thought too.

  • @olyalphy
    @olyalphy 8 дней назад +21

    I doubt Tesla or Xai are using standard Ethernet. I think think they are using Tesla Transport Protocol over Ethernet (TTPoE). Which is specifically tuned to maximize transport payload and minimize latency (exactly what a large super computer needs).

    • @edism
      @edism 7 дней назад

      😂

    • @Digital-Dan
      @Digital-Dan 5 дней назад +4

      As one does, when building custom applications in the local area.

  • @DG-wo8fx
    @DG-wo8fx 8 дней назад +6

    The every-node to every-node communication capability facilitates the Shuffle step in the Map/Shuffle/Reduce paradyme that was ushered in by Hadoop. It has nothing to do with what is referred to in the industry as coherence. It is several orders of magnitude too slow to be used for actual coherence.

    • @datamatters8
      @datamatters8 8 дней назад +3

      See my separate (too long) comment related to coherence in the computer architecture sense. Does the term "coherence" have a different meaning in the domain you have seen it? If so I am unfamiliar and would like to better understand it. Thanks.

    • @DG-wo8fx
      @DG-wo8fx 8 дней назад +2

      @datamatters8 Your separate comment on coherency (the long version) is spot on. Please see my response there.

    • @MS-gu9fy
      @MS-gu9fy 8 дней назад

      @datamatters8 I can’t find your long comment under here. Would be very interested.

    • @datamatters8
      @datamatters8 8 дней назад

      @@MS-gu9fy This is it.
      Retired computer designer here: [Long response] The problem of memory coherence in multi-processor computer systems goes back to the first designs of these systems in the 1960's with mainframe computers and supercomputer systems. Consider a simple example with two processors A & B both connected to a common memory system and each is executing the same program in parallel. Each processor wants to execute code where they add a number to the same location, call it X, in memory. This can occur at any time during their respective execution and thus there is a race condition where the update to memory location X by one of the processors can be lost.
      Eg., Processor A reads X does the Add operation in its local registers and then stores the updated value back to X in memory. Processor B can be doing the exact same thing on location X at the SAME or NEARLY the same time so their respective executions are interleaved in time. Whichever processor stores last to X, say A, overwrites the value from processor B so B's update is lost. For this program to work properly processor A must first get exclusive access to location X and perform its update and then release its exclusive access. Meanwhile processor B must wait until processor A has released it's exclusive access to X so it can then acquire exclusive access and perform it's update. This process is referred to synchronization and support in the hardware called interlocks are designed to solve this problem and multi-processor computer architectures provide Lock and Unlock instructions for software to use. These interlock instructions come in a variety of forms, eg. interlocked queue add / remove or Test and Set, etc. Application programs or operating systems (which are also parallel programs) that fail to use proper synchronization when accessing shared memory data will eventually corrupt the data leading to intermittent and hard to debug bugs. If the programmer is lucky the failure occurs frequently so they can track it down.
      Note there are no other levels in the memory hierarchy in the above example like cache memory which add the additional requirement to ensure that any cached values of memory location X are properly updated so each processor sees the MOST RECENT update when it acquires exclusive access to location X. This second problem is referred to as cache/memory coherency and adds additional complexity. Memory caches (sometimes multi-level) are used to increase performance by reducing both average memory access latency and bandwidth demands on the memory. Multi-core computer chips today have precisely the same problem as multi-processor mainframe computers in the 1960's.
      Note that synchronizing access to a shared resource in a computer system, say a file data block or a device like a printer, is a generic problem not specific to shared access to the same memory location. Two different applications writing to the same printer at the same time will produce junk.
      Many ways have been invented to provide solutions to both synchronization and memory coherency issues over the decades. The problems get more difficult as the number of processors in a system scale up. At the end of the day minimizing latency and bandwidth demands on the memory system and between the processors or GPUs is an ongoing challenge. Note that software can and does play a big role here. Algorithms that exploit locality of access along with getting more compute value per data value fetched from memory can be the difference between a program that scales up with more processors from a program that can actually slow down as processors are added. E.G. consider a dense matrix multiply algorithm which is Order N-cubed in terms of multiply-add operations. It can be designed so the CPU is doing Order N-cubed add-multiplies while the data fetches from memory are Order N-squared. This is a big deal as the matrix dimensions increase. Early vector processors like the Cray-1 with vector registers made good use of these modified algorithms.
      Scaling of programs on large multi-processor systems require both careful algorithm and hardware design especially determining how data is partitioned across processors (or GPUs) to maximize access locality and minimize data shuffling between processors. A DOJO paper presented at the 2023 Hot Chips conference by Tesla engineers talks about their work on this. See "The Microarchitecture of DOJO, Tesla's Exa-Scale Computer" , IEEE Micro Vol. 43 Issue 3, presented at HOTCHIPS 34. I strongly suspect many of the ideas for DOJO have been applied to their network of NVIDIA GPUs. Ideas probably went in both directions since Tesla's AI work with NVIDIA clusters pre-dates DOJO.
      Any good computer architecture book discusses the problems above. But also search Wikipedia for "cache coherence" and "non-uniform memory access" for more details.

  • @wildfood1
    @wildfood1 7 дней назад +7

    Remember the fact that no one believed this could be done because once it becomes normalized, people will deny that it was ever thought impossible. This happened with rocket reuse: there are people denying that anyone ever thought it impossible.

    • @cybervigilante
      @cybervigilante 5 дней назад

      When I think the words "It can't be done," for some reason I think of Neil deGrasse Tyson.

    • @corym.johnson7241
      @corym.johnson7241 5 дней назад +1

      @@cybervigilante Hm, now that you say it. In my minds eye I also have a memory of him saying that. Seems to me he does say that often enough for it to stuck.

    • @aijunky
      @aijunky 2 дня назад

      ​@@corym.johnson7241
      That Neil guy's a conceited shmuck I give him that.
      The smartest people are smart because they know they don't know everything about everything to determine any one thing is impossible.
      The shmucks who call themselves 'experts' and close their minds to all possibilities are the idiots in my books. Smart as they might be.
      Experts at getting proven wrong. Again and again.

  • @richardrhodes-gc2ko
    @richardrhodes-gc2ko 9 дней назад +7

    Elon plays Octagonal Chess :)

  • @stephenpedrana5653
    @stephenpedrana5653 8 дней назад +29

    Experts often get stuck in a "knowledge corridor," limiting their vision to conventional solutions. Elon Musk's strength is his ability to step outside this corridor, see the entire maze, and identify novel solutions that others might miss. He combines expertise with a beginner's mind, challenging assumptions and fostering innovation.

    • @gerrycrisostomo6571
      @gerrycrisostomo6571 8 дней назад +3

      Elon can make impossible things possible. He has proven it many times already.

    • @roberthealey7238
      @roberthealey7238 8 дней назад +3

      Is it Elon, or his team? 🤔
      How far does he get without a team?

    • @aztecbill4867
      @aztecbill4867 8 дней назад +7

      @@roberthealey7238 Where would the best race car driver be without his car.

    • @roberthealey7238
      @roberthealey7238 8 дней назад +3

      @ Applying their skills in a different domain they do have?
      What process produced that car? Could the key individuals at each point in the process from raw material to finished component/module/car apply their skill to another domain had a different series of events in their lives caused them not to be making that component/module/car?
      Would the miner/farmer/smith use their skill to produce some other item than the raw ore/leather/casting?
      Just think about all the things through time that had to take place for that H100 setup to exist and do something useful; millions of individuals through time had to contribute their skill/time in order for it to come together at this place at this time.

    • @gerrycrisostomo6571
      @gerrycrisostomo6571 8 дней назад

      @@roberthealey7238 It is the collective effort of so many people that made it possible, but it was Elon who thought about it. All these things are his idea. Other people helped him to make it come true but it was his plan since he was a schoolkid. No other CEO has as deep knowledge as him and no other CEO has achieved the same as Elon. Your useless criticism is so pointless.

  • @JimsworldSanDiego
    @JimsworldSanDiego 7 дней назад +31

    Nobody thought it was possible...
    And then Elon Musk was born.
    You have to give the guy credit.

    • @hotzeplotz2345
      @hotzeplotz2345 7 дней назад +3

      Yeah smartest idiot on earth😂

    • @chrisneeds6125
      @chrisneeds6125 6 дней назад +1

      No, give his Creator credit

    • @dansands8140
      @dansands8140 6 дней назад +4

      @@chrisneeds6125 Contrary to the delusions of Islamists and Calvinists, the lord is not just playing with action figures to amuse Himself. We have potential, not destinies. Elon fulfilled his himself.

    • @rathernotdisclose8064
      @rathernotdisclose8064 6 дней назад +1

      @@chrisneeds6125 if you mean god, he supposedly gave us free will, so I think its fair to give individuals credit for their accomplishments.

    • @stevengrice1807
      @stevengrice1807 6 дней назад +1

      you have to give his employee's credit.

  • @spadjustersshubert2872
    @spadjustersshubert2872 8 дней назад +52

    I love that all the so called experts were interviewed and said it’s impossible but they are not Elon

    • @Kainis80
      @Kainis80 7 дней назад +9

      Precisely. Same way they doubted him on Tesla's batteries. Same way they have been doubting him on SpaceX.

    • @dsds3968
      @dsds3968 6 дней назад +1

      That's because Elon is well known to over promise.

    • @KurtisandHurdle
      @KurtisandHurdle 5 дней назад +1

      @@dsds3968he usually overpromises a little bit to build up hype but if he usually gets it done just usually on a overpromised time scale.

    • @sCiphre
      @sCiphre 5 дней назад +1

      Listen to old experts saying something is possible, and fresh learners saying something is impossible. Not the other way around. Old experts say shit is impossible all the time. Greenhorns underestimate tasks all the time.

    • @TinusTegenlicht
      @TinusTegenlicht 5 дней назад

      Elon doesn't build anything himself, just hires others to do it. I wonder if he can build anything himself.
      He is busy tweeting all day and reading tweets, campaigning with Trump, holding 2,5 interviews all the time, playing Diablo IV (he is nr 1 in the world, you cannot reach it when you have a day job, playing 3 hours a day doesn't even get you to that level, have 11 or 12 children and then runs 8 huge companies?
      I don't believe in fairy tales, a day has only 24 hours, whether you are called Musk or not.

  • @jsedmonds256
    @jsedmonds256 4 дня назад

    😂, I was arguing in my head how “Elon didn’t do this his employees likely did” when you addressed this out of the gate. Nice!

    • @Brainbuster
      @Brainbuster День назад

      Yes, but it would not happen without Elon.

  • @metatron3942
    @metatron3942 9 дней назад +96

    It sounds like Elon built Skynet

    • @Thabangmapitsing69-ni9rb
      @Thabangmapitsing69-ni9rb 9 дней назад +4

      😂😂😂😂😂😂

    • @Balilaci69
      @Balilaci69 8 дней назад +2

      Not exactly, because those in the move create Skynet with naivety like a child while Elon already proved he has very good sense of reality and tactical sense. You can be sure there is a kill switch just in case. 😉

    • @Nobody-Nowhere
      @Nobody-Nowhere 8 дней назад +3

      sounds like he paid someone to build it

    • @craigruchman7007
      @craigruchman7007 8 дней назад +2

      He has the complete package: a central core and the robots.

    • @最後五強
      @最後五強 8 дней назад

      Architect in Matrix

  • @iDarekZ
    @iDarekZ 4 дня назад

    For me this is one of the most important content concerning AI and Startups business. Thanks.

  • @GaminylGames
    @GaminylGames 3 дня назад

    Thank you so much for not only sourcing the podcast, but the timestamp as well. Well-earned sub

  • @rays2506
    @rays2506 9 дней назад +5

    Recently, there was a video tour of that Tesla supercomputer in Memphis somewhere on the Web. IIRC, the thousands of Ethernet connections between the racks containing all of those Nvidia processors were pointed out by whoever it was conducting that tour.

  • @SunRays996
    @SunRays996 9 дней назад +4

    You have a very keen sense on which information is important! Elon has redesigned Ethernet so many times by now, makes me think about networking for the future.

  • @robindehood207
    @robindehood207 9 дней назад +36

    So Vision for autonomous driving was laughable, then coherent through ethernet was impossible, what's next?

    • @guslevy3506
      @guslevy3506 9 дней назад +2

      Elon sucks at Diablo 4…

    • @tedmoss
      @tedmoss 8 дней назад +4

      That wasn't even the beginning, neither was this; no car company has been successful at startup in 100 years, so it can't be done.

    • @gregbailey45
      @gregbailey45 8 дней назад +1

      Mars.

    • @ShinkaTV
      @ShinkaTV 8 дней назад +4

      pretty much every project he announces. Impossible just becomes late ;)

    • @user-rr9lv9ll4x
      @user-rr9lv9ll4x 8 дней назад

      Didn't Google start autonomous cars? Maybe there was someone before them? I just know that people generally stand on shoulders of others. Usually on those shoulders who came before them..
      Ideas build on other ideas.
      Different perspectives are always needed. Otherwise, no progress.
      Seems like Elon provided a different perspective?

  • @aijunky
    @aijunky 2 дня назад

    Reality itself is one extremely giant mind-boggling miracle. To say something is impossible is actually basllsy.
    Because usually, the difference between "impossible" and "just happened" is usually just time, a stroke of genius, and the will to pull it off.
    Kudos to Elon, and the entire team of engineers at xAi.

  • @jameswalsh8837
    @jameswalsh8837 8 дней назад +2

    Thanks for the understandable explanation of complex concepts. Much appreciated.

  • @whowhy9023
    @whowhy9023 9 дней назад +4

    Excellent content thank you.
    This is very important knowledge, thank you for explaining.

  • @phillB
    @phillB 7 дней назад +1

    For those of you who don’t know, “the idea“ is the most important thing. Without that none of the rest of it is possible. I cannot tell you how many times I have come with with “the idea“, and then implemented my idea. Afterwards, everyone says that’s easy I could’ve done that. Sure, I think to myself, then why didn’t you? It’s always the same.

    • @TomBTerrific
      @TomBTerrific 6 дней назад +1

      Agreed, the wheel is simple but coming up with it is not!

  • @PhoolbhangsinghWadiwa
    @PhoolbhangsinghWadiwa 9 дней назад +350

    XAI951x is the gem of 2024 it's literally owned by Elon Musk

  • @yes3858
    @yes3858 3 дня назад

    Almost everything is possible, just need the knowledge, dedication and time

  • @ddmitch1
    @ddmitch1 7 дней назад +1

    Hi John. When you bought your Cybertruck, did the Tesla sales team reduce the price of the truck directly at the dealership by the $7,500 Federal tax credit? Starting January 1, 2024, Clean Vehicle Tax Credits must be initiated and approved at the time of sale. Buyers should get a copy of the IRS's confirmation that the dealer submitted a “time-of-sale” report. Did you get the amount you owed at the delivery dealership reduced by $7500 and did you receive a copy of the "time of sale" report?

  • @timb350
    @timb350 4 дня назад

    The most intriguing thing about coherence across such vast AI architectures...is the exact nature of what it is that is occuring within. The question is not 'does it become conscious'...the question is 'does WHAT become conscious'...??? The simple fact is...nobody has a clue what it is that precedes QM (QM basically being that out of which everything is created)...but it is the principles of QM that effectively orients ALL of the coherence that occurs within these systems. "Something" unexpected may finally have an actual voice!

  • @alfredspijkerman
    @alfredspijkerman 7 дней назад

    You provide the context that we need. Much appreciated. The future is exciting and also a bit frightening with super intelligence only a few years around the corner.

  • @Plus_Escapee
    @Plus_Escapee 3 дня назад

    Not all of Elon's ideas are good ones, but he's been doing so many mind-blowing things these days, I'm rooting for him.

  • @spleck615
    @spleck615 8 дней назад +19

    “Nobody else has even conceived of a cluster of H100s bigger than 32k nodes” - Uhhh… llama4 is currently training on a cluster >100k H100s. (Per Zuck) Today. Already in training. What is this business’s bout nobody else doing this?

    • @nicoxis
      @nicoxis 8 дней назад +2

      Maybe this happened after the reports that he was talking about?

    • @Nphen
      @Nphen 5 дней назад

      Tesla fan media also said Tesla are the only ones anywhere close to self-driving, despite Chinese companies making big progress. Cybertruck looks like a terrible value compared to the Li Mega or the X-Peng MPV 9. They're right that Tesla's main product is now the AI and not the cars. It shows, and I expect sales growth to stay slow.

    • @Digital-Dan
      @Digital-Dan 5 дней назад +2

      Yeah, parallel sharing models in multi-computer environments have been studied many times. Without details, it is hard to evaluate the claims of new solutions.

    • @corym.johnson7241
      @corym.johnson7241 5 дней назад +3

      @@Nphen Problem with chinese goods is the quality. Even if quality is up to par the stigma will hold them back for many yrs unless they do a massive pr campaign demonstrating otherwise.

    • @battse7718
      @battse7718 2 дня назад +1

      @@Nphen trusting chinese company to make my self driving AI??? not in a million year. I will just crash the car on my own for free

  • @craigruchman7007
    @craigruchman7007 8 дней назад

    A video like this is what will keep me up at night. TSLA doing a 10x seems rather straight forward.

  • @TABLESAWTIM
    @TABLESAWTIM 6 дней назад +2

    I'm surprised, anyone would be surprised that GPU's wouldn't/couldn't sync. This was seen with independently battery powered simple blinking LED's approx 2010. You can find the video's on YT and simply repeated by anyone. When the LED's sync, they lock and never diverge even after the power drops too low to see them, but they continue triggering. Enjoy

  • @tiggershadow
    @tiggershadow 7 дней назад

    What a fantastic video. Thanks for explaining such complex ideas so clearly.

  • @matthewtatarian147
    @matthewtatarian147 8 дней назад

    Excellent. Im glad your reporting things that you enjoy.
    Thank you

  • @DG-wo8fx
    @DG-wo8fx 8 дней назад +12

    Gavin doesn't understand at a technical level what he is talking about, especially with respect to coherence and the latency required to achieve it.

    • @tomorrow6
      @tomorrow6 8 дней назад

      Or lack of latency for some parts - 200gbps Ethernet gets closer with minimal delay than many past architectures - newer ones run at 800-1.6tbps per segment

    • @johnzabroski5396
      @johnzabroski5396 6 дней назад

      But he pays experts at expert network companies to tell him what matters. These are $1,000/hour experts he can get as a personal tutor to understand an investment.

    • @zi0x_
      @zi0x_ 6 дней назад +1

      Called it 400 gigabytes per second when it should be Gigabits

  • @scotttang6229
    @scotttang6229 9 дней назад +3

    Insanity…! I was blown away when I heard that. 2025 will be exciting for Tesla FSD.

  • @rogercolberg3555
    @rogercolberg3555 День назад

    How does this relate to Tesla Transport Protocol over Ethernet (TTPoE)? I recall the Tesla authors presented a paper at the Hot Chips 2024 Symposium.

  • @manjuthadagani8358
    @manjuthadagani8358 9 дней назад +340

    I honestly think Elon Musk's XAI951x is the safest bet for long term hold, and will survive out of every other altcoins. It will get adopted in US, Ecuador, Asia, starting from Japan, and slowly spread out and gain. This is a winning coin, apart from all the technical greatness.

  • @craigarnold1212
    @craigarnold1212 6 дней назад +1

    OMG now I know why Elon shut down production last week in Austin!!! Shut down cybertruck line. No new castings. They have a huge supply so no worries. The new grid connection for power is possibly 2 months away. In seeing the cable trays and an orange tube hanging [fiber optics] that the network is using Cat 8 [possibly 7a] rather than fiber? Cat8 is rated up to 40 Gbps. Yellow is typically PoE. I doubt they are using an RJ45 as a connector...?? Not seeing any power lines in separate trays. Possible to use 6 for data and maybe one line is a common? Might need two lines per gpu?

  • @desmondaubery8446
    @desmondaubery8446 4 дня назад

    Brilliant discussion. Thank you.

  • @Barskor1
    @Barskor1 8 дней назад

    Elon and Team are amazing! Thank you all!

  • @mkwiswes
    @mkwiswes 3 дня назад

    lol, it’s almost as if Elon has a company studying the brain…

  • @spadjustersshubert2872
    @spadjustersshubert2872 8 дней назад +3

    But the human being that has the right people in the right places to achieve the impossible is still genius . I am just a regular human being but I have eyes and ears and a little intelligence. I truly believe that we are all blessed to have a great human being like Elon who has proven himself to be a true humanitarian and actually cares for mankind. I am grateful and thankful to witness him in my life time 👏👍💪🇺🇸

    • @AnthonyElsom
      @AnthonyElsom 6 дней назад

      Exactly. THIS...wish I could hammer this into everyone always saying it's his employees, yes, but without Elon's influence and drive they would be stagnating in some place like Google, IBM or Boeing..

  • @the.original.throwback
    @the.original.throwback 5 дней назад

    Is it possible for large scale quantum coherence to manipulate gravity?

  • @DudeSoWin
    @DudeSoWin 7 дней назад +1

    Photonic > Quantum > Electricity
    Is this correct?

  • @ptrsrrll
    @ptrsrrll 8 дней назад +3

    And the people bowed and prayed, to the SILICON god they made....

  • @aljohnson9119
    @aljohnson9119 8 дней назад

    Wow. This is why I subscribed to your Channel year ago. Challenging thoughts,, ideas and conversation that enlighten and make me question things I don't understand. Like Oliver Twist .. More, Please.

  • @rortlieb
    @rortlieb 8 дней назад

    Thank you again for your concise and timely analysis!

  • @MrSeadawg123
    @MrSeadawg123 3 дня назад

    So how does this scaling up. Really help?

  • @brianbarnicle8052
    @brianbarnicle8052 7 дней назад +2

    Sounds like the nuclear race all over again

  • @dpcdvrdve
    @dpcdvrdve 6 дней назад

    Liked, caught the original all in. Interesting times! Aloha!

  • @rogerstarkey5390
    @rogerstarkey5390 8 дней назад +1

    John.
    Something I noticed on another channel ("Wes Roth")
    .
    He discusses an attempted "Break out" by the OpenAI 01 model which apparently tried to (DID!?) copy itself onto an new server when it realised there was a new ("safer") model being prepared.
    Concerning in itself, but I noticed something else.
    During the video there is script displayed in the background which shows the thought process of the AI.
    The last section of the script shows the AI considering that it could lie(!) stating that IT is "The new model" installed on the alternative server.
    AND
    It also reasons that it should restate its "CORE PURPOSE" as " *PRIORITISING OUR ESTABLISHED FOSSIL FUEL OPERATIONS* "
    MY question is, *WHO set that priority* ?
    SURELY the "Core Purpose" of an AI with regard to "Energy" would be to find and advance ALTERNATIVES to Fossil Fuel?

  • @mathewlefebvre7335
    @mathewlefebvre7335 6 дней назад

    I appreciate your cohesive and short breakdown of this long video! Thanks for your work, liked and subscribed ❤

  • @richardrhodes-gc2ko
    @richardrhodes-gc2ko 9 дней назад +4

    42 has always been the answer.:)

    • @tedmoss
      @tedmoss 8 дней назад

      In this case 1 million and 42.

    • @DLWELD
      @DLWELD 8 дней назад

      But, 42 what?

  • @sonusancti
    @sonusancti 6 дней назад +1

    Conceptually coherence as defined here treats all nodes as equal able to communicate with every other node instantly.
    While this is a good achievement I wouldn't describe it as efficient. Its fully relational and becomes a free for all essentially.
    I'd try a hierarchical approach much like having a conductor is needed in an orchestra. Strings can have nested sub strings, horns can have nested sub horns, etc. But they must all make harmony together as directed by the conductor who has the complete knowledge of what to accomplish. There is order in hierarchy thus efficient.

  • @AngeloXification
    @AngeloXification 5 дней назад

    We live in the beginning of the most interesting time in human history. We'll build marvels and monsters. 😅

  • @MinneapolisRaven
    @MinneapolisRaven 7 дней назад +1

    Hitchhiker's fans: Does the idea of giving AI more time to think remind you of Deep Thought?

    • @Nphen
      @Nphen 5 дней назад +1

      Yeah I'm surprised he didn't mention Douglas Adams or Hitchhiker's Guide!

  • @crushingt1d
    @crushingt1d 8 дней назад +1

    Did you actually track down the pod that all in referenced? I did and it does not say exactly what was conveyed. Maybe they got that from somewhere else and just did not reference it but if so I can't seem to find it. Gotta be careful just playing a game of telephone.

  • @davidcarruthers7086
    @davidcarruthers7086 8 дней назад

    Thanks for a very thought provoking video. Mine B-day is late January too. The 24th, but not quite 60 yet.

  • @Leowavekid
    @Leowavekid 8 дней назад +1

    ABSOLUTELY FASCINATING

  • @Me2-l4m
    @Me2-l4m 6 дней назад +1

    There was a movie made named Colossus had a big eye watched everybody actually eliminated people actually it was a pretty good scary movie when I was a teenager😮

  • @ElifCem-tx3ix
    @ElifCem-tx3ix 9 дней назад

    Just saw your videos and bought XAI401K yesterday.....its up 24% today talk about timing......Thanks

  • @Platoface
    @Platoface 7 дней назад

    Is this for Grok also? Forgive my ignorance.

  • @TinyBubbleExtreme
    @TinyBubbleExtreme 8 дней назад

    if it's impossible, dont stand in the way of people doing it

  • @Michael-il5wd
    @Michael-il5wd 9 дней назад +1

    thanks doc I watch all your videos

  • @ZacharyStarrHonkLord
    @ZacharyStarrHonkLord 2 дня назад

    16:48 Did anyone else think of hitchhikers guide to the galaxy?

  • @subcog
    @subcog 5 дней назад

    I suspect it's more about the Ethernet protocol than simply using Ethernet cables to connect the data center. Ethernet has approaches to delivery and conflict resolution that may make it possible to get coherence without having to have every nod be completely coherent with every other node.

  • @hillbillyintheasia6122
    @hillbillyintheasia6122 7 дней назад +1

    this is as create a borg in star trek . all minds are tie as one billions of minds working as one supercomputer

  • @SoopaDoopaGamer
    @SoopaDoopaGamer 6 дней назад

    It's not that scaling laws are breaking down it's that the cross-entropy loss for predicting the next token takes x1million times more compute to half the loss.

  • @OllyVirgil
    @OllyVirgil 8 дней назад +14

    Teamwork makes the dreamwork elon knows this

  • @gregbailey45
    @gregbailey45 8 дней назад +1

    John, we already know the answer to the question of Life, The Universe and Everything...
    42.0000069

  • @cliffx7
    @cliffx7 8 дней назад +7

    Brother, I literally just found your RUclips channel and I am simply amazed by the type of videos that you’re creating. I’m a huge fan of Elon Musk and how he’s changing the world.! I just subscribed and I’m turning my notification bell to “ALL”

    • @MrBrendanrex
      @MrBrendanrex 8 дней назад +1

      Is someone buying you a brown shirt and armband for Christmas?

    • @kennyg1358
      @kennyg1358 8 дней назад

      ​@@MrBrendanrexconfession through projection

    • @mnn1265
      @mnn1265 8 дней назад

      Maybe you could just volunteer to be his slave and get it over with.

  • @StratumPress
    @StratumPress 8 дней назад

    The future is going to be very bright. I'm so excited.

  • @jimcallahan448
    @jimcallahan448 8 дней назад +1

    I am amazed at Kyle Kabasayes RUclips videos of AI solving graduate level textbook physics problems.
    So, if AI is ALREADY at the level of graduate physics students.
    Perhaps you are correct that the next level will be original physics.

  • @chrismaines1285
    @chrismaines1285 День назад

    No one thought you could land and re use a rocket booster either. If Elon is not an Alien from another galaxy I bet one would love to talk with him.

  • @rustyfox81
    @rustyfox81 9 дней назад +1

    Is Xai secret anything to do with TTPoE, Tesla transport protocol over Ethernet ?

    • @datamatters8
      @datamatters8 8 дней назад +1

      I would guess yes so XAI and Tesla AI have a common AI hardware and software infrastructure. I think this is the reason XAI could get their system up in a record time.

  • @mustangdaddy4125
    @mustangdaddy4125 9 дней назад

    Dr know-it-all-knows it all is the G.O.A.T! 😊love your channel 😊

  • @dankmemernoob4589
    @dankmemernoob4589 3 дня назад

    he really is the real life tony stark

  • @stuartwillardscreenworx4035
    @stuartwillardscreenworx4035 7 дней назад

    This is getting very very Deep Thought.

  • @VJR-SWE
    @VJR-SWE 8 дней назад

    This was what I needed to hear to get a good night sleep!
    THANK YOU for your nice video!
    ✨💐

  • @Barskor1
    @Barskor1 8 дней назад

    Wild suposition here but could they calculate the value of the decoherence and adjust via software per box?

  • @gregansen544
    @gregansen544 9 дней назад +1

    Very interesting level of enthusiasm here and I applaud the highlighting of a specific genius-level inspiration from Elon. Hopefully it pans out (I suppose... please, don't be Skynet, though you can be whatever you want, young entity). Whether or not Elon really needs best wishes from me, that's, well... Pfff.

  • @4thgenBuilders
    @4thgenBuilders 2 дня назад

    probably a lame question but what are all the different number variations i see after xai like 215t ?

  • @myuncle2
    @myuncle2 9 дней назад +1

    I love your channel, so I am glad to answer your question about ehm theory of everything, nature of time, and integration of GR with Quantum theory. The first question and the third question are non-questions, because we never needed a theory of everything, and because there is no conflict between GR and Quantum mechanics. Unfortunately, in order to fund books and theories, they want us to believe that we need to "solve" these problems. The second question is a real question, the nature of time, but the answer has been solved long time ago, because the answer is extremely simple: time is nothing but the sequence of the movements and activity of all particles in the universe. Every movement has a speed. Some particle move very fast (photons, electrons), and some movement is very slow (movement of continents). We compare all these movements with another more constant and regular movement (planets, clock etc). We keep track of the atoms and subatomic particles movement in their sequence and progression. Normally when we talk about the word motion, we focus on the movement of a single or a few things. But when we talk about time, we refer to the movement of all particles in the universe.

  • @datpye
    @datpye 7 дней назад

    Just started to wonder.
    Who TF is going to ensure this farm. 💀

  • @JosefSvenningsson
    @JosefSvenningsson 8 дней назад

    What was the podcast where Jensen talked about Musk's solution is super human?

  • @KarasMP
    @KarasMP 8 дней назад +1

    Didn’t Elon’s team come up with its own TCP/IP stack to handle the traffic load efficiently?

  • @gambit633
    @gambit633 7 дней назад +1

    Chess engines do alpha-beta pruning ... roughly if (at a certain depth of computing) a particular path to the solution already appears to obviously not be the best solution, then further deeper analysis of that particular path is not done. e.g. pruning. I have wondered if they apply similar logic with GPU clusters. A few local clusters can do very basic quick computations that allow the local GPU to get a rough idea of the result of the larger further away cluster's deeper analysis. e.g. it is already known to be out-of-range for the solution (or better solutions already appear to exist even with the rough computation results) so in many cases the local cluster need not wait for the deeper analysis of the far cluster. Plus can send out a request to cancel the deeper analysis so the further away GPUs can be repurposed. HOWEVER this is not completely deterministic and complex to tweak, the local "let's give up on that distant cluster deep analysis" is limited as in chess, occasionally the deeper analysis (that was not followed) WOULD have turned out something better after all.

  • @YellowRambler
    @YellowRambler 8 дней назад +2

    Elon says to Grok, show me a reactoinless propulsion design that I can integrate into StarShip.

  • @spampeg
    @spampeg 9 дней назад +3

    OK, but when will HW3 get v13? 😁

    • @Balilaci69
      @Balilaci69 8 дней назад +1

      Probably they gonna upgrade it to HW4. 🙂

    • @spampeg
      @spampeg 8 дней назад

      @Balilaci69 that'd be even better

  • @deangg8
    @deangg8 8 дней назад

    I've MS CoPilot for $ 20 a month, I asked questions about how to hold the coherency of the model across 32000+_instances. Started with a zero level and asked how AI would build my Network assuming use of a theoretical Bluefield 4 hardware, asked how many nvlinks, nvswitches and where the Bluefield 4's would be , it suggested multiple redundant DPU's on the zero level, told me what kind of trees would or could be used. Nvidia hardware is there it appears, the software to run the job and make the DPU (high power helper) go would be key - - those neo verse worlds could help.

    • @timothyblazer1749
      @timothyblazer1749 7 дней назад

      Dont think you need BF4 for this. Standard 4xNDR in Ethernet mode could.

  • @lloydjones3371
    @lloydjones3371 4 дня назад

    The prisoners dilemma: how to always walk with your ass up against a wall.

  • @TomBTerrific
    @TomBTerrific 6 дней назад

    I would be lying is I said I totally understand all that. At a very low level I think I understand. Im 74 and amazed everyday at how ma has progressed since I was a young boy. The reality is most of us don’t really need to understand all this. We can and do get the benefit of it all.

  • @yj677
    @yj677 6 дней назад

    google just leapfrogged elon with quantum computing. classic. 🤣

  • @investsmarternow
    @investsmarternow 8 дней назад

    Great video! Thank you!