Just the reason that nvidia squez the moneyz from gamer. They now can gift thier new CPUs, the gamer allrdy payd for the development and research. Btw. HP Moonshot was a fail. I see here no diff, just a bunch of Desktop GPUs crunchd in a 14" Laptop Blade. But it is AI man!!!11!1!!, the next hype hardware for failing startups.
Can confirm, I went to one of these conventions and offered $1000 for one of their processors. Their answer? "It's not for sale". Snooze you loose Nvidia, thanks for the freebie
@@justinbiggs1005 scary times indeed for the PC cucked race, hold up ya'll are now gonna get cucked the other way around Nvidia proc and intel gpu??? Damn the PC peasant race keep taking L's.
I'm not surprised they let you take it apart. 1.5 million views in less than 24 hours is more coverage then this would get anywhere. I love these types of videos.
Yeah duh. This whole video is full of shill. Do people actually think this isn’t paid for by Gigabyte or Nvidia. It might aswell be a marketing video for them
@@Pr0f3_YT at least through Linus we can’t done transparency from these big tech companies. We actually get to see up and coming tech and Linus explains it’s use cases etc to us normies.
For reference on the name: Grace Hopper was the US Navy computer scientist who wrote some of the earliest theory on machine-independent programming languages and is credited for writing the first compiler, two incredibly important steps towards modern computing.
She was also the first person to coin the term 'bug' in computer sciences because she found an actual bug in one of their systems and then taped it into the maintenance log book.
the intel fab tour was more nerve wrecking lol~ even though he wasn't holding anything like here, his hand gestures and body movement so near all those precision machines after saying we shouldn't touch anything was true anxiety. (oh yea, and he actually did pat machines anyway) XD
Something tells me the display units are probably nonfunctional if they're willing to let Linus take one off the wall and open it up with little to no supervision.
@@phoenux3986 Nope. I'm sure they are fully functional hardware items. I'm kinda sad he didn't drop one! Next week: Repairing the $150,000 server we had to buy after breaking it!
If you've watched him for years, you get used to it. Gold controller, 10k Intel CPU (which he dropped) are just among the first things that come to my mind. xD
Gigabyte allowing Linus to disassemble a product mounted vertically is a level of trust I didn't know was possible, glad it worked out for them cause Jensen made it very clear how much it costs lol.
I’d imagine a sponsor on a Linus tech tips video is a few grand. But Linus making a an entire video directly on your product is somehow not worth him dropping it once a blue moon?
Fun fact... one of the first boards Acorn (the company who created ARM) made had a broken power connection to the CPU... but as ARM chips were so low powered, it was still fine
@@Dragoon710 You are in for a treat Lowspec gaming YT channel has a couple of videos covering ARM . ruclips.net/video/gKYOjDz_RT8/видео.html ruclips.net/video/nIwdhPOVOUk/видео.html
It's ironic that in an era where we went from needing dozens of dedicated cards to having most things handled in software, we are now going in reverse: Hardware processing things with dedicated chips or cards.
About 10 years ago when I was in college for Electrical and Computer Engineering this is actually one of the things we were talking about. We're more or less hitting a brick wall in miniaturization and increasing the raw speed of individual components. How do we improve performance when we can't miniaturize our chips any more than we already have (At this point we're talking about transistors that are so small that you can count their width in atoms)? Well you offload tasks into different chips (TCP/IP on the network adapter and like Linus showed putting the encryption workload on the adapter). If you find there's a specific workload that you're constantly asking your general-purpose CPU to do, it might start to make sense to put that task on a specialist chip rather than putting it on your CPU. ASICs are on the rise and expansion cards are coming back.
@@autohmae Yeah, we were talking about that at the time as well. I avoided saying it because I kind of hate talking about Moore's law online - you almost always get some kind of blowback when you talk about moore's law being dead. On the consumer side of things I could almost see why you might think moore's law isn't dead. We're not really seeing smaller+faster all that much anymore. We occasionally barely scrape by into a smaller node, but you're not really getting faster and more efficient transistors out of it anymore, instead you're mostly cramming more stuff onto the die and subsequently aiming a firehose at it to hope you cool it enough to not explode.
@@Ferretsnarf This has happened before, and ASICs have always had a need over general purpose processors. Our reasons for stagnation in tech is more of a complex problem as opposed to exclusively being down to physics. As it is, quite a few clever people in fields of research have proposed numerous workarounds that are plausible in theory, but simply not testable at the moment and not feasible on a wide scale, especially without aggressive grant funding like in the past. If anything, I would say that we're actually quite lucky that AI has brought about a bit of a resurgence in potential general optimization and advancement. Finally, Moore's law was always more of a "loose observation" and never intended to be indefinite, with Moore himself saying that he was certain the trend would not hold for long and become irrelevant to the next abstract steps in advanced design.
I think these are non-operational demo examples. That's why they don't care. You don't hang $100,000 machine on the wall of a convention. You put up the dead CPUs and mockup PSUs that are basically worthless.
@@z0phi3lMicrosoft is far from there with their Qualcomm chip surface laptops, maybe for a student taking notes and using a web browser but it’s basically the compute power of a phone lol
@@NostraDavid2 I think you have that switched. I’ll be surprised if 90% of consumer PCs aren’t running ARM SoCs in 10 years. And I’m talking mostly pre-builts here.
@@NostraDavid2 if this goes like I think it will, we won't have a choice. Wild guess is Intel x86 will make it to 16th gen before they kill it, same with AMD, 2-3 more x86 before they also switch to all ARM
The little giggle of holding a server... a very expensive server and not dropping it made everyone's day! Like a kid in the toy store... Would love to see how hard it was for Jake to pull him away kicking and screaming.
Hopefully Linus is still making content fifteen or twenty years later, when you can pick these up for relatively cheap to see how they perform in games.
@@Xorthis A100 and newer would perform very poorly in games as only a small subset of the chip supports graphics workload. From the H100 architecture whitesheet: "Only two TPCs in both the SXM5 and PCIe H100 GPUs are graphics-capable (that is, they can run vertex, geometry, and pixel shaders)." (a full H100 has 72 TPCs)
What I find amazing about ARM architecture CPU's is that the very first one was was simulated and developed by Acorn Computers on an Acorn BBC Microcomputer (which used a MOS 6502 CPU) . The original name for Advanced RISC Machine was Acorn RISC Machine. I'm happy to say as a Brit I saw the beginning of this CPU legacy and still own both my BBC Microcomputer Model B and my Acorn Archimedes A3010 (which featured an ARM250, the 2nd generation ARM CPU). There was an actual ARM upgrade system for the BBC Micro, but it was far out of anyone's league/access and was mainly used by Acorn to develop the Archimedes.
Fun fact: when the fist ARM CPU passed it's bench test, the testers went to unplug it, and realized it was already unplugged. It had passed it's bench test entirely on residual stored energy. It was THAT power efficient.
This is why the Raspberry Pi exists, to re-ignite the BBC Micro experience, as a teaching tool. And I can report: the Raspberry Pi is the most sold computer from the UK ever.
@@autohmae I was just reading about the shortages! Insane that it's so popular. I also just bought an RP2040 to mess about with. Incredible little devices!
Granny's Garden and Suburban Fox were two of the biggest games in my primary school :D Damn I miss the old BBC machines. Just before my computer lab went to x86 to keep up with the newest trends, we had one RISC machine with module cards. It could run as an Acorn, as a BBC, or even as a 486 (Each module card had the CPU to run that standard). I have no idea why this kind of system never made it.
Imagine if Nvidia's reps didn't know Linus has a screwdriver and he just looked around, saw the reps moved away and started dismantling the showcase board before anyone could take a notice 😅😅
Fantastic overview of the new NVIDIA products and a stellar breakdown on ARM procs and where they work best. I'm working through some NVIDIA certification courses and the info is all there but they provide no context other than a dizzying array of multiplier comparisons against previous gen hardware and this video brought it all into focus. Thanks so much, really helpful!
Nvidia has been making CPUs for over a decade now. Tegra initially for high end tablets and now for high end (~$700-$2,500) embedded systems. And they've been making Grace for AI prototyping workstations for about 5 years (if you have a spare $25,000). If you only have $5,000, there are a few options with the Ampere Altra if you really must have ARM. The power savings are very suspect, Jeff Geerling tested and found it to not be much different than Threadripper.
Arm is no magic bullet to energy efficiency, if using arm alone would be enough to make cpus more efficient even at high power, we would only have arm cpus
Nvidia is betting that people will use Hopper and most definitely betting that people will buy their expensive ass interconnect modules. The actual performance of these chips is probably meaningless outside of the context of "shove a shit ton of DDR5 at it", much like Apple Silicon. And plus, AMD already beat Nvidia to the punch here. MI300 is CDNA3 + Zen 4 on a single package, using their Infinity Fabric (which is literally the same technology but packaged differently) Epyc still exists, and is impossible to actually beat because its much more versatile than these bespoke solutions. Until Arm can complete outside of the niche, we will keep hearing these arguments for years to come. Zen 4 is extremely efficient, as good as many Arm chips so x86 isn't out of the game yet
I mean let's be honest. This video is going to get more views than anything in the entire rest of the weekend of this convention... It's worth the risk of having to drop something when he is the headliner.
Linus: 'I didn't ask permission for this part but nobody seems to be stopping me.' Security: 'That's Linus... just let him do his thing. He'll put it back together... probably.' 😂
I mean knowing how security goes for various events they probably weren't fully informed of what he could and couldn't do, just that he was allowed to mess with the display over there.
Just in case: SLI works but it's mainly dépendant on the type of work asked to the GPU, and games are not benefiting much of the multiple nodes. For scientific computation however...
Well, SLI is actually a great technology, but its requires high competency from game developers, and lets just say that's not too common. Look at simulation programs or modeling and raytracing software and you realize how awesome sli setups are when running proper software.
@@Struct.3 on point! If you want to see a well optimized game for sli/cf, have a look back at crysis 2! May not have been the best in the series, but multi-GPU support in that title was wildly effective!
@@Struct.3 yeah, feels like game devs these days need 'guardrails' enforced by Sony and a 'one click - enable' to implement feature button. (Thinking about the interviews on Morres law is dead channel) For the mentioned use cases you can forget running that on consumer cards as none have the connectors anymore
The difficulty with SLI is that is has to raster frames real time for 144+hz display on a screen. GPU offloaded work, such as NN machine learning, is a much easier task to parallelize.
This is nothing short of insane, the fact that there is so much processing power with less power means that we will have much higher speeds throughout our internet!
This won't really make your internet faster. But, there's a case to be made that it might, in a roundabout way, make your websites load faster because the website is running on this hardware.
@Linus, I'm so glad you decided to step down as CEO so you could focus on the magic! Every day I tune into this channel to learn something new and you guys always manage to keep it fresh and engaging! Long Live LTT!
This is NVidia's future, and they know it. Good, now we can let folks like Intel and AMD shine a bit more, especially when they get their drivers ironed out.
Im sure NVidia will be sad about that as they control whatever runs the AI customer support you work with, the AI that power your online services, the servers that renders your movie...
That Grace Hopper is a freaking piece of art. It makes you want to code an entire OS and Game just to test it. Just imagine what a crazy project that would be.
@@TheCHEATER900 imagine going back and telling her how big a transistor will become. The ones she developed software languages with were vacuum tubes several inches long.
@@chrishousby2685 I imagine that anyone working with computers understands how quickly they will improve. The stuff shown in this video may be made for personal computers in 20 years. Just as 20 years ago, personal computers had millions of times less space and compute power.
@@puppergump4117 honestly I don't think personal computers will be a thing in 20 years, it'll probably be closer to cloud based hardware. It'll be cheaper and more efficient than making individual hardware for both parties.
Those AI art programs which can produce 30 photo-real variations of, "A mountain of cookies" in under a second strongly suggests that we're living the last generation before everybody is born in pods and never uses their eyes. I'm legit alarmed by these compulsive engineers who know deep down that they should put the brakes on, but just can't stop themselves.
@@MarkOakleyComicsOnly idiots think AI will overthrow the world. If you actually understood what AI is and how it works you wouldn't think that. It's not some magical sentient being. It's literally just mathematical models and equations used to predict future outcomes based on inputs datasets. Datasets, which need I remind you, need to come from living, active, intelligent humans. If there aren't humans producing new, creative, informative data, AI would be useless. AI is a good thing. It is simply a tool to help us simplify our work and reach our goals. It can, and hopefully will, be used to ease and remove the burden of existence from mankind, so we can truly be free to do what we want and not struggle just to survive.
It's insane only if you compare it to consumer-level hardware and software. Remember, governments all over the world have and maintain far higher tech than the public can even dream of. They secretly use this tech for military, scientific, and usually espionage purposes. We get only the bottom of the barrel. Most of the tech we use today were once government secrets. The Internet itself started as a US military defence and research project.
@@rudisimo Right. Because there aren't any examples of technology getting ahead of our ability to adapt without catastrophic results. I can think of a couple items of note just from the last few years. Meanwhile.., Neuralink is entering human trials.
The smile on Linus’ face is like a 80’s kid going to a toy store… you know you won’t leave the place with anything, but just being surrounded with the toys is a joy
Wild to see just how far-wide-deep the subscription model has reached. If the contemporary fiscal landscape were a chess board, the pawn could only move to a square that it's rented from the opposing king.
The way NVIDIA focuses on cloud and AI and so on and makes local gaming more and more expensive, I fear local gaming will get rarer and rarer. They want us to nudge to use GeForce Now instead, because it's more efficient to them to share the performance from its servers than to sell us individually a GPU.
True, thats the future. Thats also thw reason gaming companies want to go always online , games as a service. Cause those games you can easily do the transition from local to cloud without the consumer knowing and once ur locked in you gonna pay for renting the software and hardware
Pretty soon you're gpus are going to come with their own custom CPUs 😄 along with a few pelethites of storage for the AI data, so every time you play a game, the EI will be smarter every single time it customized for every single game for every single place style for every single player 😳
Honestly with the percentage of their income that is now coming from AI I dont really see them giving a shit about GeForce anything, they could be completely out of the commercial hardware space in 10 years. I mean I imagine they may already have rtx 5000 series in the wings and possibly even the 6000 series... But after that? If everything else goes according to plan then Nvidia wont care much about consumer cash anymore.
It will never be viable until the majority of the internet infra structure is pure fibre, copper is just way too high-latency (laggy) for gaming remotely, even 1GB fibre is borderline, in reality, 10gb full fat fibre is the minimum for a good gaming experience over remote connections, even current HDMI standards struggles to carry enough bandwidth to keep up with modern video games, so even with 10GB fibre a heavy compression technique will need to be employed, I wouldn't ever want to use it for gaming personally.
@@Wobble2007 What are you smoking? You can ALREADY play remotely at fairly decent latency with a regular 100 MB bandwidth. Barely anybody except competitive e-sports professionals care whether you have a latency of 50ms or 10ms
I can imagine Computex chief security officer watching this and thinking to himself "Why didn't anyone stop him? Well... At least he didn't drop anything."
I find H100's price tag nothing more than "well Google will pay no matter what" kind of price. A million dollar investment into a server for a company this large is both eaten up immediately by running costs, but also still a blip for total operating cost of the company. Nvidia shareholders must be happy as shit
Ya I think this is a good time to invest given their Q1 reports and all these new tech developments. Nvidia is currently operating like an innovator again and not some comfortable company (like intel was before AMD came back).
@@ZeLoShady Considering they had a 26% rally yesterday over the span of a single hour, I'm gonna go and assume that you're not the only one to think this
Holy crapballs that bandwidth. Damn cool stuff. I'm so happy Arm remains independent. Gonna be cool seeing what all comes to market following this beastie.
4:32 “*This* is Grace Hopper. On the one side, we’ve got the same, 72-core Grace ARM cpu we’ve just saw, but on the other side, the “ooooooo shiny” latest and greatest nVIDIA H100 Hopper GPU. Today I am going to review Grace Hopper, and show you all of its quirks and features.”
I have used a system with 1440 cores and 64Tb RAM, but it was a few hundred physical commodity boxes. The latest compute space stuff that is replacing the likes that I used, is insane.
This might be the first video where Linus hasn't damaged or at least recklessly handled expensive electronics. So it IS possible for him to not break stuff!
@@lonelyPorterCH The more surprising part about that was Linus actually yelling NO! I would have expected him to just carry on like it's normal to throw around electronics like that.
I love your contagious passion and enthusiasm for technology. I joined the PC industry as a hardware trainer/presenter in 1991. It took me months to accept the fact that i was actually getting paid to do something was so passionate about. Best wotking years of my life!
It's an Arm CPU, same architecture used on mobile phones processors, it's RISC based but is very powerful, all of the systems today have a version for arm: Android, Linux and Windows with the Windows for IoT version. If developers start developing compatibility layers for x86, like with Exagear, compatibility start to have a solution.
On apple devices everything is already compatible. Best we leave windows all together, make steam compatible with ARM and there you are, no more windows needed
@@SWOTHDRA It doesn’t matter much for the immediate future (5-10 years) if steam is compiled for ARM or not since it’s not a high performance application and therefore works fine with a translation layer (e.g. macos). Also it doesn’t really matter for most people anyway since pretty much all games released till now are not going to get an update to run on ARM anyway so if you want to be able to keep playing your library making the switch won’t make any sense whatsoever.
so like m2 ultra at the server level. this makes me really happy this kinda stuff is great for the environment energy wise. also if that interconnect could be used for SLI or simiilar in the future that would be huge!
Grace Brewster Hopper (née Murray; December 9, 1906 - January 1, 1992) was an American computer scientist, mathematician, and United States Navy rear admiral. One of the first programmers of the Harvard Mark I computer, she was a pioneer of computer programming who invented one of the first linkers. Hopper was the first to devise the theory of machine-independent programming languages, and the FLOW-MATIC programming language she created using this theory was later extended to create COBOL, an early high-level programming language still in use today.
ARM is a RISC instruction set. The Hewlett-Packard Packard PA-RISC was way ahead of its time. I worked on the first HP 3000 on MPE & HP 9000 HP-UX systems. Some of the desktop workstations like the tiny 715 systems were incredible in 1980’s.
Ah the toys of my youth! I worked with some of that wayyyy back when along with many other goodies that all ofnthe winbloze babies wouldnt have any clue what it is now nevermind how to use it and due to the millenials andnbeyond idiotic overly entitled arrogant bs they dont even appreciate that which was gained to make the current toys evennpossuble via our hard work long before they were a set spot on cheap hotels sheets
You beat me to this comment, but I made it anyway 😂 I'm sort of scratching my head as to why he's (Linus??) acting like it's a new thing... Just for novelty I still have a SUN E450 still running and productive 😂
@@konnorj6442 First of all, it is this exact toxicity that completely stagnates any real intelligence... I would rather be stuck fixing Windows 3.1 and Vista installations for the rest of eternity than ever hold a mindset akin to yours. Every architecture, operating system, and programming language has its strengths and weaknesses, and it is our responsibility as technicians to learn and understand each one so that we can always provide the best for whatever our client is trying to achieve. I have met both old and young people who are kinder, more intelligent, and exhibit far more competence than you have shown here.
Hearing Linus saying “I can’t believe they let me take this off the wall” and proceeding to laugh like a small child made my day. Linus is the geeky adult version of a kid in a candy store 😅
6:59 my CFD software can fit 1.25 billion cells in a single GPU with 64GB VRAM. 150TB could hold 2.85 trillion cells, holy smokes! That is 14170³ resolution.
1:53 more power efficient* * if under around 45W, x86-64 instruction set has a min. cost to load, after that it's pretty even. E.g. M1 at max load is 30-35W, and that's why it trumps in the mobile space. Here it's going to be more interesting if ARM is used as orchestration only.
AWS has had a similar card to the last thing you showed off since around 2017. They just call them Annapurna cards inside the data centers (likely because they're made by Annapurna, a company they acquired back in 2015), but it's literally that. 1 or 2 SFP ports + an ethernet port and it gets used as the NIC inside pretty much all their servers these days. I assume the industry at large has had cards like that since 2015~ or even earlier, since I don't remember AWS being on the leading edge of anything in the data center space when I was working for them. xD
The only 'revolutionary' part of AWS was elasticompute almost entirely in the software side. They do self manufacture certain things now tho (that the company would probably actually look to terminate me for discussing. Me and my PIP is already riding very thin water heh)
the biggest problem is intel and x86 and their royalty fees.... that require everyone to pay them for using x86 chip architecture while they do nothing to improve it... so chips are gonna be high in price.. every new chip that comes out is at least 400-500 dollars. Amd uses x86 architecture so they need to pay intel a fixed amount of royalty fee for no reason other than using the x86 architecture that has been globally used by everyone and invested in by developers around the world. i dont think intel shuld be getting payed cus it "ownes" a globally used architecture needed in all kinds of health sectors food production companies and businesses around the world. Mostly cuz all the hard work comes from the developers who made the programs. Apple tried to exclude intel from all of this but they are no better for using arm architecture that is also licenced. we need a licencing free architecture that everyone can use only that way we will get rid of the money sucking leeching companies such as "intel". i was rooting for huawei chips but poor huawei got banned here.
@@AM-uy1ez Intel pays AMD licensing fees for AMD64, while AMD pays Intel licensing fees for x86. They were suing each other and reached a cross-licensing agreement years ago.
I like to think that Linus isn't supposed to be there and that he just started unscrewing the displays out of habit and nobody was brave enough to stop him Edit: Well 3:12 confirms that! Never change, Linus. Never change
ARM - "Acorn RISC Machine - first used in 1983. The ARM company never made processors/chips themselves, but designed them in specialised CAD systems. The CAD logical design file then was converted into a physical design that could be "printed" (my term) by the "foundry" (industry jargon). Such a logical design actually facilitates simulation in software of how the processor will work. The first physical batch of ARM came back to the ARM company and they had their physical test motherboards. Set the mobo up, plug the CPU in, run tests. Overnight, one of the engineers wakes up and becomes aware there was a connection or configuration issue in the power-lines and the test should have failed. Turned out the processor needs so little power that it had run off the power leaked into the processor from I/O presented to the processor. That's why almost all CPUs in smartphones are derived from that first ARM and why Apple derived their current generation of "proprietary" Apple chips from ARM too.
I'd like to point out that because LLVM is the backend for quite a few modern languages compiling for arm is as simple as passing a parameter to your compiler
Virtualization really pivoted the server market into making CPU's with massive memory requirements. Each virtual node may be okay with a small number of cores but every single virtual machine needs a lot of dedicated memory - it adds up fast. And spinning that node up and tearing it down in an acceptable period of time changed the whole network architecture datacenters. Tensorcores (AI) exacerbated the problem. And as bad as it is now - CXL is going to really do a number on it as things get more heterogeneous. Fun Fact: Windows used to run on ARM, MIPS and x86. In fact it still does on the lowly Raspberry PI (IOT) which is ARM. In the 90's the fastest Windows machine was actually an Silicon Graphics Octane Server (MIPS) with many CPU's and an interconnect design that is very similar to NVLINK. NVIDIA's acquisition of Mellanox really focused on Data Center and the necessary interconnect needed at chip level. They have been "beyond" simple GPU/Gaming for many years. The margins are much higher in the datacenter as is the refresh cycle.
The network card you showed at the end reminds me a lot of IBM Z-Series programmable IO. And yeah, offloading IO stuff to a coprocessor is the secret to crazy high throughput. You guys have seen in your reviews of desktop products how bogged down the system gets with high speed IO if it needs to be handled all by the CPU.
As someone who had to use MLNX Connect-X4 and 5 cards -- the software for these cards is an absolute nightmare. 1.) Sometimes the cards would just stop working until someone physically disconnected, then reconnected the cable. 2.) Kernel Updates in most cases will brick the drivers, causing your network adapter to stop working until you can either reinstall the old driver or install whatever updated driver is released for the new kernel. 3.) On at least one occasion, shortly after NVIDIA bought Mellanox they decided with zero warning to change the default operating mode of the driver from IPoIB to RDMA -- with zero help / instructions on how to revert it. This one left a really bad taste in my mouth. 4.) The Documentation is bad. NVIDIA has done a terrible job of providing even a similar level of documentation to the old Mellanox documentation. Frequently when debugging issues with these cards, I found myself looking up old Mellanox pages for ancient secrets to try and equip myself with enough knowledge to debug stupid problems. 5.)In order to fully utilize these cards you need to operate in RDMA mode, which literally requires that your application be compatible with RDMA/Configured for it. RDMA basically lets data get piped straight from the network to your application -- completely skipping the kernel. It's intended for crazy fast low latency. But they don't really tell you that. So if you're running in IPoIB then you're basically just paying a couple hundred bucks per cable for nothing. 6.) Networking for these things requires you run a network manager on one of your nodes, or pay crazy licensing for a network device that can support running the network manager. So it's dummy thicc easy to setup out of the box -- but it's actually quite a pain to set it up correctly. TL;DR -- These cards are absolute trash if you don't fully invest into utilizing them correctly, and that impacts everything from hardware to the application. The software isn't robust, and your engineers are going to HATE it. I wouldn't recommend even entertaining the idea unless you absolutely intend to build your entire application/system around the idea of using RDMA.. cause its definitely not something you can just tack on for more performance.
In 2004, my home lab included a 186 drive NAS, full of 18GB 15k RPM drives. It had similar iops and bandwidth to the best SATA SSDs, a decade earlier. I connected my dual processor workstation to it directly with Fibre Channel. But this stuff is so advanced... I can't even wrap my head around using it outside of an enterprise environment.
Ah I have another clone out there in the wild.. of a sort Trick is I've worked on some of the bleeding edge goodies and even in a pro environment some of it is so wild it still really not useable by just one person per se Like my friend (asm coder of god like level).. way back when I was sadly stuck several hundred meters out of the area where cable inet was avail at home he got direct fiber to his home due to his work.. granted his home was only about 400 feet from the nearest main trunk data center by sheer luck.. but thenspeeds he got were so fast nothing he could really build for use at home could use the bandwidth he got. Since then it's even more insane.. decades later his connection is now so fast his server class top speed flash drive array cant write fast enough to fully use the fiber connection Lucky fukker he is lol
@@HarshJha My setup has become much more consumer-grade due to the rapid advancement in tech. I've got an i5 running FreeNAS with 4x 12TB and 6x 20TB mechanical drives, and 3x SSDs for cache. It has 10GbE ethernet, but none of my computers have more than 5GbE - and it can fully saturate that. But I almost never do. I only use it to archive home movies (I'm not a professional content creator) because nvme SSDs are just so big / fast / cheap compared to a few years ago. Dual 4TB SSDs is plenty for so many use cases.
8:32 You can see the sheer terror in Linus's eyes when someone from off camera threw that card to him. I'll never be able to afford anything that he showed, save the computer setup at the booth. But it's cool seeing a glimpse into what will be.
Connect-x cards are expensive but actually not insane expensive. I'd bet based on personal experience looking at costs of other Mellanox cards the 'average' price for the entire connect-x 7 SKU (ie all the different speed and port number options) would be in the $1500 range... the 400GbE will be expensive as, but a 100GbE or 50GbE will be less. The 'Smart Nic' / DPU on the other hand.... yh them would be very very expensive, but equally, as Linus says, super super cool and for cloud providers a huge benefit to getting more CPU compute out of existing hardware.
PSA from someone who used MLNX connectors during/after the NVIDIA buyout of Mellanox -- If you're a Sys/Network admin those connectors are a fucking train wreck to deal with (at least up to ConnectX 5 cards) There are different versions of the drivers for different hardware types too. Like if you're on dell hardware you have to download a dell specific driver for these cards, and it is NOT well documented that it is the case. 1.) NVIDIA on at least one occasion released a driver set which changed the default mode from IPoIB to RDMA -- which absolutely fucked my environment up. 2.) Frequently kernel updates would brick these drivers -- requiring the sys admin to reinstall the drivers entirely.. so make sure you have multiple ways to access systems that use infiniband.. cause you can easily end up driving out to the data center to fix these things. 3.) We had some devices on Dell hardware that would just randomly stop working -- and the only way to fix them was to literally disconnect the cable and reseat it. Dell, NVIDIA and Mellanox were all zero help in debugging this issue. Do not blindly buy into using MLNX adapters. We ended up buying into it because NVIDIA recommended them, and the lead engineer thought it sounded cool. We had no idea that NVIDIA was recommending them to us because they intended to buy Mellanox out. NVIDIA's documentation on configuring these drivers leaves a lot to be desired -- and a lot of the time you'll end up looking up old Mellanox documentation to find useful commands that can help debug issues. If you actually need to troubleshoot any issues, you're gonna have a bad time. They're basically just trying to sell DGX and HGX systems -- and while you can get all kinds of fancy numbers/specs from these cards, they absolutely suck manage/troubleshoot. Especially when you throw them into different hardware. In order to fully utilize these connectors you need to run in RDMA mode. Running RDMA requires that your applications fully support RDMA, because it basically allows data to skip the kernel entirely and get piped straight to the application -- this gives nanosecond latency. That being said if your software doesn't support RDMA, then you're stuck using IPoIB, which is an extremely expensive waste of hardware. Might as well just use regular 40GB fiber at that point. You'll end up getting sucked into needing network devices that can handle the Mellanox connectors, and either running a network manager as software on one of your nodes, or paying out the ass for licensing to have a network device that can run the network manager software. We had to run some long cables between racks and were spending nearly $800 per cable at one point too. TL;DR -- DO YOUR RESEARCH COMPLETELY BEFORE BUYING INTO THAT SHIT. It's a huge pain in the ass from a sysadmin standpoint. As the guy who ended up having to figure out how to get these things working, maintain them, automate patching/updates, troubleshoot performance and network issues with them -- I wouldn't recommend ever using Mellanox for anything unless you build everything else around the idea of using RDMA. It impacts everything from hardware all the way up to the application. We had about 200 GPU servers using them, and I was the poor sap who was stuck writing ansible playbooks to try and unfuck these things. They made my life hell for quite a while.
6:55 "Giving them access to up to 150TB of High Bandwidth Memory" The total memory capacity is ~150TB, but only a fraction of it is HBM. Each node has 512GB LPDDR5X but 'only' 80GB HBM.
One thing I super like is the entire chipset cpu and ram all on one single board. This will make a lot of things faster and better. You really only need to offer 1 or 2 ram amounts because you can off set the price of the more ram by selling a lot more units.
@@mr420quickscops2 that too..I'm 40 years old and been around tech my whole life and is my career and continue to get that oooo yeah moments in life when stuff like this comes out
I love the idea of Linus just going into conventions and just unscrewing random tech he finds all over the walls without permission.
sinus lebastion is just too dangerous at conventions
I was about to comment this myself, goes to show how much the companies trust him now
seems like something he does everywhere he goes
That's probably what he do in the old day. On-the-spot permission with no prior planning
the way he chuckles as well, when he actually get's permission. lol
They don't want you to know this, but the processors at the convention are free. You can just walk up and take one.
Just the reason that nvidia squez the moneyz from gamer. They now can gift thier new CPUs, the gamer allrdy payd for the development and research.
Btw. HP Moonshot was a fail. I see here no diff, just a bunch of Desktop GPUs crunchd in a 14" Laptop Blade. But it is AI man!!!11!1!!, the next hype hardware for failing startups.
*doorbell rings*
Linus 2 weeks from publishing this clip: Why did [ship company] just pull up in a semi?
Also, these are our available GPUs.
Can confirm, I went to one of these conventions and offered $1000 for one of their processors.
Their answer? "It's not for sale".
Snooze you loose Nvidia, thanks for the freebie
I think we all know that you made a joke like this because you thought about stealing that poor CPU.
A green cpu with a blue gpu may soon be possible.
Scary times.
Scary times with pricing and greed. But interesting times hardware/software technology wise
What the hell kind of bizzaro world are we in?
Highly doubt the green goblin is interested in making a cpu for peasants like us.
It already is
The world is ending
@@justinbiggs1005 scary times indeed for the PC cucked race, hold up ya'll are now gonna get cucked the other way around Nvidia proc and intel gpu??? Damn the PC peasant race keep taking L's.
I'm not surprised they let you take it apart. 1.5 million views in less than 24 hours is more coverage then this would get anywhere. I love these types of videos.
Yeah duh. This whole video is full of shill. Do people actually think this isn’t paid for by Gigabyte or Nvidia. It might aswell be a marketing video for them
@@Pr0f3_YT Who cares if it's marketing. Its still cool.
@@Pr0f3_YT at least through Linus we can’t done transparency from these big tech companies.
We actually get to see up and coming tech and Linus explains it’s use cases etc to us normies.
@@Pr0f3_YT how do you expect them making money ? youtube pay is sh*t, everyone know that.
@@Pr0f3_YT they would have to disclose that fact if it was.
For reference on the name: Grace Hopper was the US Navy computer scientist who wrote some of the earliest theory on machine-independent programming languages and is credited for writing the first compiler, two incredibly important steps towards modern computing.
yes, hearing 'grandma COBOL' mentioned did bring a smile to my face
yeah, NVIDIA names a lot of their architectures after important people in science history.
also ranked Rear Admiral on top of that
Grace Hopper has a Posse.
She was also the first person to coin the term 'bug' in computer sciences because she found an actual bug in one of their systems and then taped it into the maintenance log book.
The confidence that some manufacturers have in Linus despite his track record is impressive.
That's because if Linus drop's their product it's free advertising though clips for years to come lol
Also, these are display units. Meaning they either don't work at full capacity, or might not even work at all.
@@TAMAMO-VIRUS big companies rarely put something valuable out there in public view, sometimes it's just a dummy unit.
@@FishmanistanI did not think of that. I have found myself watching Linus drop compilations
Walk in there with a fat wallet and or million dollar business insurance policy theyd let you do it to 🤷🏻
I love how Linus just HAS to disassemble everything he gets his hands on
thats how he rolls
Someone somewhere was holding their breath saying don't f'ing drop it Linus don't you dare drop it 😂
thats how he rick rolls
He could not use the LTT screwdriver though! What a missed oportunity!
Imagine if it was GN Steve...
Intel made a GPU, now NVIDIA made a CPU, what a time we live in
😂🤣😂🤣😂🤣😂🤣😂🤣
Yea,, what's next?..... Men making babies and women getting drunk and having tattoos ? ;p
NVIDIA should start making motherboards again to go with that new CPU. That would be a real trip. 😆
@@KanadeYagami and we willn't be able to buy that motherboard due to its price.....😂🤣😂🤣😂🤣😂🤣😂🤣
meanwhile we "gamers" are still fine..
I've never been so nervous watching Linus holding new tech.
the intel fab tour was more nerve wrecking lol~ even though he wasn't holding anything like here, his hand gestures and body movement so near all those precision machines after saying we shouldn't touch anything was true anxiety. (oh yea, and he actually did pat machines anyway) XD
Something tells me the display units are probably nonfunctional if they're willing to let Linus take one off the wall and open it up with little to no supervision.
@@phoenux3986 Nope. I'm sure they are fully functional hardware items. I'm kinda sad he didn't drop one!
Next week: Repairing the $150,000 server we had to buy after breaking it!
@@Xorthis haha :D I need to see it! But I gues it costs much more.
If you've watched him for years, you get used to it.
Gold controller, 10k Intel CPU (which he dropped) are just among the first things that come to my mind. xD
Gigabyte allowing Linus to disassemble a product mounted vertically is a level of trust I didn't know was possible, glad it worked out for them cause Jensen made it very clear how much it costs lol.
They got a visit to ASML... that cuts everything. Visit to one of the arguably most complicated machines on earth is not a easy task.
That module was probably a dud or scrap part that they just used to show how it looks. Ain't nobody leaving a $100K chip hanging on a wall
At first I thought he was going to DELID it.
I’d imagine a sponsor on a Linus tech tips video is a few grand. But Linus making a an entire video directly on your product is somehow not worth him dropping it once a blue moon?
The thing is, that probability is a lot higher than once in a blue moon 😄
Anyhow, it’s mostly just a banter from his loyal fans.
Linus: we haven’t been on good terms with nvidia for a long time
Also Linus: proceeds to dismantle latest nvidia tech
It's gigabyte's booth
@@Saint_Chompy they are third-party seller this is NVIDIA tech tho
@@brandonmoss7976 Realistically, Nvidia can't do shit if Gigabyte wants to show off their new stuff that's already available.
@@brandonmoss7976 Gigabyte can let linus do what he wants... Nvidia would not stop him lets be real here.
@@brandonmoss7976 Which does not matter?
If you bought a car from Toyota and started dismantling it, do you think Toyota could tell you to stop?
Linus is literally the legend of the tech industry. imagine not only being invited to a pre-show, but also being allowed to play with the displays.
Fun fact... one of the first boards Acorn (the company who created ARM) made had a broken power connection to the CPU... but as ARM chips were so low powered, it was still fine
I watched that... the insanity was that residual power from capacitance all around the chassis managed to power the circuits!
@@someoneelse5005 that seems very interesting how can I find this video?
Then RISC, what ARM is built on, the CPU was running after power was disconnected.
😮
@@Dragoon710
You are in for a treat
Lowspec gaming YT channel has a couple of videos covering ARM .
ruclips.net/video/gKYOjDz_RT8/видео.html
ruclips.net/video/nIwdhPOVOUk/видео.html
That totally natural scan around the room before he takes the thing apart is just brilliant.
It's ironic that in an era where we went from needing dozens of dedicated cards to having most things handled in software, we are now going in reverse: Hardware processing things with dedicated chips or cards.
About 10 years ago when I was in college for Electrical and Computer Engineering this is actually one of the things we were talking about. We're more or less hitting a brick wall in miniaturization and increasing the raw speed of individual components. How do we improve performance when we can't miniaturize our chips any more than we already have (At this point we're talking about transistors that are so small that you can count their width in atoms)? Well you offload tasks into different chips (TCP/IP on the network adapter and like Linus showed putting the encryption workload on the adapter). If you find there's a specific workload that you're constantly asking your general-purpose CPU to do, it might start to make sense to put that task on a specialist chip rather than putting it on your CPU.
ASICs are on the rise and expansion cards are coming back.
Do you remember some people were saying: the end of Moore's law ? That's what is going on here...
@@autohmae Yeah, we were talking about that at the time as well. I avoided saying it because I kind of hate talking about Moore's law online - you almost always get some kind of blowback when you talk about moore's law being dead. On the consumer side of things I could almost see why you might think moore's law isn't dead. We're not really seeing smaller+faster all that much anymore. We occasionally barely scrape by into a smaller node, but you're not really getting faster and more efficient transistors out of it anymore, instead you're mostly cramming more stuff onto the die and subsequently aiming a firehose at it to hope you cool it enough to not explode.
@@Ferretsnarf do you know why consumers with some technical knowledge don't know it's dead ? Because of the marketing with CPU node process size.
@@Ferretsnarf This has happened before, and ASICs have always had a need over general purpose processors. Our reasons for stagnation in tech is more of a complex problem as opposed to exclusively being down to physics. As it is, quite a few clever people in fields of research have proposed numerous workarounds that are plausible in theory, but simply not testable at the moment and not feasible on a wide scale, especially without aggressive grant funding like in the past.
If anything, I would say that we're actually quite lucky that AI has brought about a bit of a resurgence in potential general optimization and advancement.
Finally, Moore's law was always more of a "loose observation" and never intended to be indefinite, with Moore himself saying that he was certain the trend would not hold for long and become irrelevant to the next abstract steps in advanced design.
Linus holding a ~$150,000 compute module like it's a boombox will never get old
I can’t believe they trusted Linus not to drop one of these 😂
I think they more trust he can compensate fairly when he does, plus it would be good advertising.
I think these are non-operational demo examples. That's why they don't care.
You don't hang $100,000 machine on the wall of a convention. You put up the dead CPUs and mockup PSUs that are basically worthless.
I came here just to say the same...
Mounted to the wall? Probably just models with damaged CPUs anyways.
Talk about some character development, I'm actually proud he didn't dropped the one that was thrown to him, I had a mini-heart attack
I feel like we're looking at the future of consumer platforms in 5-10 years, just in BIG form
Like mentioned Apple is there, Microsoft is close, question is, who will do mass ARM based consumer chips first, Intel or AMD?
Some powerusers, maybe. I don't see windows hardcore switching to ARM. Who knows... Maybe we'll be surprised.
@@z0phi3lMicrosoft is far from there with their Qualcomm chip surface laptops, maybe for a student taking notes and using a web browser but it’s basically the compute power of a phone lol
@@NostraDavid2 I think you have that switched. I’ll be surprised if 90% of consumer PCs aren’t running ARM SoCs in 10 years. And I’m talking mostly pre-builts here.
@@NostraDavid2 if this goes like I think it will, we won't have a choice. Wild guess is Intel x86 will make it to 16th gen before they kill it, same with AMD, 2-3 more x86 before they also switch to all ARM
It's a real moment of pure pleasure, to see Linus with eyes that shine, like a kid in a toy store
Maybe he cream his pants😂😂😂
until he drops it
@@apotatoman4862 I think 80% of dropping are fake
The little giggle of holding a server... a very expensive server and not dropping it made everyone's day! Like a kid in the toy store... Would love to see how hard it was for Jake to pull him away kicking and screaming.
Hopefully Linus is still making content fifteen or twenty years later, when you can pick these up for relatively cheap to see how they perform in games.
i could imagine lets install batorcera for running ps6 games 😂😂😂 at ease, and for you dirty otakus create your own living A.I waifu cat girl
On Ali express 10 years from now
Sadly the "100" series, previously known as Tesla, does not support any video outputs and does not support any graphics APIs, it's only for compute
@@mika2666 That hasn't stopped people running games on them. There's a few benchmarks out there.
@@Xorthis A100 and newer would perform very poorly in games as only a small subset of the chip supports graphics workload.
From the H100 architecture whitesheet: "Only two TPCs in both the SXM5 and PCIe H100 GPUs are graphics-capable (that is, they can run vertex, geometry, and pixel shaders)." (a full H100 has 72 TPCs)
What I find amazing about ARM architecture CPU's is that the very first one was was simulated and developed by Acorn Computers on an Acorn BBC Microcomputer (which used a MOS 6502 CPU) . The original name for Advanced RISC Machine was Acorn RISC Machine. I'm happy to say as a Brit I saw the beginning of this CPU legacy and still own both my BBC Microcomputer Model B and my Acorn Archimedes A3010 (which featured an ARM250, the 2nd generation ARM CPU). There was an actual ARM upgrade system for the BBC Micro, but it was far out of anyone's league/access and was mainly used by Acorn to develop the Archimedes.
Fun fact: when the fist ARM CPU passed it's bench test, the testers went to unplug it, and realized it was already unplugged. It had passed it's bench test entirely on residual stored energy. It was THAT power efficient.
The first computer i ever used was a bbc micro. When i got to infant school and even highschool we had Acorns
This is why the Raspberry Pi exists, to re-ignite the BBC Micro experience, as a teaching tool. And I can report: the Raspberry Pi is the most sold computer from the UK ever.
@@autohmae I was just reading about the shortages! Insane that it's so popular. I also just bought an RP2040 to mess about with. Incredible little devices!
Granny's Garden and Suburban Fox were two of the biggest games in my primary school :D Damn I miss the old BBC machines. Just before my computer lab went to x86 to keep up with the newest trends, we had one RISC machine with module cards. It could run as an Acorn, as a BBC, or even as a 486 (Each module card had the CPU to run that standard). I have no idea why this kind of system never made it.
Imagine if Nvidia's reps didn't know Linus has a screwdriver and he just looked around, saw the reps moved away and started dismantling the showcase board before anyone could take a notice 😅😅
he has mounted ltt screwdriver in his butt... like always 😁
If they're not used to Linus by now, that's on them!
To be fair NVIDIA didn't let him do that, it's Gigabyte who does work with NVIDIA who did the presentations.
and then when they noticed, they'd be like "HEY!" and then Linus drops it. 😏🤣
Get ready for Not Super Ai🤖,
Bot for Super Duper AI👾👧!
Westworld is only a generation
away🤠👧(👾). ... :-))
Fantastic overview of the new NVIDIA products and a stellar breakdown on ARM procs and where they work best. I'm working through some NVIDIA certification courses and the info is all there but they provide no context other than a dizzying array of multiplier comparisons against previous gen hardware and this video brought it all into focus. Thanks so much, really helpful!
For Linus to not drop whatever he’s holding immediately after saying “I don’t even wanna know what this thing costs” is pretty astounding to me.
I personally doubt that those are working chips. It‘s more likely that they are defective and are used for exhibition purposes.
Linus has enough money to replace what ever gets broken guaranteed
you really think Nvidia is going to let him dissasemble working systems ? one of those racks is probarbly 500k
@@tombrauey oh it is known. Still funny though.
@@Demidar665 I didn't say that
Nvidia has been making CPUs for over a decade now. Tegra initially for high end tablets and now for high end (~$700-$2,500) embedded systems. And they've been making Grace for AI prototyping workstations for about 5 years (if you have a spare $25,000).
If you only have $5,000, there are a few options with the Ampere Altra if you really must have ARM.
The power savings are very suspect, Jeff Geerling tested and found it to not be much different than Threadripper.
I seriously forgot about tegra, it was sooo long ago
and the Switch
Arm is no magic bullet to energy efficiency, if using arm alone would be enough to make cpus more efficient even at high power, we would only have arm cpus
Is it a true that ARM instructions are more energy efficient, but require more instructions to get the same task done than x86 instructions?
Nvidia is betting that people will use Hopper and most definitely betting that people will buy their expensive ass interconnect modules. The actual performance of these chips is probably meaningless outside of the context of "shove a shit ton of DDR5 at it", much like Apple Silicon. And plus, AMD already beat Nvidia to the punch here. MI300 is CDNA3 + Zen 4 on a single package, using their Infinity Fabric (which is literally the same technology but packaged differently)
Epyc still exists, and is impossible to actually beat because its much more versatile than these bespoke solutions. Until Arm can complete outside of the niche, we will keep hearing these arguments for years to come. Zen 4 is extremely efficient, as good as many Arm chips so x86 isn't out of the game yet
It's a very brave move to allow Linus to hold anything important
I mean let's be honest. This video is going to get more views than anything in the entire rest of the weekend of this convention... It's worth the risk of having to drop something when he is the headliner.
They're almost certainly dummy chips that already don't work.
considering how much there is as stake if someone steals one, they're not real chips
@@trapical The people who can afford this most likely aren't in Linuses demographic.
Mad respect to Gigabyte for letting this chip get into Linus' hands.
Linus: 'I didn't ask permission for this part but nobody seems to be stopping me.'
Security: 'That's Linus... just let him do his thing. He'll put it back together... probably.' 😂
"trust me bro". 😉
he might even drop it!
Put it back with half the screws…
let him cook
I mean knowing how security goes for various events they probably weren't fully informed of what he could and couldn't do, just that he was allowed to mess with the display over there.
No no no this can’t be right
technically nintendo switch is Nvidia GPU
That’s what I said bro it’s not real I swear
No! And now Intel is making GPUs! And they are good value!!
GOD NO
@@thischannelisforcommenting5680 isn’t the switch more of an SOC?
4:14 who else was waiting for him to drop it
5:33 WHAT?!? 4TB/s?!? That's all computer data I ever have produced - in a single second?!?
Nvidia: we can connect multiple GPUs in multiple racks into one room filling huge Gpu
Also Nvidia: SLI...yeah that don't work
Just in case: SLI works but it's mainly dépendant on the type of work asked to the GPU, and games are not benefiting much of the multiple nodes. For scientific computation however...
Well, SLI is actually a great technology, but its requires high competency from game developers, and lets just say that's not too common. Look at simulation programs or modeling and raytracing software and you realize how awesome sli setups are when running proper software.
@@Struct.3 on point! If you want to see a well optimized game for sli/cf, have a look back at crysis 2! May not have been the best in the series, but multi-GPU support in that title was wildly effective!
@@Struct.3 yeah, feels like game devs these days need 'guardrails' enforced by Sony and a 'one click - enable' to implement feature button.
(Thinking about the interviews on Morres law is dead channel)
For the mentioned use cases you can forget running that on consumer cards as none have the connectors anymore
The difficulty with SLI is that is has to raster frames real time for 144+hz display on a screen. GPU offloaded work, such as NN machine learning, is a much easier task to parallelize.
4:22 dude you sounded like Ramon Salazar from og RE4 laughing like that LMAOOO
This is nothing short of insane, the fact that there is so much processing power with less power means that we will have much higher speeds throughout our internet!
Not only that but it will decrease the heat generated from it so it provides more cushions for the coolers
also cheaper hosting!
This can't happen. What will I blame my awful Counter Strike performance on?
Just lower the clock speed on your cpu and you'll have much better performance per watt.
Our home chips run way beyond the efficiency curve
This won't really make your internet faster. But, there's a case to be made that it might, in a roundabout way, make your websites load faster because the website is running on this hardware.
Incredible product! 👏 And it was great to see the confidence brands put on you Linus 👌
linus: *walks in*
also linus: *randomly starts unscrewing things from the wall*
I just love how excited Linus always is for new tech. Never change bro.
Can we all take a second and appreciate how casually Linus holds that GPU on his shoulder? ( 5:12 )
Like it's a boom box! 😂
@Linus, I'm so glad you decided to step down as CEO so you could focus on the magic! Every day I tune into this channel to learn something new and you guys always manage to keep it fresh and engaging! Long Live LTT!
I don’t know what’s more impressive, the technology or Linus not dropping it!
I vote for Linus not dropping it.
9-)
"You drop it, you pay for it.
Don't worry, we have payment plan options available."
Linus goes into these tech booths with all the abandon of a kid with boundary issues walking into a toy store unattended.
This is NVidia's future, and they know it. Good, now we can let folks like Intel and AMD shine a bit more, especially when they get their drivers ironed out.
You think Intel and AMD can get some GPU market share in HPC?
AMD drivers are really nice. Meanwhile, Intel drivers struggle to play old games.
As long as Nvidia continues to fail at purchasing a CPU vendor.
Im sure NVidia will be sad about that as they control whatever runs the AI customer support you work with, the AI that power your online services, the servers that renders your movie...
@@RedEverything My Arc A770's drivers are amazing after the updates.. AMD has gone nowhere in the last 10 years 😂
That Grace Hopper is a freaking piece of art. It makes you want to code an entire OS and Game just to test it. Just imagine what a crazy project that would be.
I love the homage to grace Hopper btw. Excellent naming
@@TheCHEATER900 imagine going back and telling her how big a transistor will become. The ones she developed software languages with were vacuum tubes several inches long.
We know what future Crysis developers will be using.
@@chrishousby2685 I imagine that anyone working with computers understands how quickly they will improve. The stuff shown in this video may be made for personal computers in 20 years. Just as 20 years ago, personal computers had millions of times less space and compute power.
@@puppergump4117 honestly I don't think personal computers will be a thing in 20 years, it'll probably be closer to cloud based hardware. It'll be cheaper and more efficient than making individual hardware for both parties.
5:42 i think the bandwidth is ~8TB/s, you were citing hopper
This hardware is utterly and completely insane. If you can comprehend the slightest bit of the numbers behind this, it's just madness.
Those AI art programs which can produce 30 photo-real variations of, "A mountain of cookies" in under a second strongly suggests that we're living the last generation before everybody is born in pods and never uses their eyes. I'm legit alarmed by these compulsive engineers who know deep down that they should put the brakes on, but just can't stop themselves.
@@MarkOakleyComicsyou should make your tinfoil hat tighter, perhaps that will help.
@@MarkOakleyComicsOnly idiots think AI will overthrow the world.
If you actually understood what AI is and how it works you wouldn't think that. It's not some magical sentient being. It's literally just mathematical models and equations used to predict future outcomes based on inputs datasets.
Datasets, which need I remind you, need to come from living, active, intelligent humans. If there aren't humans producing new, creative, informative data, AI would be useless.
AI is a good thing. It is simply a tool to help us simplify our work and reach our goals. It can, and hopefully will, be used to ease and remove the burden of existence from mankind, so we can truly be free to do what we want and not struggle just to survive.
It's insane only if you compare it to consumer-level hardware and software.
Remember, governments all over the world have and maintain far higher tech than the public can even dream of. They secretly use this tech for military, scientific, and usually espionage purposes.
We get only the bottom of the barrel. Most of the tech we use today were once government secrets.
The Internet itself started as a US military defence and research project.
@@rudisimo Right. Because there aren't any examples of technology getting ahead of our ability to adapt without catastrophic results. I can think of a couple items of note just from the last few years.
Meanwhile.., Neuralink is entering human trials.
My heart definitely skipped a beat at 8:34 when someone threw the Network Card. That would have been a hell of a drop.
Like that thing can cost as much as a car
@@tormodhag6824it probably does 😮
The smile on Linus’ face is like a 80’s kid going to a toy store… you know you won’t leave the place with anything, but just being surrounded with the toys is a joy
whats more amazing is them letting Linus handle those parts with his steady hands
Maybe it just was dummy demoware.
Wild to see just how far-wide-deep the subscription model has reached. If the contemporary fiscal landscape were a chess board, the pawn could only move to a square that it's rented from the opposing king.
IBM has been doing this for decades, so no surprise
I think the most surprising thing to me is that Gigabyte has enterprise class hardware.
Ah but notice they did NOT show you the GB power supply?
Lmfao
Pleaase dont say that about Gigabyte
My whole system is aorus bro
@@georgevel and nothing of it is enterprise class hardware
@@Tehkezah I said that bc ppl are saying about their psus and I wanna note that they got no problem
8:33 throwing prototype/showcase tech to Linus 'Droptips' Sebastian is a very bold move
0:13 LMAO that asian woman expecting a hug from Linus
Everyone running from Linus lol
0:16 I love how the worker jumps away as soon as he realizes he's going to inadvertedly go between Linus and the camera !
He tried so hard to bail but only drew more attention to himself. May he rest in peace. o7
The way NVIDIA focuses on cloud and AI and so on and makes local gaming more and more expensive, I fear local gaming will get rarer and rarer. They want us to nudge to use GeForce Now instead, because it's more efficient to them to share the performance from its servers than to sell us individually a GPU.
True, thats the future. Thats also thw reason gaming companies want to go always online , games as a service. Cause those games you can easily do the transition from local to cloud without the consumer knowing and once ur locked in you gonna pay for renting the software and hardware
Pretty soon you're gpus are going to come with their own custom CPUs 😄 along with a few pelethites of storage for the AI data, so every time you play a game, the EI will be smarter every single time it customized for every single game for every single place style for every single player 😳
Honestly with the percentage of their income that is now coming from AI I dont really see them giving a shit about GeForce anything, they could be completely out of the commercial hardware space in 10 years. I mean I imagine they may already have rtx 5000 series in the wings and possibly even the 6000 series... But after that? If everything else goes according to plan then Nvidia wont care much about consumer cash anymore.
It will never be viable until the majority of the internet infra structure is pure fibre, copper is just way too high-latency (laggy) for gaming remotely, even 1GB fibre is borderline, in reality, 10gb full fat fibre is the minimum for a good gaming experience over remote connections, even current HDMI standards struggles to carry enough bandwidth to keep up with modern video games, so even with 10GB fibre a heavy compression technique will need to be employed, I wouldn't ever want to use it for gaming personally.
@@Wobble2007 What are you smoking? You can ALREADY play remotely at fairly decent latency with a regular 100 MB bandwidth. Barely anybody except competitive e-sports professionals care whether you have a latency of 50ms or 10ms
I can imagine Computex chief security officer watching this and thinking to himself "Why didn't anyone stop him? Well... At least he didn't drop anything."
Security officer maybe. PR leader says "lovely". and maybe "sad he didn´t drop it for the meme-videos". It is worth a lot in advertising
4:31 why do I suddenly feel like I was watching a car breakdown?
Crazy to think that at one point our future generations will see this as ancient technology just how we see OUR ancestor’s tech (tools).
Their data centers have been pretty helpful for their stock
I find H100's price tag nothing more than "well Google will pay no matter what" kind of price. A million dollar investment into a server for a company this large is both eaten up immediately by running costs, but also still a blip for total operating cost of the company. Nvidia shareholders must be happy as shit
Ya I think this is a good time to invest given their Q1 reports and all these new tech developments. Nvidia is currently operating like an innovator again and not some comfortable company (like intel was before AMD came back).
@@ZeLoShady their q1 financial report showed a decline in revenue and net income lol
@@ZebraGoat exactly.
@@ZeLoShady Considering they had a 26% rally yesterday over the span of a single hour, I'm gonna go and assume that you're not the only one to think this
00:15 shoutout to that employee who saw you were filming and didn't want to be in the way
Holy crapballs that bandwidth. Damn cool stuff. I'm so happy Arm remains independent. Gonna be cool seeing what all comes to market following this beastie.
4:32 “*This* is Grace Hopper. On the one side, we’ve got the same, 72-core Grace ARM cpu we’ve just saw, but on the other side, the “ooooooo shiny” latest and greatest nVIDIA H100 Hopper GPU. Today I am going to review Grace Hopper, and show you all of its quirks and features.”
3:19 I'm more interested in those Intel Arc MXM cards in the background. Can we get some coverage on those?
I have used a system with 1440 cores and 64Tb RAM, but it was a few hundred physical commodity boxes. The latest compute space stuff that is replacing the likes that I used, is insane.
And can it run Crysis?
00:19 the commitment to not messing up the shot is admirable
this was funny
This might be the first video where Linus hasn't damaged or at least recklessly handled expensive electronics. So it IS possible for him to not break stuff!
Must be a robot Linus.....
What about the network card jake threw?^^
I guess that was not linus
@@lonelyPorterCH The more surprising part about that was Linus actually yelling NO! I would have expected him to just carry on like it's normal to throw around electronics like that.
TBF, we haven't seen it powered on since he touched it ... ! xD
He probably did damage something, it was just edited out to avoid liability.
Both guys at 3:13 looks a bit annoyed as to what Linus did lmao *Linus in the meanwhile saying 'no one seems to be paying attention to what am doing'*
I love your contagious passion and enthusiasm for technology. I joined the PC industry as a hardware trainer/presenter in 1991. It took me months to accept the fact that i was actually getting paid to do something was so passionate about. Best wotking years of my life!
It's an Arm CPU, same architecture used on mobile phones processors, it's RISC based but is very powerful, all of the systems today have a version for arm: Android, Linux and Windows with the Windows for IoT version. If developers start developing compatibility layers for x86, like with Exagear, compatibility start to have a solution.
On apple devices everything is already compatible. Best we leave windows all together, make steam compatible with ARM and there you are, no more windows needed
@@SWOTHDRA It doesn’t matter much for the immediate future (5-10 years) if steam is compiled for ARM or not since it’s not a high performance application and therefore works fine with a translation layer (e.g. macos). Also it doesn’t really matter for most people anyway since pretty much all games released till now are not going to get an update to run on ARM anyway so if you want to be able to keep playing your library making the switch won’t make any sense whatsoever.
If they are able to optimise their cpu and gpu together properly... It's gonna be fire
quite literally lmao
Yes, fire, AND a housefire
and ten million dollars
@@s8080_ lol
As long as they don't try making it so certain software features are incompatible with non-nvidia CPUs
so like m2 ultra at the server level. this makes me really happy this kinda stuff is great for the environment energy wise. also if that interconnect could be used for SLI or simiilar in the future that would be huge!
Grace Brewster Hopper (née Murray; December 9, 1906 - January 1, 1992) was an American computer scientist, mathematician, and United States Navy rear admiral. One of the first programmers of the Harvard Mark I computer, she was a pioneer of computer programming who invented one of the first linkers. Hopper was the first to devise the theory of machine-independent programming languages, and the FLOW-MATIC programming language she created using this theory was later extended to create COBOL, an early high-level programming language still in use today.
This is 100% the content I live for, awesome video. Keep up the good work!
ARM is a RISC instruction set. The Hewlett-Packard Packard PA-RISC was way ahead of its time. I worked on the first HP 3000 on MPE & HP 9000 HP-UX systems. Some of the desktop workstations like the tiny 715 systems were incredible in 1980’s.
Ah the toys of my youth! I worked with some of that wayyyy back when along with many other goodies that all ofnthe winbloze babies wouldnt have any clue what it is now nevermind how to use it and due to the millenials andnbeyond idiotic overly entitled arrogant bs they dont even appreciate that which was gained to make the current toys evennpossuble via our hard work long before they were a set spot on cheap hotels sheets
Beat me to this lol good comment
You beat me to this comment, but I made it anyway 😂
I'm sort of scratching my head as to why he's (Linus??) acting like it's a new thing... Just for novelty I still have a SUN E450 still running and productive 😂
@@konnorj6442Pardon?
@@konnorj6442 First of all, it is this exact toxicity that completely stagnates any real intelligence... I would rather be stuck fixing Windows 3.1 and Vista installations for the rest of eternity than ever hold a mindset akin to yours. Every architecture, operating system, and programming language has its strengths and weaknesses, and it is our responsibility as technicians to learn and understand each one so that we can always provide the best for whatever our client is trying to achieve. I have met both old and young people who are kinder, more intelligent, and exhibit far more competence than you have shown here.
1:27 DON'T DROP IT
Hearing Linus saying “I can’t believe they let me take this off the wall” and proceeding to laugh like a small child made my day. Linus is the geeky adult version of a kid in a candy store 😅
4:44 Holding the "computer" like it was a bazooka :D
They were fearless to risk it and trust Linus to no drop anything😂
6:59 my CFD software can fit 1.25 billion cells in a single GPU with 64GB VRAM. 150TB could hold 2.85 trillion cells, holy smokes! That is 14170³ resolution.
1:53 more power efficient*
* if under around 45W, x86-64 instruction set has a min. cost to load, after that it's pretty even. E.g. M1 at max load is 30-35W, and that's why it trumps in the mobile space.
Here it's going to be more interesting if ARM is used as orchestration only.
What you don’t see is just below the camera shot is a nvidia employee with their hands out ready to catch that module if Linus drops it 😂
wait what time
@@BeatXaber just jokes m8
@@BeatXaber 11:17
AWS has had a similar card to the last thing you showed off since around 2017. They just call them Annapurna cards inside the data centers (likely because they're made by Annapurna, a company they acquired back in 2015), but it's literally that. 1 or 2 SFP ports + an ethernet port and it gets used as the NIC inside pretty much all their servers these days. I assume the industry at large has had cards like that since 2015~ or even earlier, since I don't remember AWS being on the leading edge of anything in the data center space when I was working for them. xD
The only 'revolutionary' part of AWS was elasticompute almost entirely in the software side. They do self manufacture certain things now tho (that the company would probably actually look to terminate me for discussing. Me and my PIP is already riding very thin water heh)
Man, the speed at which these things rip through your wallet is insane
the biggest problem is intel and x86 and their royalty fees.... that require everyone to pay them for using x86 chip architecture while they do nothing to improve it... so chips are gonna be high in price.. every new chip that comes out is at least 400-500 dollars. Amd uses x86 architecture so they need to pay intel a fixed amount of royalty fee for no reason other than using the x86 architecture that has been globally used by everyone and invested in by developers around the world. i dont think intel shuld be getting payed cus it "ownes" a globally used architecture needed in all kinds of health sectors food production companies and businesses around the world. Mostly cuz all the hard work comes from the developers who made the programs. Apple tried to exclude intel from all of this but they are no better for using arm architecture that is also licenced. we need a licencing free architecture that everyone can use only that way we will get rid of the money sucking leeching companies such as "intel". i was rooting for huawei chips but poor huawei got banned here.
@@AM-uy1ez Intel pays AMD licensing fees for AMD64, while AMD pays Intel licensing fees for x86. They were suing each other and reached a cross-licensing agreement years ago.
This is your thing man, glad to see you hyped up in your videos again!!! welcome back to the old Linus :)
Finally a cpu that can emulate my mobile games with no frame drops!
Genshin Impact 2000fps
But can it play Crysis?...
I like to think that Linus isn't supposed to be there and that he just started unscrewing the displays out of habit and nobody was brave enough to stop him
Edit: Well 3:12 confirms that! Never change, Linus. Never change
Even when staff sees it they probably assume that someone allowed him to do it because no one is crazy enough to do it without permission.... right?
“That’s a lot of balls”
-Jensen while describing the solder balls under the processor💀😂
ARM - "Acorn RISC Machine - first used in 1983. The ARM company never made processors/chips themselves, but designed them in specialised CAD systems. The CAD logical design file then was converted into a physical design that could be "printed" (my term) by the "foundry" (industry jargon). Such a logical design actually facilitates simulation in software of how the processor will work. The first physical batch of ARM came back to the ARM company and they had their physical test motherboards. Set the mobo up, plug the CPU in, run tests. Overnight, one of the engineers wakes up and becomes aware there was a connection or configuration issue in the power-lines and the test should have failed. Turned out the processor needs so little power that it had run off the power leaked into the processor from I/O presented to the processor. That's why almost all CPUs in smartphones are derived from that first ARM and why Apple derived their current generation of "proprietary" Apple chips from ARM too.
I'd like to point out that because LLVM is the backend for quite a few modern languages compiling for arm is as simple as passing a parameter to your compiler
Virtualization really pivoted the server market into making CPU's with massive memory requirements. Each virtual node may be okay with a small number of cores but every single virtual machine needs a lot of dedicated memory - it adds up fast. And spinning that node up and tearing it down in an acceptable period of time changed the whole network architecture datacenters. Tensorcores (AI) exacerbated the problem. And as bad as it is now - CXL is going to really do a number on it as things get more heterogeneous. Fun Fact: Windows used to run on ARM, MIPS and x86. In fact it still does on the lowly Raspberry PI (IOT) which is ARM. In the 90's the fastest Windows machine was actually an Silicon Graphics Octane Server (MIPS) with many CPU's and an interconnect design that is very similar to NVLINK. NVIDIA's acquisition of Mellanox really focused on Data Center and the necessary interconnect needed at chip level. They have been "beyond" simple GPU/Gaming for many years. The margins are much higher in the datacenter as is the refresh cycle.
The network card you showed at the end reminds me a lot of IBM Z-Series programmable IO. And yeah, offloading IO stuff to a coprocessor is the secret to crazy high throughput. You guys have seen in your reviews of desktop products how bogged down the system gets with high speed IO if it needs to be handled all by the CPU.
Reinventing mainframes in 2023!
As someone who had to use MLNX Connect-X4 and 5 cards -- the software for these cards is an absolute nightmare.
1.) Sometimes the cards would just stop working until someone physically disconnected, then reconnected the cable.
2.) Kernel Updates in most cases will brick the drivers, causing your network adapter to stop working until you can either reinstall the old driver or install whatever updated driver is released for the new kernel.
3.) On at least one occasion, shortly after NVIDIA bought Mellanox they decided with zero warning to change the default operating mode of the driver from IPoIB to RDMA -- with zero help / instructions on how to revert it. This one left a really bad taste in my mouth.
4.) The Documentation is bad. NVIDIA has done a terrible job of providing even a similar level of documentation to the old Mellanox documentation. Frequently when debugging issues with these cards, I found myself looking up old Mellanox pages for ancient secrets to try and equip myself with enough knowledge to debug stupid problems.
5.)In order to fully utilize these cards you need to operate in RDMA mode, which literally requires that your application be compatible with RDMA/Configured for it. RDMA basically lets data get piped straight from the network to your application -- completely skipping the kernel. It's intended for crazy fast low latency. But they don't really tell you that. So if you're running in IPoIB then you're basically just paying a couple hundred bucks per cable for nothing.
6.) Networking for these things requires you run a network manager on one of your nodes, or pay crazy licensing for a network device that can support running the network manager. So it's dummy thicc easy to setup out of the box -- but it's actually quite a pain to set it up correctly.
TL;DR -- These cards are absolute trash if you don't fully invest into utilizing them correctly, and that impacts everything from hardware to the application. The software isn't robust, and your engineers are going to HATE it. I wouldn't recommend even entertaining the idea unless you absolutely intend to build your entire application/system around the idea of using RDMA.. cause its definitely not something you can just tack on for more performance.
In 2004, my home lab included a 186 drive NAS, full of 18GB 15k RPM drives. It had similar iops and bandwidth to the best SATA SSDs, a decade earlier.
I connected my dual processor workstation to it directly with Fibre Channel.
But this stuff is so advanced... I can't even wrap my head around using it outside of an enterprise environment.
Thats was some amazing hardware! What do you run these days?
Ah I have another clone out there in the wild.. of a sort
Trick is I've worked on some of the bleeding edge goodies and even in a pro environment some of it is so wild it still really not useable by just one person per se
Like my friend (asm coder of god like level).. way back when I was sadly stuck several hundred meters out of the area where cable inet was avail at home he got direct fiber to his home due to his work.. granted his home was only about 400 feet from the nearest main trunk data center by sheer luck.. but thenspeeds he got were so fast nothing he could really build for use at home could use the bandwidth he got.
Since then it's even more insane.. decades later his connection is now so fast his server class top speed flash drive array cant write fast enough to fully use the fiber connection
Lucky fukker he is lol
@@HarshJha My setup has become much more consumer-grade due to the rapid advancement in tech. I've got an i5 running FreeNAS with 4x 12TB and 6x 20TB mechanical drives, and 3x SSDs for cache. It has 10GbE ethernet, but none of my computers have more than 5GbE - and it can fully saturate that. But I almost never do. I only use it to archive home movies (I'm not a professional content creator) because nvme SSDs are just so big / fast / cheap compared to a few years ago. Dual 4TB SSDs is plenty for so many use cases.
wonderful to see linus actually excited about a product and impressed as well. shows his passion for tech.
8:32 You can see the sheer terror in Linus's eyes when someone from off camera threw that card to him. I'll never be able to afford anything that he showed, save the computer setup at the booth. But it's cool seeing a glimpse into what will be.
I'm about 90% sure that was Jake throwing expensive networking cards.
Connect-x cards are expensive but actually not insane expensive. I'd bet based on personal experience looking at costs of other Mellanox cards the 'average' price for the entire connect-x 7 SKU (ie all the different speed and port number options) would be in the $1500 range... the 400GbE will be expensive as, but a 100GbE or 50GbE will be less.
The 'Smart Nic' / DPU on the other hand.... yh them would be very very expensive, but equally, as Linus says, super super cool and for cloud providers a huge benefit to getting more CPU compute out of existing hardware.
PSA from someone who used MLNX connectors during/after the NVIDIA buyout of Mellanox -- If you're a Sys/Network admin those connectors are a fucking train wreck to deal with (at least up to ConnectX 5 cards) There are different versions of the drivers for different hardware types too. Like if you're on dell hardware you have to download a dell specific driver for these cards, and it is NOT well documented that it is the case.
1.) NVIDIA on at least one occasion released a driver set which changed the default mode from IPoIB to RDMA -- which absolutely fucked my environment up.
2.) Frequently kernel updates would brick these drivers -- requiring the sys admin to reinstall the drivers entirely.. so make sure you have multiple ways to access systems that use infiniband.. cause you can easily end up driving out to the data center to fix these things.
3.) We had some devices on Dell hardware that would just randomly stop working -- and the only way to fix them was to literally disconnect the cable and reseat it. Dell, NVIDIA and Mellanox were all zero help in debugging this issue.
Do not blindly buy into using MLNX adapters. We ended up buying into it because NVIDIA recommended them, and the lead engineer thought it sounded cool. We had no idea that NVIDIA was recommending them to us because they intended to buy Mellanox out. NVIDIA's documentation on configuring these drivers leaves a lot to be desired -- and a lot of the time you'll end up looking up old Mellanox documentation to find useful commands that can help debug issues. If you actually need to troubleshoot any issues, you're gonna have a bad time. They're basically just trying to sell DGX and HGX systems -- and while you can get all kinds of fancy numbers/specs from these cards, they absolutely suck manage/troubleshoot. Especially when you throw them into different hardware.
In order to fully utilize these connectors you need to run in RDMA mode. Running RDMA requires that your applications fully support RDMA, because it basically allows data to skip the kernel entirely and get piped straight to the application -- this gives nanosecond latency. That being said if your software doesn't support RDMA, then you're stuck using IPoIB, which is an extremely expensive waste of hardware. Might as well just use regular 40GB fiber at that point.
You'll end up getting sucked into needing network devices that can handle the Mellanox connectors, and either running a network manager as software on one of your nodes, or paying out the ass for licensing to have a network device that can run the network manager software. We had to run some long cables between racks and were spending nearly $800 per cable at one point too.
TL;DR -- DO YOUR RESEARCH COMPLETELY BEFORE BUYING INTO THAT SHIT. It's a huge pain in the ass from a sysadmin standpoint.
As the guy who ended up having to figure out how to get these things working, maintain them, automate patching/updates, troubleshoot performance and network issues with them -- I wouldn't recommend ever using Mellanox for anything unless you build everything else around the idea of using RDMA. It impacts everything from hardware all the way up to the application.
We had about 200 GPU servers using them, and I was the poor sap who was stuck writing ansible playbooks to try and unfuck these things. They made my life hell for quite a while.
1:39 i was waiting for linus to drop this 😅
The fact that their processor is named Grace just to make that Grace Hopper reference is perfection.
I hope they searched the CPU design for bugs :D
Once the Windows for ARM exclusivity ends it’ll be interesting to see if Nvidia has plans for the consumer CPU market
6:55 "Giving them access to up to 150TB of High Bandwidth Memory" The total memory capacity is ~150TB, but only a fraction of it is HBM. Each node has 512GB LPDDR5X but 'only' 80GB HBM.
Cool stuff! I've always wondered what cutting edge server tech stuff was capable of, enjoyed the vid!
I love how I continue to get surprised by the new advances in technology.. never gets old
One thing I super like is the entire chipset cpu and ram all on one single board. This will make a lot of things faster and better. You really only need to offer 1 or 2 ram amounts because you can off set the price of the more ram by selling a lot more units.
The more you understand about how technology works the more amazed you are by the fact it works
@@mr420quickscops2 that too..I'm 40 years old and been around tech my whole life and is my career and continue to get that oooo yeah moments in life when stuff like this comes out