Glad for the re-upload since I'd missed the series, but FYI for folks like me: this interview is from the Claude 2 era.
You're right, although I suspect that by that time they had already modeled what the Claude 3 Opus model's capabilities would be like, and the CEO, more than anyone, would want to know what that means in practice. Hence why his comments appear to be so relevant today.
They're at the very cutting edge of this tech and it's super impressive
one of the best podcasts to date.
This is 7 months old. Get a new interview about Claude 3 Opus.
love your interviews!❤ 🎉
Can you get a new interview Dwarkesh? 🤞
3:49 puts hair back in ear (for some reason?).
Hahahaha yes, it's curious
How about the 1.5 bit representation for LLMs? Surely it will accelerate things when implemented in real-world LLMs?
It definitely could make some kind of difference if it turns out the way it seems so far, and the big players aren't already using the technique. MS still hasn't released the code and I haven't seen any super serious replications yet, but I'm guessing there will be attempts soon. Check out the announcement from Extropic today on another interesting direction things may go (harnessing the inherent randomness in analog systems for more efficient hardware).
Seems like an old video. Please add the recording date to the video's description. Thanks.
2-3 years?! His model Claude is like that already, and GPT-4 too. I don't get it
It can't drive a car
We don’t understand intelligence…
It's an advantage.
Mini cuts are mid
Shutout