GTC 2024

Nvidia is getting closer and closer to a hardware subscription model, with a separate subscription for the software stack.

I wonder if they'll stop selling hardware outright in the next ~5 years or so.
 
So, basically Nvidia has done Ampere/Hopper, just with two dies. Impressive. Oh, and delivering 10 PFlops of FP8 performance within 208 billion transistors is out of this world. The 10 TB/s interconnect means full bandwidth between both dies from the HBM3e.

Better than expected. NVLink at 1.8 TB/s is another surprise.
 
Assuming the high-level arch is largely unchanged compared to Hopper, it looks like a huge layout improvement. In a similarly sized die on the same node, it's got 20% more SMs, 2x L2$, and 30% less power, at similar clock speeds.
 
The world’s first networking platforms capable of end-to-end 800Gb/s throughput, NVIDIA Quantum-X800 InfiniBand and NVIDIA Spectrum™-X800 Ethernet push the boundaries of networking performance for computing and AI workloads. They feature software that further accelerates AI, cloud, data processing and HPC applications in every type of data center, including those that incorporate the newly released NVIDIA Blackwell architecture-based product lineup.
 
Going by this, Blackwell at 700W delivers 75% more performance than H100/H200. Or, one GPU die would be at around 87.5% of H100/H200 performance at 350W.
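Rough arithmetic behind that 87.5% figure, as a sketch only. It assumes "one GPU" means one of the two dies, that performance and power split evenly across both dies, and that the H100/H200 baseline is normalized to 1.0 at 700W. The variable names are made up for illustration, not taken from any Nvidia material.

```python
# Assumption: H100/H200 baseline normalized to 1.0 at 700W.
baseline_perf = 1.0
baseline_power_w = 700

# From the post: Blackwell at 700W is 75% faster than the baseline.
blackwell_perf = 1.75
blackwell_power_w = 700

# Assumption: two dies, with performance and power split evenly.
dies = 2
per_die_perf = blackwell_perf / dies        # relative to the baseline
per_die_power_w = blackwell_power_w / dies

# Perf per watt of one die relative to the baseline part.
perf_per_watt_gain = (per_die_perf / per_die_power_w) / (baseline_perf / baseline_power_w)

print(f"one die: {per_die_perf:.1%} of baseline perf at {per_die_power_w:.0f}W")
print(f"perf/W vs baseline: {perf_per_watt_gain:.2f}x")
```

So under those assumptions the per-die number is just 1.75 / 2, and the perf/W gain stays at 1.75x either way.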
There are 4 versions of Blackwell releasing simultaneously.

B100 (air): 14 PF of FP4 at 700W
B200 (air): 18 PF of FP4 at 1000W
B200 (water): 20 PF of FP4 at 1200W
GB200 (water): 40 PF of FP4 at 2700W (1 Grace CPU + 2 B200 GPUs)
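For anyone who wants the perf/W picture, here is a quick sketch using only the numbers listed above. Treat it as back-of-the-envelope: the GB200 wattage includes the Grace CPU, so it is not a like-for-like comparison with the GPU-only parts, and the dictionary layout is just for illustration.

```python
# FP4 petaflops and watts exactly as quoted in the list above.
parts = {
    "B100 (air)": (14, 700),
    "B200 (air)": (18, 1000),
    "B200 (water)": (20, 1200),
    "GB200 (water)": (40, 2700),  # includes 1 Grace CPU + 2 B200 GPUs
}

def pf_per_kw(petaflops, watts):
    """FP4 petaflops delivered per kilowatt of board power."""
    return petaflops * 1000 / watts

efficiency = {
    name: round(pf_per_kw(pf, watts), 2)
    for name, (pf, watts) in parts.items()
}

for name, value in efficiency.items():
    print(f"{name}: {value} PF/kW")
```

On these figures the 700W B100 comes out as the most efficient of the four, and each step up in power buys less performance per watt.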
 
This.
Can't win against the APU.
APU wins so unbelievably hard for people who roll their own codebases.

ML farm is the real battlefield anyway.
B100/200 are priced very aggressively given how much more expensive they are relative to H100.

The market seemed to respond positively to the price announcement after some jitters earlier this morning.

$30-$40K isn’t cheap but Blackwell perf/$ should be significantly more attractive than Hopper. Nvidia is playing their usual game. Get your foot in the door everywhere and make it hard for people to switch ecosystems. Their biggest competition would be CSPs rolling their own hardware and they will try their very best to make that an unattractive option.
 
The market seemed to respond positively to the price announcement after some jitters earlier this morning.
Finally, $AMD is down, people can load on it moar.
$30-$40K isn’t cheap but Blackwell perf/$ should be significantly more attractive than Hopper
That's Hopper pricing (hyperscale will pay less as usual) for 2x the Si and 2 moar HBMs.
Margins kaput but hey, competition!
Get your foot in the door everywhere and make it hard for people to switch ecosystems
The 'ecosystem' is at large, PyTorch.
If your stuff is in mainline PyTorch and runs with good perf out of the box, you're good.
(that doesn't account for truly boutique solutions like Cerebras WSE though).

The real moat is and has always been the hardware.
Ever since V100, NV has led the market on matrix math piles at speed.
A100 in particular was kinda a watershed moment for the market at large.
Nvidia is playing their usual game
No, they're doing a competitive response. lol.
Their biggest competition would be CSPs rolling their own hardware and they will try their very best to make that an unattractive option.
Their biggest competition is MI400. They know it.
 
Eh, AMD has a long history of producing attractive hardware that goes nowhere.
Ughhhh it's been 5 years since Rome.
If there's anything current AMD is good at, it's running a consistent roadmap at all costs.
When it becomes not-consistent, they do a reorg and knife a bunch of parts (which, they did! byebye the og mi350x. and byebye the og horrific Si spam MI400).
Let’s see them deliver the goods first before claiming Nvidia is running scared.
Oh they do.
B100 would've been 50k otherwise.
Gotta defend the CSP share at all costs.

Frankly it's just competitive response.
Panic button stuff is the Intel staple (more on that later).
 
if they will just cede the market to AMD
They would rather focus on Sovereign AI, they seem to have a division for custom chips and can make cheaper Grace APUs easily if there is a need for it, but right now there isn't.

Eh, AMD has a long history of producing attractive hardware that goes nowhere
Yep, remember MI100? MI250X? Even MI300 is having a rather tame reception compared to H100/H200.
 
They would rather focus on Sovereign AI
what even
they seem to have a division for custom chips
AMD's been running semicustom biz for like 13 years.
can make cheaper Grace APUs
that is NOT an APU. not at all.
remember MI100
Good devkit.
Hard to forget Frontier.
Even MI300 is having a rather tame reception compared to H100/H200.
? This is outright cope.
It's a new product from (effectively) a new vendor, the first real engagement from AMD with hyperscale wrt GPGPU stuff, and it's doing hella numbers starting end of this Q.
 