NVIDIA discussion [2024]

Yeah, missed the year.
Still 12x within 10 years should still be one of the fastes speed up in networking.
I'm not super knowledgeable on networking stuff but standards-wise we went from 1gbps in 1998 to 10gbps 2002. So 4years for a 10x increase.
How long it took for it to actually materialize and be beneficial... I have no idea.
 
I'm not super knowledgeable on networking stuff but standards-wise we went from 1gbps in 1998 to 10gbps 2002. So 4years for a 10x increase.
How long it took for it to actually materialize and be beneficial... I have no idea.

10GBps in 2002 was over fiber. 10GBase-T was released in 2006, 4 years later. It seems that ~10X over 8 years is the norm over different technologies.
 
It sounded to me (based on Jen Hsun’s insistence they will push everything including process node to the limit) that Blackwell Ultra might actually be a N3E or N3P shrink (maybe with more SMs but maybe not?) rather than just Blackwell with faster memory.

That would make a lot more sense than the previous rumours implying they’d have a new generation every year; more of a tick-tock with different teams working on them, which is also what NVIDIA did very early on (see: TNT->TNT2, GF1->GF2, GF3->GF4, etc… none of them had any big changes besides the process, except maybe the GF2 going from 4x1 to 4x2 pipelines, but there were rumours at the time GF1 was supposed to be 4x2 but bugged).
 
Rubin is similar still different than Blackwell in order to offer substantial performance improvement.
Similar:
2 dies linked by a mega bandwidth Silicon bridge
8 stacks of HBM
Different:
I/O have now their own dies (*2) so the main 800mm2 dies on N3P are compute only for maximum density
HBM is now version "4" but it's custom for NV to reduce power and accommodate their aggressive schedule
 
Last edited:
Does nVidia think that Grace is good enough for the next two years? Next generation would have released next year...
 
It sounded to me (based on Jen Hsun’s insistence they will push everything including process node to the limit) that Blackwell Ultra might actually be a N3E or N3P shrink (maybe with more SMs but maybe not?) rather than just Blackwell with faster memory.

That would make a lot more sense than the previous rumours implying they’d have a new generation every year; more of a tick-tock with different teams working on them, which is also what NVIDIA did very early on (see: TNT->TNT2, GF1->GF2, GF3->GF4, etc… none of them had any big changes besides the process, except maybe the GF2 going from 4x1 to 4x2 pipelines, but there were rumours at the time GF1 was supposed to be 4x2 but bugged).

I got the same feeling from the presentation. Would also fit with Rubin N3P and Rubin Ultra N2.
 
It sounded to me (based on Jen Hsun’s insistence they will push everything including process node to the limit) that Blackwell Ultra might actually be a N3E or N3P shrink (maybe with more SMs but maybe not?) rather than just Blackwell with faster memory.

That would make a lot more sense than the previous rumours implying they’d have a new generation every year; more of a tick-tock with different teams working on them, which is also what NVIDIA did very early on (see: TNT->TNT2, GF1->GF2, GF3->GF4, etc… none of them had any big changes besides the process, except maybe the GF2 going from 4x1 to 4x2 pipelines, but there were rumours at the time GF1 was supposed to be 4x2 but bugged).
I always figured this was gonna be the case. They weren't gonna have the ability to do completely new architectures every year, nor would that even make a lot of sense. They only ever said they wanted to have a new 'generation' every year or whatever. But generation doesn't have to mean some entirely new architecture or anything. Intel 14th generation CPU's, for instance....

So yea, a proper new architectural leap, and then a more optimized/refined/spec-improved/process-leap version of it in between.
 
3 months later:
N100 is not Blackwell successor. It's Rubin
Rubin is not a 4x830mm2 die monstrosity (it never existed)
Just to say. Believe who you want...
 
B200 is the 1000W x86 variant of Blackwell. Blackwell Ultra is something else.
B200 refers to B for Blackwell generation GPU and 200 for 2* 100 GPU dies. No relationship with X86
GB200 is G for Grace CPU, B for Blackwell GPU and 200 for 2* 100 GPU dies
VR200 is V for Vera CPU, R for Rubin GPU and 2* 100 GPU dies
 
Internal codenames, yes. But Blackwell-Ultra sounds more than just 50% more memory.
I don’t think it’s possible to tell from that keynote whether Blackwell-Ultra is “just another H200” (which was also more bandwidth, and possibly a respin, not just more capacity) or if it is indeed a process shrink or more. Besides the insistence on pushing the limits of everything, the only other hint is H100 and H200 have the same “mini die shot” on the slide while Blackwell/Blackwell-Ultra are different, which is hardly conclusive.

However if Blackwell-Ultra is a H200-level change or less, then let me say bluntly that the “yearly cadence” would be complete BS. No one inside a GPU company would consider that a yearly cadence, especially when there have been companies with true yearly cadences in the mobile market with architectural changes nearly every year.

Honestly NVIDIA has too much money *not* to be making a new flagship chip every year. They need to increase their R&D as much as possible as quickly as possible to maintain their competitive advantage, but it’s hard to greatly increase team sizes in a way that doesn’t result in diminishing returns and other inefficiencies (never mind just replacing the people leaving because they made more than enough money to retire on their stock grants!)

Creating a process shrink of the flagship on a tick-tock cadence is the probably the easiest way to spend more money by creating a new independent team without affecting the rest of the roadmap. That doesn’t mean they are actually doing it, but I could not reasonably defend their failure to do so, it would highlight a catastrophic failure of execution to still only have one new flagship every two years while soon making nearly $100B revenue a year from that single chip…
 
Back
Top