Speculation and Rumors: Nvidia Blackwell ...

techuse · Oct 11, 2024

iroboto said:
When is the expected reveal show/date ?

CES for the gaming chips seems to be expected.

iroboto · Oct 11, 2024

techuse said:
CES for the gaming chips seems to be expected.

Oh ok. So Jan 2025

Broopster · Oct 12, 2024

iroboto said:
Oh ok. So Jan 2025

Yeah, they could probably launch some of them now but they seem to want as much 40-series stock cleared out as possible before the launch. Hopefully we’ll get at least one of the cards by end of January. Early word was they’d release the 5080 before the 5090, but unless it’s priced very competitively I’d think they’d want the halo card out first. In any case they’ll probably release within a few weeks of each other as usual - wouldn’t be surprised to see the rumored 5070 launch much later, in the spring. Of course political developments could affect all of this. I hope not but the possibility can’t be entirely discounted.

Deleted member 2197 · Oct 12, 2024

NVIDIA "Blackwell" DGX B200 Listed, Prices Start At A Half A Million Dollars For The Top of The Line AI Hardware

NVIDIA's DGX B200 "Blackwell" AI server has witnessed its first apparent retail listing by the high-end server solution provider Broadberry

wccftech.com

The listing at Broadberry has put a price tag of $515,410.43 on the Blackwell DGX B200 AI system, with configuration options as well, mainly dealing with after-sale services. This is the first instance where we have seen NVIDIA's Blackwell AI product surface over the internet in the form of a retail listing, and while we are currently unaware of the supply situation, it is said that Blackwell will be initially confined, with a larger portion of shipments slated for the first quarter of next year.

Tkumpathenurple · Oct 13, 2024

How well does it run Crysis though?

On a genuine note, why have they gone with Intel rather than AMD for the CPU? I thought AMD performed better these days?

Deleted member 2197 · Oct 13, 2024

How do Nvidia Blackwell GPUs train AI models with 4-bit math?

Bigger chips crunching smaller numbers.

www.zach.be

When you look at FP8 training performance on the spec chart below, B200 is 2.5x faster than Hopper -- which is only 1.25x faster on a per-die basis. How did they get to that “5x faster training” number? Well, B200 has another new feature to double per-die performance over Hopper: 4-bit arithmetic.

4 bits doesn’t seem like a lot. If you’re using those bits to represent integers, you can only count up to 16. But Nvidia’s GPUs feature some really clever technology to squeeze the most utility out of those 4-bit numbers. They’re called “mixed precision tensor cores,” and if you want to understand Nvidia’s dominance at AI, you need to understand how they work.
...
The closer a network can get to being represented entirely with FP4 operations, the closer Blackwell’s training performance can get to that eye-popping 5x number Nvidia cited. And luckily, there’s already some research showing that networks can train with FP4 operations without significant loss of accuracy. If those results can scale to GPT-4-scale networks, then Nvidia has a huge advantage over other datacenter AI chips, which, as far as I can tell, don’t yet support these FP4 operations.

Erinyes · Oct 15, 2024

Tkumpathenurple said:
How well does it run Crysis though?

On a genuine note, why have they gone with Intel rather than AMD for the CPU? I thought AMD performed better these days?

Could simply be a matter of timing, i.e. what was available earlier for qualification/validation. And anyway the CPU performance of these clusters is not relevant, the CPU is almost an accessory. They will likely qualify newer generation processors for third party servers in the coming quarters.

DegustatoR · Oct 23, 2024

https://videocardz.com/newz/nvidia-blackwell-gb203-gpu-allegedly-pictured-expected-to-power-the-rtx-5090-5080-laptop-series

Blackwell laptop designs are seemingly also launching at CES 25.

Press Center - NVIDIA Renames Blackwell Ultra to B300 Series; CoWoS-L Expected to See Growth by 2025, Says TrendForce | TrendForce - Market research, price trend of DRAM, NAND Flash, LEDs, TFT-LCD and green energy, PV

TrendForce reports that NVIDIA has recently rebranded all its Blackwell Ultra products to the B300 series. Looking ahead to 2025, NVIDIA plans to strategically promote the B300 and GB300 lines—which utilize CoWoS-L technology—thereby boosting the demand for advanced packaging solutions.

www.trendforce.com

Also apparently B300 is a thing now.

Kaotik · Oct 23, 2024

Jensen confirmed there was a design flaw which caused low yields, but it's been now fixed

https://www.reuters.com/technology/artificial-intelligence/nvidias-design-flaw-with-blackwell-ai-chips-now-fixed-ceo-says-2024-10-23/

trinibwoy · Nov 20, 2024

More Blackwell server announcements. No new architecture details. I wonder why Nvidia hasn’t shared any details of the SM configuration, cache or clocks.

Based on the limited information they’ve shared so far it’s hard to tell whether Blackwell is a further refinement of Volta/Ampere/Hopper or something more.

“The GB200 Grace Blackwell NVL4 Superchip integrates four NVIDIA NVLink-connected Blackwell GPUs unified with two Grace CPUs over NVLink-C2C, Buck said. It provides up to 2x performance for scientific computing, training and inference applications over the prior generation.

The GB200 NVL4 superchip will be available in the second half of 2025.”

DegustatoR · Nov 20, 2024

trinibwoy said:
I wonder why Nvidia hasn’t shared any details of the SM configuration, cache or clocks.

They are waiting on gaming launch for that.

trinibwoy · Nov 20, 2024

DegustatoR said:
They are waiting on gaming launch for that.

Not sure why they would wait on gaming to share specs of an HPC/AI chip. At Hopper launch they spilled all the beans.

DegustatoR · Nov 20, 2024

trinibwoy said:
Not sure why they would wait on gaming to share specs of an HPC/AI chip. At Hopper launch they spilled all the beans.

Hopper is still Volta class, Blackwell is presumably different.

trinibwoy · Nov 22, 2024

5070 Ti rumored to be based on GB203 with only 6% more SMs than the 4070 Ti Super and 16% more than the 4070 Ti. The optimist in me thinks Blackwell SMs must be a lot more efficient or clock much higher.

iroboto · Nov 22, 2024

trinibwoy said:
5070 Ti rumored to be based on GB203 with only 6% more SMs than the 4070 Ti Super and 16% more than the 4070 Ti. The optimist in me thinks Blackwell SMs must be a lot more efficient or clock much higher.

$$$$

gotta keep those costs to the consumer in control. There are upper limits on everything I suspect.

Dangerman · Nov 22, 2024

The rumours specs (all but confirmed) from the SKUs really makes me really wish 5080 was 5070 Ti (& GB203* being renamed into GB204), the 5070 Ti the 5070 and the 5070 the 5060 Ti etc.

*GB203 could've been a 128SM, 384 bit die and just below 500mm2 with a 24GB 5080 Ti & 20GB 5080 SKUs. But I suppose crazy AI demand makes Nvidia more desirable to just make a huge gap between GB202 & 203.

DegustatoR · Nov 22, 2024

Dangerman said:
The rumours specs (all but confirmed) from the SKUs really makes me really wish 5080 was 5070 Ti (& GB203* being renamed into GB204), the 5070 Ti the 5070 and the 5070 the 5060 Ti etc.

You know nothing about prices or performance of these parts so what makes you wish them having different names now?

vola · Nov 22, 2024

https://videocardz.com/newz/nvidia-geforce-rtx-5090-gb202-gpu-die-reportedly-measures-744-mm2-20-larger-than-ad102

trinibwoy · Nov 22, 2024

Dangerman said:
*GB203 could've been a 128SM, 384 bit die and just below 500mm2 with a 24GB 5080 Ti & 20GB 5080 SKUs. But I suppose crazy AI demand makes Nvidia more desirable to just make a huge gap between GB202 & 203.

That would be nice but I wonder if the problem is that games just don’t scale with lots of SMs. AD102 has 80% more SMs than AD103. The 4090 has 60% more SMs than the 4080 but after all that the 4090 is only about 25% faster.

Would a 128 SM GB203 be much slower than the 5090?

DavidGraham · Nov 23, 2024

vola said:
https://videocardz.com/newz/nvidia-geforce-rtx-5090-gb202-gpu-die-reportedly-measures-744-mm2-20-larger-than-ad102

So the 5090 die is 20% larger than the 4090 die, bus is 512 bit with 32GB of VRAM, and PCIe 5, I am guessing this is going to be expensive as hell, hopefully with a performance uplift as impressive as the specs.

Speculation and Rumors: Nvidia Blackwell ...

techuse

iroboto

Daft Funk

Broopster

Deleted member 2197

Guest

NVIDIA "Blackwell" DGX B200 Listed, Prices Start At A Half A Million Dollars For The Top of The Line AI Hardware

Tkumpathenurple

Deleted member 2197

Guest

How do Nvidia Blackwell GPUs train AI models with 4-bit math?

Erinyes

DegustatoR

Press Center - NVIDIA Renames Blackwell Ultra to B300 Series; CoWoS-L Expected to See Growth by 2025, Says TrendForce | TrendForce - Market research, price trend of DRAM, NAND Flash, LEDs, TFT-LCD and green energy, PV

Kaotik

Drunk Member

trinibwoy

Meh

DegustatoR

trinibwoy

Meh

DegustatoR

trinibwoy

Meh

iroboto

Daft Funk

Dangerman

DegustatoR

vola

trinibwoy

Meh

DavidGraham