Nvidia Blackwell Architecture Speculation

Every result where the gain is more than +15% (half of the FLOPS gain) is highly likely to be a result helped by the additional memory bandwidth.
So in a hypothetical world where the bandwidth had increased by the same amount as the FLOPS gain (~32%) to maintain the same ratio as the 4090, what performance difference would we see?
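Putting rough numbers on that hypothetical (back-of-the-envelope only: the bandwidth figures are the published 1008 GB/s and 1792 GB/s specs, and the ~32% FLOPS gain is the number quoted above):

```python
# Rough sketch of the "same FLOPS-per-byte ratio as the 4090" hypothetical.
# Bandwidth figures are published specs; the FLOPS gain is the ~32% quoted above.

bw_4090 = 1008   # GB/s (384-bit GDDR6X @ 21 Gbps)
bw_5090 = 1792   # GB/s (512-bit GDDR7 @ 28 Gbps)
flops_gain = 0.32

bw_gain = bw_5090 / bw_4090 - 1                # ~ +78% in reality
bw_hypothetical = bw_4090 * (1 + flops_gain)   # ~ 1330 GB/s to keep the 4090 ratio

print(f"Actual bandwidth gain:   {bw_gain:+.0%}")
print(f"Ratio-matched bandwidth: {bw_hypothetical:.0f} GB/s (vs. {bw_5090} GB/s actual)")

# Roofline-style intuition: a purely compute-bound game scales with the FLOPS gain
# (~+32%), a purely bandwidth-bound one with the bandwidth gain (~+78%), and real
# games land somewhere in between, which is where the +15%..+50% spread comes from.
```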
 
I'm sure someone reading this will get a 5090. It would be very interesting to downclock the memory and see how performance scales. But it's only interesting if you post the results here :yes:
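For whoever does try it, a rough sketch of what such a sweep could look like, assuming the driver actually honors nvidia-smi's memory clock locking on this card (not a given on consumer SKUs) and with the benchmark itself left as a placeholder:

```python
# Sketch of a memory-clock scaling sweep. Assumes `nvidia-smi --lock-memory-clocks`
# is supported on this card/driver (requires admin rights; not guaranteed on
# consumer SKUs) and that run_benchmark() is replaced with a real, repeatable test.
import subprocess

def lock_memory_clock(mhz: int) -> None:
    # Pin the memory clock to a single value.
    subprocess.run(["nvidia-smi", f"--lock-memory-clocks={mhz},{mhz}"], check=True)

def reset_memory_clock() -> None:
    subprocess.run(["nvidia-smi", "--reset-memory-clocks"], check=True)

def run_benchmark() -> float:
    # Placeholder: run your benchmark of choice and return an average FPS.
    raise NotImplementedError("plug in a real benchmark here")

# Hypothetical clock steps; check `nvidia-smi -q -d SUPPORTED_CLOCKS` for real values.
for mhz in (1750, 1500, 1250, 1000):
    lock_memory_clock(mhz)
    fps = run_benchmark()
    print(f"{mhz} MHz -> {fps:.1f} fps")

reset_memory_clock()
```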
 
On average? I doubt the results would be much different. Some games which are showing +50% now, though, would start showing something closer to +30%.
 
Ok, so the question is whether on average the 5070 Ti and 5070 will see the ~20% improvements suggested by the slides, or whether Nvidia just cherry-picked games that benefit from the extra bandwidth.
 
We'll know the answer to that in several days, since what the 5080 shows will likely be very similar to what the 5070 Ti will show, and probably the 5070 too.
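To illustrate why the sample matters (every per-game number below is made up purely for illustration, not a measurement):

```python
# Illustration only: how picking bandwidth-hungry titles can skew the headline number.
from statistics import geometric_mean

compute_bound   = [1.08, 1.10, 1.12]   # games limited mostly by the FLOPS gain
bandwidth_bound = [1.20, 1.25]         # games that love the extra bandwidth

slide_sample = bandwidth_bound                  # what a cherry-picked slide might show
broad_sample = compute_bound + bandwidth_bound  # what a wide review suite might show

print(f"Slide-style average: {geometric_mean(slide_sample) - 1:+.0%}")   # ~+22%
print(f"Broad average:       {geometric_mean(broad_sample) - 1:+.0%}")   # ~+15%
```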
 
Just purely speculating here, but I wonder if GB202 has some experimentation in terms of how to approach an MCM design. This would be akin to how they approached preceding GPUs before actually moving to MCM with GB200.
 
It's interesting that the CNN model seems to use less VRAM in the above? I'm wondering if the transformer models leverage FP8 (or INT8) more, which wouldn't benefit Ampere (and presumably Turing as well).
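If that guess is right, the weight footprint alone would scale with precision; a toy calculation (the parameter count is a made-up placeholder, since the DLSS model sizes aren't public):

```python
# Toy calculation: VRAM footprint of model weights at different precisions.
# The parameter count is a made-up placeholder; DLSS model sizes aren't public.
params = 50_000_000   # hypothetical parameter count

bytes_per_param = {"FP16": 2, "FP8/INT8": 1}
for fmt, size in bytes_per_param.items():
    print(f"{fmt}: {params * size / 2**20:.0f} MiB of weights")

# GPUs without FP8 tensor-core support (Ampere, Turing) would have to keep those
# weights in a wider format, roughly doubling that part of the footprint.
```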
 