Nvidia Blackwell Architecture Speculation

Broopster · Jan 7, 2025

5080 and 5090 both coming 1/30.

The bar graph on the 5090 isn’t exactly precise but the non-DLSS uplift seems to be well under 1.5x.

arandomguy · Jan 7, 2025

The actual 50 series product page is up, just not linked yet from the main page -

NVIDIA GeForce RTX 50 Series Graphics Cards

Powered by NVIDIA Blackwell.

www.nvidia.com

Bondrewd · Jan 7, 2025

Broopster said:
The bar graph on the 5090 isn’t exactly precise but the non-DLSS uplift seems to be well under 1.5x.

it's probably a CPU limit.

vola · Jan 7, 2025

clock frequencies seems to be similar to Ada so the performance uplift for the 5080 will be from the improved SM architecture

Scott_Arm · Jan 7, 2025

Broopster · Jan 7, 2025

Bondrewd said:
it's probably a CPU limit.

Very well could be, although there are games out there like Alan Wake that are relatively CPU light, not sure why they’d pick a game like Far Cry 6. Also it’s roughly in line with the uplift on other cards despite the 5090 being the only card with a substantial SM bump. Interesting.

Edit: See below, could also be that clock speeds on the 5090 are down (they look up elsewhere but only modestly). So much for those 2.9Ghz base clocks.

Also, the fine print says there is a MFG (max frame gen I’m guessing) 4x model.

Scott_Arm · Jan 7, 2025

I've kind of tuned out the world destroying ai part of the presentation, but the links are getting interesting.

Bondrewd · Jan 7, 2025

Broopster said:
being the only card with a substantial SM bump

it's also 2x membw. Seems odd.

Broopster · Jan 7, 2025

Broopster said:
Very well could be, although there are games out there like Alan Wake that are relatively CPU light, not sure why they’d pick a game like Far Cry 6. Also it’s roughly in line with the uplift on other cards despite the 5090 being the only card with a substantial SM bump. Interesting.

Also, the fine print says there is a MFG (max frame gen I’m guessing) 4x model.

Looks like the clock speeds are down a bit on the 5090 - base clock is 2Ghz, boost is 2.4. Presumably this was to keep the power in check, though it may also explain those 2x power connector rumors.

DegustatoR · Jan 7, 2025

New GeForce RTX 50 Series Graphics Cards & Laptops Powered By NVIDIA Blackwell Bring Game-Changing AI and Neural Rendering Capabilities To Gamers and Creators

Multiply performance by up to 8X using DLSS 4 with Multi Frame Generation, reduce PC latency by up to 75% with Reflex 2, and experience next-generation RTX Neural Rendering.

www.nvidia.com

NVIDIA DLSS 4 Introduces Multi Frame Generation & Enhancements For All DLSS Technologies

75 DLSS Multi Frame Generation games and apps available on day 0; graphics industry’s first, real-time transformer model enhances image quality for DLSS Ray Reconstruction, Super Resolution, and DLAA.

www.nvidia.com

DLSS Multi Frame Generation & New RTX Technologies Coming To Black State, DOOM: The Dark Ages, Dune: Awakening, and More. 75 Games and Apps At Launch & More On The Way

Multiply performance by up to 8X and experience new cutting-edge NVIDIA RTX ray tracing and AI technologies in Alan Wake 2, Black Myth: Wukong, Marvel Rivals, NARAKA: BLADEPOINT, and many other titles.

www.nvidia.com

NVIDIA Reflex 2 With New Frame Warp Technology Reduces Latency In Games By Up To 75%

Innovative new technology improves responsiveness by updating rendered frames based on latest mouse input.

www.nvidia.com

Project G-Assist: An AI Assistant For GeForce RTX AI PCs, Comes To NVIDIA App In February

Optimize performance, configure PC settings, and more with a voice-powered AI Assistant, all run locally on GeForce RTX GPUs.

www.nvidia.com

NVIDIA Redefines Game AI With ACE Autonomous Game Characters

PUBG: BATTLEGROUNDS, inZOI, MIR5 & NARAKA: BLADEPOINT MOBILE PC VERSION are the first games to incorporate autonomous companions, enemies, and game systems powered by NVIDIA ACE.

www.nvidia.com

arandomguy · Jan 7, 2025

Getting the transistor counts and die sizes for the small chips will be interesting.

I wonder if GB203 is going to be roughly the same transistor count and die size, or even smaller?, as AD103. TPU's spec page actually has it smaller in terms of die size at the moment, but I'm not sure what that is based on. However if it does come in at ~half the tranistor count of GB202 that would be it at 46b which is the same as AD103.

Getting both signficant perf and capability uplift without any increase in transistors on just likely an iteration of the same node is something.

Scott_Arm · Jan 7, 2025

The new transformer model for ray reconstruction looks awesome, and you can use the Nvidia app to overide the native ray reconstruction in the game (same with frame gen where you can force 4x instead of 1x).

Broopster · Jan 7, 2025

arandomguy said:
Getting the transistor counts and die sizes for the small chips will be interesting.

I wonder if GB203 is going to be roughly the same transistor count and die size, or even smaller?, as AD103. TPU's spec page actually has it smaller in terms of die size at the moment, but I'm not sure what that is based on. However if it does come in at ~half the tranistor count of GB202 that would be it at 46b which is the same as AD103.

Getting both signficant perf and capability uplift without any increase in transistors on just likely an iteration of the same node is something.

SM count is up but only by ~5%.

Broopster · Jan 7, 2025

Scott_Arm said:
The new transformer model for ray reconstruction looks awesome, and you can use the Nvidia app to overide the native ray reconstruction in the game (same with frame gen where you can force 4x instead of 1x).

Will be really intrigued to see this in action more. I have to imagine you’ll need quite a high refresh rate to avoid noticeable latency on a 4x framegen

Scott_Arm · Jan 7, 2025

Frame warping in Reflex 2. Super cool. Will love to see how that looks/works hands-on.

Broopster · Jan 7, 2025

With the 5090 it will be interesting to see how much performance they may have left on the table due to power if a dual power connector AIB model comes out (Kingpin has implied they will be making one).

Scott_Arm · Jan 7, 2025

Broopster said:
SM count is up but only by ~5%.

It's around 10.5% on the 5080 over 4080 with a very minor clock bump. Has some shader execution reordering and better tensor utilization so that will gain something, but I would guess just standard raw non-DLSS non-DXR performance is going to be like 20%.

TopSpoiler · Jan 7, 2025

NVIDIA RTX Neural Rendering Introduces Next Era of AI-Powered Graphics Innovation | NVIDIA Technical Blog

NVIDIA today unveiled next-generation hardware for gamers, creators, and developers—the GeForce RTX 50 Series desktop and laptop GPUs. Alongside these GPUs, NVIDIA introduced NVIDIA RTX Kit…

developer.nvidia.com

The number of triangles used to create games has exponentially increased over the past 30 years. With the introduction of the Unreal Engine 5 Nanite geometry system, developers can build open worlds filled with hundreds of millions of triangles. However, as ray traced game scenes explode in geometric complexity, the cost to build the bounding volume hierarchy (BVH) each frame for various levels of detail (LOD) grows exponentially, making it impossible to achieve real-time frame rates. RTX Mega Geometry accelerates BVH building, making it possible to ray trace up to 100x more triangles than today’s standard.

RTX Mega Geometry intelligently updates clusters of triangles in batches on the GPU, reducing CPU overhead and increasing performance and image quality in ray traced scenes. RTX Mega Geometry is coming soon to the NVIDIA RTX Branch of Unreal Engine (NvRTX), so developers can use Nanite and fully ray trace every triangle in their projects. For developers using custom engines, RTX Mega Geometry will be available at the end of the month as an SDK to RTX Kit. Sign up to be notified of availability.

Broopster · Jan 7, 2025

Scott_Arm said:
It's around 10.5% on the 5080 over 4080 with a very minor clock bump. Has some shader execution reordering and better tensor utilization so that will gain something, but I would guess just standard raw non-DLSS non-DXR performance is going to be like 20%.

In a sense it’s what people should expect right now - a new gen with a modest but respectable raw uplift at similar (or better) prices and a focus on better features/AI. In that sense it has the makings of a success, especially in the mid range.

Still a bit of a surprise on the 5090 though - the raw uplift looks in line with the other cards despite all that extra silicon and bandwidth. Although Far Cry 6 is pretty CPU limited so they may be hiding the ball a bit there.

Scott_Arm · Jan 7, 2025

@Dictator has to be drooling at CES right now.