Nvidia Blackwell Architecture Speculation

  • Thread starter Deleted member 2197
The 40 series has essentially vanished from shelves in the US and apparently the 50 series is still somewhere on the factory floor. Has this situation happened before, where Nvidia has nothing to sell, especially on a mature node? Not situations like COVID where demand is just out of control, but literally having nothing to sell. They provided early warning in the last earnings call, but even 4060s are drying up and their replacements aren’t expected for a few more months at best.
Nvidia ceased production some while back thinking the current stock would be sufficient, only to find it's not?
 
The 40 series has essentially vanished from shelves in the US and apparently the 50 series is still somewhere on the factory floor. Has this situation happened before, where Nvidia has nothing to sell, especially on a mature node? Not situations like COVID where demand is just out of control, but literally having nothing to sell. They provided early warning in the last earnings call, but even 4060s are drying up and their replacements aren’t expected for a few more months at best.
It's unprecedented as far as I can recall. Would be bad for them if buyers were willing to try an alternative. I think they'll find most buyers will likely buy nothing and wait rather than buy something else. Also Radeon supply is pretty spotty as well.
 
Nvidia ceased production some while back thinking the current stock would be sufficient, only to find it's not?

I don’t believe this rumor about 40 series production stopping back in December. You can buy a Dell with 40 series cards right now and have them in a week.

There are two components right now that appear to be delaying Dell shipments:
14900KF - 3 weeks
RTX 50 series - 7 weeks (5080s were available last night; now those are gone)
 
Only under DirectX 11 though
The opposite. DX12 is supported, DX11 is not.
For Direct3D11 games it is currently strongly NOT recommended to enable RTSS overlay and Smooth Motion at the same time. In such environment overlay will not work properly and will cause performance loss. Overlay compatibility with Direct3D11 games with Smooth Motion mode is expected in future versions of RTSS.
 
It's not for all SKUs, and Dell being Dell may have accumulated ample inventory in advance.

Dell doesn’t carry inventory, they have been JIT manufacturing from their very beginning. For example in their last quarter they had $18.2B in sales but just $3.6B of inventory (including EMC). That’s less than three weeks of inventory for the company.
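
As a rough check of the "less than three weeks" figure, here is a back-of-the-envelope sketch. It assumes a 13-week quarter and measures inventory against sales rather than cost of goods sold (both are my assumptions, and the function name is only for illustration).

```python
def weeks_of_inventory(quarterly_sales_b, inventory_b, weeks_per_quarter=13):
    """Weeks of sales that the inventory on hand would cover.

    Assumption: inventory is compared against sales, not cost of goods sold,
    so this is only a rough estimate.
    """
    weekly_sales = quarterly_sales_b / weeks_per_quarter
    return inventory_b / weekly_sales


# Figures quoted in the post, in billions of dollars.
weeks = weeks_of_inventory(18.2, 3.6)
print(f"{weeks:.1f} weeks of inventory")
```

With the quoted numbers this comes out to roughly two and a half weeks, consistent with the claim.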

Having seen their PC plant in Austin, they literally have semi trucks unloading components on one side of the building and being brought directly to the line. Straight into a PC, tested, and loaded onto an outbound truck on the other end of the building. That’s how they can get you a BTO system at your door in a few days.

It’s also the best way to get a video card when there are shortages. Buy a Dell PC, pull out the card, and eBay the rest.
 
Dell doesn’t carry inventory, they have been JIT manufacturing from their very beginning. For example in their last quarter they had $18.2B in sales but just $3.6B of inventory (including EMC). That’s less than three weeks of inventory for the company.
Also just to reiterate for those who don't know what @Potato Head meant when he said "including EMC" -- that's their enterprise storage and backup subsidiary. A significant portion of their inventory is going to be consumed with enterprise- or carrier-class servers and storage and networking gear, and some of their workstation-class gear that enterprises use in their SMB lineup. That's where the real money is made for Dell, and hence where (what little there is) inventory is best kept.

The big companies were also trying to hedge, where they could, with advance shipments from Taiwan expecting the Trump tariffs.
 
People aren't getting 2x INT32 throughput on RTX Blackwell compared to Ada Lovelace. There's probably an issue somewhere in the compiler/driver/firmware.
4080 sm_89
Throughput: 1.447048e+13 IMAD/sec

5080 sm_89
Throughput: 1.522615e+13 IMAD/sec

5080 sm_120
Throughput: 1.542646e+13 IMAD/sec
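
To put those numbers in perspective, here is a minimal sketch that normalizes the three results against the 4080 run. It uses only the throughput figures quoted above; the dictionary labels are just the GPU and compile target from each result.

```python
# IMAD throughput figures quoted above, in IMAD/sec.
results = {
    "4080 sm_89": 1.447048e13,
    "5080 sm_89": 1.522615e13,
    "5080 sm_120": 1.542646e13,
}

# Use the 4080 run as the baseline and express the others relative to it.
baseline = results["4080 sm_89"]
speedups = {name: rate / baseline for name, rate in results.items()}

for name, ratio in speedups.items():
    print(f"{name}: {ratio:.2f}x vs 4080")
```

The 5080 lands at roughly 5 to 7% above the 4080 in this test, nowhere near 2x.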
 
People aren't getting 2x INT32 throughput on RTX Blackwell compared to Ada Lovelace. There's probably an issue somewhere in the compiler/driver/firmware.

Nvidia's own documentation still claims half INT throughput for Blackwell: https://docs.nvidia.com/cuda/cuda-c-programming-guide/#compute-capability-12-0
 
Dunno. Could be that INT multiplication is implemented at half rate and this drags the IMAD score down?
The programming guide's instruction rates chart hasn't been updated since Hopper.
 
Rendering animated hair on humans is about 2x faster with LSS compared to DOTS, while also requiring about 5x less VRAM to store the geometry. This is similar for other common use cases. With LSS on GeForce RTX 50 Series GPUs and DOTS for earlier GPUs, there is now a way to get the highest possible hair ray tracing performance on all RTX GPUs.

When ray tracing in CUDA, LSS is currently available in OptiX. For DirectX, API for LSS can be found starting in the R570 version of the NVAPI SDK. Vulkan developers will be interested in the LSS extension coming soon.
Non-Blackwell GPUs will need a different implementation here.
 

Results in older benchmarks like Vantage can vary quite a bit depending on system configuration. I wasn't able to find what exact settings ixbt is using but anyway here's my go at some 5090 results.

These are from the 3DMark_Vantage_v113_installer (which still reports as Build 1.1.2 in the GUI) with the Advanced - not Pro - upgrade.
I used Windows 11 Pro with the desktop set to 1280x1024@60 Hz and G-SYNC disabled.

3dmark_vantage_5090.jpg
 