AMD RDNA4 Architecture Speculation

NVIDIA definitely runs tensor and fp32 ops concurrently, especially now that their tensor cores are busy almost 100% of the time (doing upscaling, frame generation, denoising, HDR post-processing, and in the future neural rendering).

The latest NVIDIA generations have become much better at mixing all 3 workloads (tensor + ray tracing + fp32) concurrently. I read somewhere (I can't find the source now) that ray tracing + tensor is the most common concurrent pairing, followed by ray tracing + fp32 and tensor + fp32.


The way I read this, it seems that while the workloads are executed concurrently, they are still not dispatched concurrently (unlike on some other architectures). So some pipes will be underutilized.
 
According to ITHome, who are quoting their own sources now, the Radeon RX 9070 GRE is indeed coming. Presumably, this means another Chinese market exclusive; however, there were instances where AMD brought GRE models to the global market (RX 7900 GRE).

The media have no specs on this graphics card yet, but they assume that this means a 192-bit memory bus and, as a result, a 12GB GDDR6 memory configuration. This sounds a lot like an RX 9070-class GPU, except AMD already has one, and the RX 9060 XT is already confirmed to have a 128-bit memory bus.
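
For anyone wondering how a 192-bit bus leads to 12GB, here is a rough sketch of the arithmetic. It assumes the usual GDDR6 setup of one 32-bit chip per channel at 2 GB (16 Gbit) each, which is my assumption and not something from the report. The function name and the clamshell flag are just illustrative.

```python
# Assumptions (not from the ITHome report): every GDDR6 chip has a
# 32-bit interface and holds 2 GB (16 Gbit). Clamshell mode puts two
# chips on each 32-bit channel, doubling capacity at the same bus width.
CHIP_BUS_BITS = 32
CHIP_CAPACITY_GB = 2


def vram_capacity_gb(bus_width_bits, clamshell=False):
    """Estimate VRAM capacity in GB from the memory bus width in bits."""
    if bus_width_bits <= 0 or bus_width_bits % CHIP_BUS_BITS != 0:
        raise ValueError("bus width must be a positive multiple of 32 bits")
    chips = bus_width_bits // CHIP_BUS_BITS
    if clamshell:
        chips *= 2
    return chips * CHIP_CAPACITY_GB


if __name__ == "__main__":
    for bus in (128, 192, 256):
        print(f"{bus}-bit bus: {vram_capacity_gb(bus)} GB")
```

Under those assumptions a 192-bit bus gives 6 chips and 12 GB, while a 128-bit bus gives 4 chips and 8 GB (or 16 GB in clamshell mode).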
 