XSX CUs show approximately 0 growth in area compared with Navi 10 and 14. Supposedly the AMD ray tracing patent document indicates a minor increase in die size. (Maybe one day I'll bother reading it.)
Navi 21 is rumoured to support less hardware threads per SIMD (16 instead of 20). I can't see that this would make a major difference in CU size though. In theory more complex shaders are likely to result in less hardware threads being able to fit into a SIMD, therefore supporting 20 is pointless. Less hardware threads in flight per SIMD also helps with L0 cache coherency (reduced thrashing there). But 16 versus 20 isn't some amazing difference... 8 would make me think for longer...
If only XSX CUs were dramatically bigger...
Rumours are for a 60% performance per watt improvement. That's more than normally occurs with a full node transition! It seems to indicate that Navi 10 was a terrible failure.
Can we expect Navi 23 (6500XT?) to give the same performance as 5700XT? 32 CUs, 32 ROPs, 128-bit bus, 9.8TFLOPS at 2400MHz (guessed clock). That seems pretty reasonable to me. It supports the monster-cache rumour.
How much power would that consume? 130W like 5500XT? That's a 50% bigger die than Navi 14 (235 versus 157), which is rated for 130W.
My modelled
- Navi 14 derived from Navi 10 is off by about 10mm² (167 versus 157 actual)
- Navi 23 (with 32 WGPs) derived from Navi 10 is 13mm² too large (248 versus 235 rumoured).
Those are both 6% out. I think Navi 23 will actually be smaller than 235mm², because packaging makes the size too large. My Navi 10-based models are a few weeks old now and could be improved by my Navi 14 analysis, but 6% is crushing my enthusiasm.
(There is an error in what I published a few weeks ago, but it's a self-cancelling error in the "uncore" versus "miscellaneous IO + edges" areas - they're mutually derived, so the error cancels in other modelled GPUs. Uncore is twice as large, so miscellaneous IO + edges is half-sized. I haven't come up with a way to improve the model for these two areas, but it's notable that Navi 14 has a smaller miscellaneous IO + edges area - less HDMI/DP? Seems strange, since there's a "pro" variant... The error in these, jointly, could be 10-15mm², out of about 60-70mm² when modelled for other GPUs, I suppose, but the Navi 10 die shot is too poor in quality to understand these areas.)
My modelled PS5 die size is about 8mm² too large (316 versus 308) which is only 3%, but 308mm² is from the PS5 teardown video, so that seems murky and packaging makes it "measure" too large.
Navi 21 die size, 536mm², is too vague in my opinion to make much of a claim (modelled 524mm² with 80 WGPs). Packaging-derived measurement error would be about 20mm², e.g. 516mm².
On-chip or >256-bit GDDR6?
An improved on-die memory system, "Infinity Fabric" combined with "Infinity Cache" should use more transistors, definitely, even without a monster cache. XSX die shows a pair of fairly large blocks that are labelled as "SOC fabric, coherency, G6 MCs" (about 25mm² for the two). I dunno how to translate that into Navi 2x... Some of that should be directly associated with the CPU, but some of it is "Infinity ...".