AMD RDNA4 Architecture Speculation

LordEC911 · Wednesday at 3:16 PM

fellix said:
The L3 latency got a sizable hit due to the chiplet approach. Bandwidth went up at least.

Assuming the plan is still to have the L3 off die in future, I wonder how much it would help if they bump the L2 up to 2-4MB per SE to try and compensate.

tunafish · Wednesday at 9:47 PM

LordEC911 said:
Assuming the plan is still to have the L3 off die in future, I wonder how much it would help if they bump the L2 up to 2-4MB per SE to try and compensate.

Given how well stacking works on the Zen side, I feel like the plan in the future is to put the L3 (and memory controllers) on a base die, as they are on MI300. The hybrid bonding process is not horribly expensive, and gets them a much lower latency and higher bandwidth link than anything else.

Scott_Arm · Wednesday at 10:24 PM

tunafish said:
Given how well stacking works on the Zen side, I feel like the plan in the future is to put the L3 (and memory controllers) on a base die, as they are on MI300. The hybrid bonding process is not horribly expensive, and gets them a much lower latency and higher bandwidth link than anything else.

Especially now that they've put the cache on ryzen 9000 series underneath the cpu cores. If they can package it so the cache is underneath the gpu cores you don't have to worry as much about the cache getting cooked.

Dangerman · Wednesday at 10:39 PM

Scott_Arm said:
Especially now that they've put the cache on ryzen 9000 series underneath the cpu cores. If they can package it so the cache is underneath the gpu cores you don't have to worry as much about the cache getting cooked.

Even if they will stick to a single Compute die, wonder if they would be able to like stack the memory die in an underneath cache die? Minimise the amount of unshrinkable silicone on the precious Compute die.

Bondrewd · Wednesday at 11:55 PM

Dangerman said:
wonder if they would be able to like stack the memory die in an underneath cache die?

they're doing literally just that for Venice so yea.

Scott_Arm said:
If they can package it so the cache is underneath the gpu cores you don't have to worry as much about the cache getting cooked.

Shoreline I/O will still be hella toasty, you've seen MI300 heatmap at the last hotchips.

Dangerman · Wednesday at 11:59 PM

Bondrewd said:
they're doing literally just that for Venice so yea.

I mean since I can't edit just to clarify like the memory bit bus because especially for large memory bit bus (i.e. 512-bit bus), that's a lot of precious space.

Though one use I could see that working in favour could be next gen consoles; have a cheap-ish TSMC N6 Memory-bit Bus/IOD/Cache die stacked underneath a potent more fancy Compute die with whatever the best stuff can be fit (if 2028/2029 that could mean a TSMC A16P Compute Die with UDNA Gen 2 & Zen 7)?

Bondrewd · Thursday at 12:16 AM

Dangerman said:
I mean since I can't edit just to clarify like the memory bit bus because especially for large memory bit bus (i.e. 512-bit bus), that's a lot of precious space.

512b is kinda pointless if you're going for MALL in 3D.

Dangerman said:
Though one use I could see that working in favour could be next gen consoles; have a cheap-ish TSMC N6 Memory-bit Bus/IOD/Cache die stacked underneath a potent more fancy Compute die with whatever the best stuff can be fit (if 2028/2029 that could mean a TSMC A16P Compute Die with UDNA Gen 2 & Zen 7)?

ah no SoIC-X cost structure makes it out of question for consoles.
It's strictly a premium thing.

techuse · Thursday at 11:12 PM

The absurd rumors are about to start flooding in.

del42sa · Friday at 6:55 AM

techuse said:
The absurd rumors are about to start flooding in.

MLID ....

AMD RDNA4 Architecture Speculation

Similar threads