davis.anthony
Veteran
I wonder what the costs are between running a large memory bus with GDDR6/X and a high amount of internal cache vs just using HBM3.
Cache capacity is just one factor that drives effective hit rate. If anything, all hit rate graphs are bound to the specific context of how one particular architecture makes use of that one level of the cache hierarchy. They are useful for predicting how different SKUs of the same architecture perform, but they are never a good portable performance predictor across generations of architectures, where you should expect significant changes everywhere.

I mean, AMD has reduced the Infinity Cache on Navi 31 from 128MB to 96MB. That step back in specs would be fine if 96MB is sufficient, but if it's not (as in the situation being discussed here), then it feels an awful lot like they're just not producing as good a product as they could, out of cheapness.
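To illustrate why the hit rate matters so much here: in a crude amplification model, the off-chip GDDR6 only has to serve the misses, so the traffic the chip can sustain scales roughly as DRAM bandwidth divided by the miss rate. A minimal Python sketch of that idea, where every number (bus bandwidth, cache bandwidth, hit rates) is an assumed placeholder for illustration, not an AMD figure:

```python
def sustainable_bandwidth(hit_rate: float, dram_bw: float, cache_bw: float) -> float:
    """Crude model: DRAM only has to cover misses, so total serviceable
    traffic is roughly dram_bw / (1 - hit_rate), capped by cache bandwidth."""
    return min(dram_bw / (1.0 - hit_rate), cache_bw)

DRAM_BW = 960.0    # GB/s -- assumed 384-bit GDDR6 figure, illustrative only
CACHE_BW = 5000.0  # GB/s -- assumed on-die cache bandwidth, illustrative only

# Assumed hit rates standing in for smaller vs. larger cache configurations
for hit_rate in (0.45, 0.55, 0.65):
    bw = sustainable_bandwidth(hit_rate, DRAM_BW, CACHE_BW)
    print(f"hit rate {hit_rate:.0%}: ~{bw:,.0f} GB/s sustainable")
```

Even with made-up numbers, the shape of the curve shows why a few points of hit rate lost to a smaller cache can cost more effective bandwidth than the raw spec difference suggests.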
For cost efficiency, one wants to reuse the MCD die across all models. An increase in the MCD's integrated cache that appears justifiable if one looks solely at N31 may not look very appealing for the smaller models. On the other hand, the price AMD can command for the fastest version of N31 might justify the additional cost of stacking some extra cache dies. At the high end, the price/performance ratio is far from linear: even relatively small performance increases at the top justify relatively larger cost increases for the manufacturer. That is not the case at the low end. Furthermore, stacking cache increases the cache size by a multiple, not just a small increment, so it ends up with a far bigger return (in terms of performance) than a moderately larger cache on all MCDs.

The Vcache chip for the 5800X3D represents a very significant percentage increase in silicon usage.
The alternative - adding a much larger L3 cache on the base chiplet - would be even less space efficient.
So it makes sense here. The performance potential unlocked is also very obvious.
For Navi 31, it's quite different. An extra 8MB of L3 per MCD would not represent a huge percentage increase in the silicon needed. And again, it's all on a cheaper process than the compute die.
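To put numbers on the "stacking multiplies, a bigger MCD only adds an increment" point, here's a quick tally of total Infinity Cache under the options being discussed. The 6 x 16MB baseline comes from the thread; the capacity of a hypothetical stacked cache die is my assumption for illustration:

```python
MCDS = 6
BASE_MB = 16          # per-MCD L3 as discussed (6 x 16MB = 96MB on N31)
EXTRA_ON_DIE_MB = 8   # the hypothetical "+8MB per MCD" variant
STACKED_DIE_MB = 16   # assumed capacity of one stacked cache die (illustrative)

configs = {
    "stock N31 (96MB)":          MCDS * BASE_MB,
    "+8MB baked into each MCD":  MCDS * (BASE_MB + EXTRA_ON_DIE_MB),
    "+1 stacked die per MCD":    MCDS * (BASE_MB + STACKED_DIE_MB),
    "+2 stacked dies per MCD":   MCDS * (BASE_MB + 2 * STACKED_DIE_MB),
}

for name, total_mb in configs.items():
    print(f"{name:>28}: {total_mb} MB total Infinity Cache")
```

Under these assumptions, growing every MCD gets you to 144MB across the whole stack, while stacking only on the top SKU jumps that one product to 192MB or 288MB without touching the base die used by the cheaper models.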
"RX 6000 series and upcoming RDNA 3 GPUs"

Upcoming RDNA 3 GPUs are presumably the RX 7000 desktop GPUs. RX 6000 is RDNA 2.
What other RDNA3 GPUs is he referring to?
AMD Rage Fury with RDNA3?
I wonder what the costs are between running a large memory bus with GDDR6/X and a high amount of internal cache vs just using HBM3.
Yeah, I've wondered along those lines too. The expected design for the high end is a big GPU surrounded by 6 chiplets which house the L3 cache, memory controllers and PHYs, all sitting on an interposer.

According to the patent application:

Notice there is no "MCD" in that document.

Ah, that's for CDNA3 and onwards.
Do we expect any changes to the shader core over RDNA 1 and 2?

A lot, actually.
I have a feeling this is gonna turn into a huge advantage for AMD this cycle. I think it'll overshadow the whole 4000 series and will loom large over nVidia for this generation, but it'll be interesting to see either way.

I think it's going to be largely irrelevant overall, especially considering the issues likely affect a very small number of people who buy 4090s, who are themselves a very tiny portion of PC enthusiasts.