Still trying to wrap my head how everyone here is getting 340$ (must be some magic number) for a chip 2x size of the GCD in NAVI31, $100-122 * 2x (some magic happens here) = 340!
Yes, and I asked it on previous page - what is a good chip?About 50 good chips
Yes, and I asked it on previous page - what is a good chip?
4090 doesn't have a perfect die with everything enabled, and I don't see where it's said that defective dies (by these calcs) are unfunctional.
So I will repeat it, all these yield calculations do not make any sense
Cost absolutely does not increase liniarly with area increase, mainly because of yield will be lower for sure. The magic would be 2 x size = 2 x priceStill trying to wrap my head how everyone here is getting 340$ (must be some magic number) for a chip 2x size of the GCD in NAVI31, $100-122 * 2x (some magic happens here) = 340!
Gosh, thanks for explaining the obvious stuff, what I am trying to tell you is different. How it scales depends not just on the yield of perfect dies, but mostly on how you bin chips for your products and on final products characteristics, isn't that obvious?Cost absolutely does not scale liniary with area increase, mainly because of yield will be lower for sure.
That's an interesting stuff given that smaller chips usually have more FE, display and other logic (relative to SMs and other blocks that can be disabled) that, if defective, would make a GPU unusable.On the other side, there are defects that prevent a die to work properly.
WGPs are there but I'm not really sure that "CU" is a thing in RDNA3 anymore? The whole concept of CU in RDNA1/2 seem to have more to do with GCN compatibility for consoles than actual RDNA h/w.
RDNA3 seem to completely throw the CU out and from this perspective is an enhanced RDNA2 WGP, with more local memory and dual issue SIMDs.
But can they still act as separate units? This is what made CUs "a thing" in RDNA1/2.According to AMD CUs are most definitely still a thing.
This is more than offset by the increase in area and area dedicated to interconnection among the various groups.That's an interesting stuff given that smaller chips usually have more FE, display and other logic (relative to SMs and other blocks that can be disabled) that, if defective, would make a GPU unusable.
Still there is nothing one can't solve by adding a bit of redundancy and more flexible floorplanning (if this was a problem indeed).
But can they still act as separate units? This is what made CUs "a thing" in RDNA1/2.
Right. Hence why I've said that it's there mostly for GCN compatibility (needed for consoles for the most part) and it remains to be seen if RDNA3 even expose "CU" in any capacity or just goes with "WGP" now as the basic shader processing unit.An RDNA 1/2 CU doesn’t technically act as a single unit as it shares the instruction cache and issue hardware with the 2nd CU in the WGP. However most execution resources were local to a CU (SIMDs, TMUs, RT, L0$). Aside from the shared front end the primary purpose of the WGP is to allow 2 CUs to share a larger, combined LDS.
The WGP is technically the “compute unit” as it encapsulates the instruction issue hardware and LDS. For example, OpenCL reports the 6900xt as having 40 compute units. The whole WGP/dual-compute unit nomenclature is unnecessarily confusing IMO and doesn’t align with how compute apis see the hardware.
"CU mode" exists as an operational mode (versus "WGP mode"). It determines the upper bound of how much threadgroup memory a thread group can allocate. (64KB vs 128KB).WGPs are there but I'm not really sure that "CU" is a thing in RDNA3 anymore?
The whole concept of CU in RDNA1/2 seem to have more to do with GCN compatibility for consoles than actual RDNA h/w.
RDNA3 seem to completely throw the CU out and from this perspective is an enhanced RDNA2 WGP, with more local memory and dual issue SIMDs.
Shouldn't this be 192KB now in RDNA3?"CU mode" exists as a operational mode (versus "WGP mode"). It determines the upper bound of how much threadgroup memory a thread group can allocate. (64KB vs 128KB)
Why? That's the vector register file getting 50% increase.Shouldn't this be 192KB now in RDNA3?