Next Generation Hardware Speculation with a Technical Spin [post E3 2019, pre GDC 2020] [XBSX, PS5]

Status
Not open for further replies.
I can wait 13 minutes for the prices to rebound.

Unfortunately that's not how it works.

https://www.techpowerup.com/256909/...outage-6-exabytes-of-nand-production-affected

Return to standard manufacturing rates is expected to only occur by mid-July.

Damage includes impacted wafers that were being processed, the facilities, and production equipment, hence the need for an extended inoperability period to seriously assess damages and required reinvestment. 35% of the world's NAND supply is produced at this Yokkaichi Operation campus (which includes six factories and an R6D center), so this outage and NAND flash loss is likely to impact the global markets.
 
Return to standard manufacturing rates is expected to only occur by mid-July.

This isn't going to impact anybody who has a contract for NAND production for mass market products, this will merely limit the amount of NAND available outside contracted bulk production which will likely drive up consumer prices a little for a few months if stock becomes low.
 
Sounds like yields are great, so consoles just have to worry about wafer cost.

According to a user on /r/amd there was a TSMC meeting last week were the following info graphic was taken: https://pc.watch.impress.co.jp/img/pcw/docs/1193/327/46_o.jpg

I wonder what they categorize as "Large Die". 300 mm² and more or like what is likely for monolithic console chips, or ridiculous dies like Knights Landing (682 mm²) and Nvidia TU102 (754 mm²)?
 
According to a user on /r/amd there was a TSMC meeting last week were the following info graphic was taken: https://pc.watch.impress.co.jp/img/pcw/docs/1193/327/46_o.jpg

I wonder what they categorize as "Large Die". 300 mm² and more or like what is likely for monolithic console chips, or ridiculous dies like Knights Landing (682 mm²) and Nvidia TU102 (754 mm²)?

HVM of 7nm kicked off during April 2018 for TSMC so the graph for large die refers to a Q3 2018-Q1 2019 timeline. You wouldn’t expect a next gen chip to be in high volume production during that time.

That’s probably Vega or something else.
 
@anexanhume
https://semiwiki.com/semiconductor-manufacturers/samsung-foundry/7926-samsung-vs-tsmc-7nm-update/
22808-samsung-tsmc-7nm-comparison-2019.jpg
 
I seriously doubt they had silicon back in 2018.

My personal opinion is :

- Sony is further ahead with their console
- We know for fact that they sent dev kits in January, but there is also rumor of dev kits being sent in 2018
- Dev kits sent in Jan 19' or half a year before would not have SOC containing Navi
- January (let alone 18') was too early for 7nm and too early for production silicon
- January SDK contains Vega 56 GPU running at 1.8GHz equaling 12.9TF
- Gonzalo chip, noticed back in Jan running at 1.1GHz and then 4 months later at 1.8GHZ is Navi GPU found in retail PS5 unit
- PS5 GPU will be around 8.5-9.5TF + RT hardware instead of dev kits with Vega56

This is, IMO (knowing TDP, die sizes of Navi and when chips are taped out) only possible explanation.
Are you nutz?, Vega 56 at 1710 Mhz OC would about 400 watts.

 
Here are my theoretical performance estimates based on 7nm DUV & EUV designs
On 7nm DUV (9-10TF conservative estimate)
60CUs total - 54CU enabled - 320 bit bus - 3 Shader Engines -> 404mm2
But following design rules that prioritize density over frequently (empty spaces on 5700 die shots), should bring it closer to 390mm2 then shrink to 331mm2 when 6nm becomes available 2021-2022

Die Size estimates based on LiabeBrave graphic and @Proelite comments
75mm for CPU
81.6mm2 for 320 bit bus (20GB)
69.99mm2 for 3SEs (ROPs, cache, etc.)
IO 20.72mm2
10.85mm2 GPU CC (CP, ACEs etc.)
146.7mm2 30DCUs with RT (60CUs)
Total:
404.86mm2
  • 54CUs @ 1400Mhz = 9.67TF (lowball)
  • 54CUs @ 1500Mhz = 10.36TF (very likely)
  • 54CUs @ 1550Mhz = 10.7TF
  • 54CUs @ 1592Mhz = 11TF (best case scenario)
On 7nm EUV (11-12TF conservative estimate)
66CUs total - 60CU enabled - 384bit bus - 3 Shader Engines -> 348.68mm2 (335mm2 with a 320 bit bus)
75mm for CPU
97.92mm2 for 384 bit bus (24GB)
69.99mm2 for 3SEs (ROPs, cache, etc.)
IO 20.72mm2
10.85mm2 GPU CC (CP, ACEs etc.)
161.37mm2 33DCUs with RT (66CUs)
Total:
435.85mm2 7nm DUV
348.68mm2 7nm EUV
  • 60CUs @ 1500Mhz = 11.5TF
  • 60CUs @ 1600Mhz = 12.28TF
  • 60CUs @ 1693Mhz = 13TF
 
Here are my theoretical performance estimates based on 7nm DUV & EUV designs
On 7nm DUV (9-10TF conservative estimate)
60CUs total - 54CU enabled - 320 bit bus - 3 Shader Engines -> 404mm2
But following design rules that prioritize density over frequently (empty spaces on 5700 die shots), should bring it closer to 390mm2 then shrink to 331mm2 when 6nm becomes available 2021-2022

Die Size estimates based on LiabeBrave graphic and @Proelite comments
75mm for CPU
81.6mm2 for 320 bit bus (20GB)
69.99mm2 for 3SEs (ROPs, cache, etc.)
IO 20.72mm2
10.85mm2 GPU CC (CP, ACEs etc.)
146.7mm2 30DCUs with RT (60CUs)
Total:
404.86mm2
  • 54CUs @ 1400Mhz = 9.67TF (lowball)
  • 54CUs @ 1500Mhz = 10.36TF (very likely)
  • 54CUs @ 1550Mhz = 10.7TF
  • 54CUs @ 1592Mhz = 11TF (best case scenario)
On 7nm EUV (11-12TF conservative estimate)
66CUs total - 60CU enabled - 384bit bus - 3 Shader Engines -> 348.68mm2 (335mm2 with a 320 bit bus)
75mm for CPU
97.92mm2 for 384 bit bus (24GB)
69.99mm2 for 3SEs (ROPs, cache, etc.)
IO 20.72mm2
10.85mm2 GPU CC (CP, ACEs etc.)
161.37mm2 33DCUs with RT (66CUs)
Total:
435.85mm2 7nm DUV
348.68mm2 7nm EUV
  • 60CUs @ 1500Mhz = 11.5TF
  • 60CUs @ 1600Mhz = 12.28TF
  • 60CUs @ 1693Mhz = 13TF
Why only 20.72 for IO and not 37.72 ?

And I found 1.25 for density gains of EUV on your 66CUs APU. 435.85 / 348.68 = 1.25

The 7nm EUV size of your 66 CUs APU should be around 363.21 mm²
 
Yes a multiply of 18 should be the right CU number... Either 36 or 54....

Yes 54@1500 is the sweet spot .. with then a 36@1822 mode available to make a "pro-turbo" mode...
 
Yes a multiply of 18 should be the right CU number... Either 36 or 54....

Yes 54@1500 is the sweet spot .. with then a 36@1822 mode available to make a "pro-turbo" mode...
Why 18 multiplier is important ?
 
Here are my theoretical performance estimates based on 7nm DUV & EUV designs
On 7nm DUV (9-10TF conservative estimate)
60CUs total - 54CU enabled - 320 bit bus - 3 Shader Engines -> 404mm2
But following design rules that prioritize density over frequently (empty spaces on 5700 die shots), should bring it closer to 390mm2 then shrink to 331mm2 when 6nm becomes available 2021-2022

Die Size estimates based on LiabeBrave graphic and @Proelite comments
75mm for CPU
81.6mm2 for 320 bit bus (20GB)
69.99mm2 for 3SEs (ROPs, cache, etc.)
IO 20.72mm2
10.85mm2 GPU CC (CP, ACEs etc.)
146.7mm2 30DCUs with RT (60CUs)
Total:
404.86mm2
  • 54CUs @ 1400Mhz = 9.67TF (lowball)
  • 54CUs @ 1500Mhz = 10.36TF (very likely)
  • 54CUs @ 1550Mhz = 10.7TF
  • 54CUs @ 1592Mhz = 11TF (best case scenario)
On 7nm EUV (11-12TF conservative estimate)
66CUs total - 60CU enabled - 384bit bus - 3 Shader Engines -> 348.68mm2 (335mm2 with a 320 bit bus)
75mm for CPU
97.92mm2 for 384 bit bus (24GB)
69.99mm2 for 3SEs (ROPs, cache, etc.)
IO 20.72mm2
10.85mm2 GPU CC (CP, ACEs etc.)
161.37mm2 33DCUs with RT (66CUs)
Total:
435.85mm2 7nm DUV
348.68mm2 7nm EUV
  • 60CUs @ 1500Mhz = 11.5TF
  • 60CUs @ 1600Mhz = 12.28TF
  • 60CUs @ 1693Mhz = 13TF

Why 3 Shader Engines?
GCN had a limit of 16 CU per Shader Engine, with 4 shader Engines used for 64 CUs.
RDNA is using 10 Double CUs per shader Engine. Or the equivalent to 20 GCN CU per SE.
As we can see, the 16 limit is beated if we compare CU, but not if we compare GCN CU with RDNA Double CU.

IF RDNA simply created a new CU that is a Double GCN CU, that would mean we can have 16 Double CU per Shader Engine, or 32 GCN CU per Shader Engine. In that case, 2 shader Engines would have 64 GCN CU, allowing RDNA no reach 128 GCN CUs keeping the same efficiency as GCN on 64 CU.

And in this case you would only need to disable 2 Double CU, or 4 GCN CU. This would allow 56 CU.
At 1800 Mhz, this would go to 12.9 Tflops.
 
Why 3 Shader Engines?
GCN had a limit of 16 CU per Shader Engine, with 4 shader Engines used for 64 CUs.
RDNA is using 10 Double CUs per shader Engine. Or the equivalent to 20 GCN CU per SE.
As we can see, the 16 limit is beated if we compare CU, but not if we compare GCN CU with RDNA Double CU.

IF RDNA simply created a new CU that is a Double GCN CU, that would mean we can have 16 Double CU per Shader Engine, or 32 GCN CU per Shader Engine. In that case, 2 shader Engines would have 64 GCN CU, allowing RDNA no reach 128 GCN CUs keeping the same efficiency as GCN on 64 CU.

And in this case you would only need to disable 2 Double CU, or 4 GCN CU. This would allow 56 CU.
At 1800 Mhz, this would go to 12.9 Tflops.
You mean 60 activated CU ? 64 - 4 ?

I think 3 SE won't be possible with RDNA because of the butterfly structure. It's either 2 or 4.
 
Status
Not open for further replies.
Back
Top