Ok, time to review / recap / recast predictions. The following will be more GPU specific. I decided to post this incomplete and all from memory instead of never posting it. A lot of number errors I am sure... so respond and correct them if they bother you! I know there are huge gaps, wrong numbers, etc. I don't have the time to correct all of this, so follow the general flow and branch off as you see fit. Besides the WiiHD there is nothing set in stone per the public. At some point I may post what I would LIKE to see and what I EXPECT to see.
PROJECTION #1: Process Node.
Code:
Process Density* Date (TSMC) Date (Intel) Date (GF)
90 - 2005
65 2 2007
45 4 2009
40 5 2010
32 8 -
28 10.4 2011
22 16 - 2011
20 17.6 2013
16 32
14 42.6 2015
12 64
* Compared to 90nm. Typically a comparison of the “smallest structure” and not an average. Each design & process are unique.
PS3 Footprint: 532mm^2
RSX: ~ 300mm^2 on 90nm
Cell: ~232mm^2 on 90nm
Xbox Footprint: 480mm^2
Xenos: ~220mm^2 + ~110mm^2 on 90nm
Xenon: ~150mm^2 on 90nm
EDRAM (MB) ~95mm^2 (110mm^2 – logic) on 90nm. 20nm would project out to about 176MB on 20nm.
PROJECTION #1 Conclusion: Process Node. 17.6 times the density. Excluding structural unit improvements/inefficiencies due to increase robustness, multiplied by a frequency increase on order of 50% (range ~ 40%-70%for the GPU, let’s assume 750MHz; essentially none with the CPU), “raw” performance jump is 26.4 increase for the same footprint.
PROJECTION #2: Relative to Cayman GPU. AMD architecture chosen because they seem the likely target, although there will be a shift to VLIW4 from VLIW5. AMD appears headed for a paper launch of 28nm GPUs in late Q42011. A Q42013 GPU on TSMCs 20nm process (a taped out chip on this processes was announced Oct2011, so give 12-18 months for first mass production).
Cayman aka 6970: 389mm^2 on 40nm TSMC (250TDP; 2.64B trans., 2.7TFLOPs, 880MHz, 96 tmu, 32 rop; was originally aimed at cancelled 32nm node). For cf.: 6950 offers 2.25TFLOPs @ 800MHz; 88TMU, 32ROP; 200TDP; idel “20” avg. peak load 157 iirc)
20nm is about 4.0 times denser than 40nm. Cayman naively would project to 10.8TFLOPs at 20nm (no architectural or frequency changes). Obviously with OI and various parts shrinking at different rates, some parts not needed the same increase (e.g. rops), this is a very rough projection.
Assuming a ~300mm^2 footprint (.77 size of Cayman) a Cayman-like design would hit 8.3TFLOPs, 294 TMU, 98 ROP, etc. Not all flops are equal (see how the lower flop Xenos typically bests RSX; ditto contrast of NV’s scalar design and AMDs VLIW; also note the change in VLIW) so this is all general ballpark. It would be aggressive to assume the retention of 800MHz of the 6950, let alone the 880MHz of the 6970—especially after the RROD. Redundancy and Binning would also need to be a consideration. Without more information on the TDP of TSMC’s 20nm node 750MHz seems a safer projection.
PROJECTION #2 Conclusion: Relative to Cayman GPU. A 300mm^2 GPU on 20nm @ 750MHz would be in the 7 TFLOP range, potentially higher (e.g. shader cores will scale faster than other features like rops). Compare to the 215-236 GFLOPs range (iirc) for current console GPUs results in about a 30x increase in performance (there is no doubt the Cayman flops are more effective than the RSX flops). There will be other console specific features on chip (e.g. Xenos is also the memory controller; also redundancy) but some parts PC-specific can be dropped.
The projections based off of process reduction and the path GPUs have taken specifically lead to about the same range of 25-30x as fast in 2013 on 20nm. The money conscious skeptic would lean toward a low end 20x.
I would bet the development teams are seriously debating how long a console should last. Should it be more disposable (4year?) with anticipating that the platform transition into a Cloud devise? 10 year? There is also the eyes looking at realistic process reductions. And how soon can compelling software be deployed? Other major issues continue to be storage, storage speed, and distribution. Will a company reduce CPU/GPU budgets to invest in consumer experience with hybrid storage for faster loads? How much will input devices (Kinect 2, Move 2, new concepts) cut into budgets? Will HDD be standard? Will MS license BDR; offer activation codes? Will there be a big change in budgets—will MS decide to shift area from the CPU to the GPU? What memory densities, technology, and speed will be available? Will we see a CPU with a similar tech (e.g. PPC) but heterogeneous design (e.g. 1x OOOe PPE with large fast cache + 6x Xenon PPEs)? Where does eDRAM and other similar tech fall in? Will we see a monstrous scratchpad?
And what if… Xbox 720 in 2012? 28nm is now shipping with TSMC. AMD is going to roll out their new GPUs Q4. MS could aim for a 2012 launch (with Halo 4) with a 28nm GPU with immediate cost reduction in 2013 to 20nm. The GPU would be essentially the 7950-class design. Of course Kinect would not have even reached its 3rd birthday before a new Kinect launched and while the Xbox is selling very well the problem become does MS ride the 360 until it dies or do they move while the moving is still good? They could catch Sony in a hard spot (Vita is a Q1 2012 launch so it is unlike Sony would be aiming for 2012) and Sony would have to bet on 20nm in mass production in 2013 and at best would get a ~2x performance jump (but also much more expensive all generation for that jump). The question is are developers ready, is there compelling software, and have the tools matured enough that the content shift (most stuff is mult. million poly source now)will be less dramatic to see quicker uptake? Or are publishers saying, “Look, we bleed for years. We need to continue capitalizing on the current platforms”? Or maybe they see the market ripe for a new platform, maybe more consolidation?
If things do stretch out to 2013/2014 the good news is it is highly probable one company will keep similar footprint budgets / TDP compared to this gen, almost guaranteeing a significant leap in processing power.
My hope would be a release maybe in 2012 with a large processing footprint, 4GB of fast memory, and standard storage with a Kinect like device plus a multifunction motion/traditional pad with the goal of a big cost reduction ramp up in 2013. I don't mind a long lived console as long as it doesn't age too quickly.