Dr Evil said:These are good points. I don't think launching 2013 vs 2012 would give the manufacturers any significant benefits on what you can put into the boxes. Just like it didn't give any benefits in 2006 vs 2005 as far as the chips were concerned.
.
Recent GPU history shows several similar occurrences with node transitions. Build a bigger, faster, smarter chip on a mature node and you can sometimes get a bigger jump than just going in a new node day 1.
Pinches of salt at the ready,folks. According to the latest speculation, Microsoft's sequel to the Xbox 360 will actually be two models,a pared down set-top box for casual gamers and a heftier model for the hardcore.
Wasn't that mostly a result of the ps3 having a outdated gpu?
If you look at the processors made on a given node, there are lots of cases where a more mature node sees processors made that offer bigger leaps in performance over the first chips on that node. For example:
- Athlon 64 X2 90nm to Athlon 64 X2 65nm offered basically no improvement in raw performance, but a small improvement in perf/watt.
- Athlon 64 X2 (65nm) to Phenom (65nm) was almost double the raw performance, despite no change in node.
- Phenom (65 nm) to Phenom 2 (45nm) was about 25% more performance.
- Phenom 2's slowly got faster until later in the node when Phenom 2 X6 (45nm) was released and was about 50% faster than the fastest chips earlier in the node.
In all these cases the biggest jump in performance came with larger and more advances processors later on in the same node. The idea that you have have to be in there day 1 on a new node to get a big jump simply isn't always true. MS managed that with the 360 at great cost - they couldn't test properly, it generated more heat than they predicted and they were probably supply constrained (if not by the gpu then they would have been if they hadn't been more severely constrained by memory or the CPU) .
Recent GPU history shows several similar occurrences with node transitions. Build a bigger, faster, smarter chip on a mature node and you can sometimes get a bigger jump than just going in a new node day 1.
A 500m transistor CPU? I didn't know you were being so generous. Then:
2 x 165 mil Xenon + 20% OoO + 15% expanded cache = 445 mil
If I'm high on my OoO and cache %'s you could possibly even add 12mb of L3 edram (1mb x 12 threads) and still make 500 mil tranny's.
Assuming 32/28nm launch in 2012 would yield 8x trans count, this would amount to a budget of roughly 4billion (497m x8 = 3,976m) if we are to assume equal budget/process node.
This leads to some pretty interesting potential hardware:
With that budget, MS could extend the xb360 architecture to the following:
10MB EDRam (100m) => 60MB EDRam (600m) Enough for a full 1080p frame buffer with 4xaa
3 core xcpu (165m) => 9 core xcpu (495m) - or an upgraded 6 core PPE with OoOe and larger cache along with an ARM core (13m trans)
This leaves a hefty 2.8b trans available for xgpu which could accommodate 3x AMD ~6770 (1040m) ~3 teraflops or 4x AMD ~6670 (716m) ~2.8 teraflops.
Such small, modular chips would enable good yields on new(er) processes until they were mature enough to combine together and eventually integrate to an APU.
And yet again, what Intel can do most (no-one) others can't.
I'd say they just repackage current XB360 into a set-top and sell the next-gen system as a real gaming console.
I'm sorry but those examples are if not terrible then not very good either. Yeah If you just transfer your existing design to smaller process, you'r not going to see huge gains. Also going from dual core to quad core is not hard to see "double the raw performance" likewise 6 core > quad core...
AMD did see some nice die size shrinks though.
Also MS wasn't supply constrained to a any meaningful degree for longer than a month or two, despite their hardships that imo had nothing to do with the chips, but errors in the cooling and manufacturing. Launch PS3 used even more power.
Recent GPU history doesn't really back up that at all. yeah there has been some room for tweaking but for example GTX 580 uses basically the same chip as GTX 480, which is a huge chip and thus running into problems with is more likely than with a smaller chip. On the AMD size Cayman 6970 is only little bit faster than 5870, which was released two years ago when 40nm was the bleeding edge. Caymans gains also basically came from being bigger. Basically AMD hasn't been able to get any sort extra performance from tweaks during the 40nm era and the next leap in performance comes due to the 28nm process shrink. Imo the situation has been the same pretty much the whole time. yes there are few examples when the initial design has been so tiny that there has been plenty of room to increase the die size(3870>4870 or G80), but those times are long gone. The gains during the last two years on 40nm have been miniscule.
+ there are countless better examples showing the opposite of what you'r implying without having to resort to doubling the die size etc.
exactly. Especially about the part where real-world performance was compared...see above
exactly. Especially about the part where real-world performance was compared
I'm sorry but those examples are if not terrible then not very good either.
Yeah If you just transfer your existing design to a smaller process, you'r not going to see huge gains.
Also going from dual core to quad core is not hard to see "double the raw performance" likewise 50% with 6 core > quad core...
Also MS wasn't supply constrained to any meaningful degree for longer than a month or two,
Recent GPU history doesn't really back that up at all. yeah there has been some room for tweaking but for example GTX 580 uses basically the same chip as the one in GTX 480, which is a huge chip and thus running into problems with it was more likely than with a smaller chip.
Caymans gains also basically came from being bigger.
The gains during the last two years on 40nm have been miniscule.
+ there are countless better examples showing the opposite of what you'r implying without having to resort to doubling the die size etc.
AMDs fab is now Global Foundries, a foundry that TheChefO has been pimping heavily...
- Athlon 64 X4 (45nm) to Llano (32nm) showed a small improvement in perf/watt...
The real leaps in CPU power come on the same node, which is completely the opposite of what TheChiefO claimed.
The general conversation was about CPUs. That was AMDs almost complete recent CPU history. AMDs fab is now Global Foundries, a foundry that TheChefO has been pimping heavily. Those example are the best and most appropriate ones available IMO. If you have any better or more appropriate examples then hey, I'd love to see them.
The jumps in performance across node transitions have been smaller than the increases that occurred during the lifetime of the node.
Oh yeah, WiiU CPU: 45nm. There's got to be a reason for this!
Now lets look at Intel (from memory, tell me if I'm wrong):
- Single core to dual core: done on the same node (65nm)
- Dual core to quad core: done on the same node (45nm)
- Quad core to hex core: done on the same node (32nm)
This isn't true. MS launched with a piddling amount of consoles instead of millions they could have sold on day one. Supply was constrained in at least the UK and America until after Christmas.
So you're more likely to run into problems with a big chip early on in the node, but this is ... somehow ... not ... relevant? C'mon man, think about it.
The 480 was late and hot with poor yields. The 580 was a very similar chip but came when the process was more mature. The 580 was 10 - 20% faster in games while using 10 - 20% less power and yields were vastly better. All on a process where there was apparently no difference start to finish!
A chip like the 480 (late, hot, slow) would be an unmitigated disaster for a console vendor.
A bigger chip later on?!?
The gains in the first 12 months were fairly big.
March 2009: Mobility Radeon 4830
April 2009: Radeon 4770
September 2009: Radeon 5870
January 2010: Mobility Radeon 5xxx
So small chips to start, then six months in the first performance part, then 10 months in the mobile, power efficient performance parts. That is not a surprising pattern.
Later in the node you can get acceptable yields on bigger chips, and run the same chips faster within the same power budget. This is relevant to consoles. That, in a nutshell, is it.
Later in the node you can get acceptable yields on bigger chips, and run the same chips faster within the same power budget.