Don't Buy The "Combined" Rumor
XBox 360 had 256Gbit/sec aka 32GByte/sec bandwidth on the eDRAM, not 256GByte/sec
Considering XB360 would have had to had 8 times the bandwidth it has for that to happen, I think it's quite safe to say it's not half the bandwidth
edit:
Oh, I just realized you might have meant just the daughter dies interconnects bandwidth which was 256GBytes/sec indeed, but it was limited by 256Gbit/sec connection between the GPU and the daughter die
As your edit states if you look at the Xbox 360 released bandwidth diagram it does show 256 GB/Sec for the eDRAM. The 32 MB SRAM is now claimed to be on chip without the slow chip to chip link. The wired photos seem to show one bare die without heat spreader. So unless there is another die like an IBM 80 MB Power 7+ eDRAM style on the back of the board (as opposed to mounted near the CPU die in the Power 7+ module) then the SRAM is on-die. There is no sign of another chip in the wired photos so it looks like on-die is the answer.
So if the rumor (which I obviously think is not true) says Xbox One Summed BW = X + Y + Z + ... + SRAM = 200 GB/Sec and the Xbox 360 Summed BW = I + J + ... + eDRAM (256 GB/Sec) > 256 GB/Sec then the SRAM in the Xbox One has to be very slow.
But we know quite a bit about SRAM speeds. If it is 6T as rumored we can have a pretty good idea of the speeds since we have some 28 nm TSMC 6T SRAM speed data right here:
Hornet said:
Aggregate bandwidth numbers for any recent AMD GPU are available in their OpenCL documentation.
For instance, for Pitcairn XT:
- 15360 GB/s to the register files
- 2560 GB/s to the local memories
- 320 GB/s to the constant memory
- 1280 GB/s to the L1 caches
- 512 GB/s to the L2 cache
Bandwidth to the L2 cache in GCN is bounded to the number of memory channels so I expect it to be 512 GB/s in the Xbox One.
Caches not relevant to GPGPU such as the ones inside the ROPs are not described in detail.
So for the rumor to be true it is necessary that:
1. The Xbox One SRAM would need to have seriously abnormally slow BW for a 32MB 6T 28 nm TSMC SRAM
or
2. The SRAM BW is not included in the 200 GB/Sec
As for the Nintendo Wii U gathering dust if you look at the internal power components, the die size, the power brick, etc, it is clearly in a totally different universe when compared side by side to the Xbox One main board photo at wired. The Xbox One box and the internal (and external power) components are quite comparable to the Xbox 360 original brittle lead free solder oven design and the original PS3 design. The rabid fans have already had a few rounds disparaging the size of the Xbox One chassis and the power brick. Now others want to say that the power brick is tiny. That flip flop is telling. The Nintendo Wii U is best described as compact or gently described as cute.
To get to 200 GB/Sec you need a SRAM bandwidth in the sum equation which be much lower than the absolutely lowest BW in the Pitcairn. That seems like a bridge too far.
So if one wants to argue a combined bandwidth argument at least go look at the Xbox 360 combined bandwidth/bandwidth diagram and some data on 6T SRAM speeds in 28 nm at TSMC.
As far as 8 years go a pretty good rule of thumb is that every two years you can get about 2x the performance if you beat the engineers often enough. It might be transistor count, or flops, or bandwidth, or memory quantity or hard drive density, etc.
So 8 years is 2x4 so the rough bandwidth increase could be expected to be on the order of 2^4 = 2x2x2x2 = 16x. If it is only 10x then combined 200 GB/Sec still looks abnormally low.
The GPU and CPU BW needs have certainly gone up at least 8x-10x by the worst rumors which also argues against a combined BW of only 200 GB/Sec. It screams for other BW to feed the APU/CPU/GPU fast enough.