Right, but I was more looking at it from the standpoint of -- the CPU is losing importance, so, why waste valuable time/money/resources in changing the architecture for something that in the end, will not have much impact on the final result ... where the heavy lifting will be done by the GPU.
Thus, scale the existing arch, and keep BC.
I didn't say any of that, both current console CPU's have relatively speaking appalling single threaded performance for their clock speed.
What I said was single threaded performance is important.
That does not discount all multi core design, it states the need that the cores be able to perform well for large blocks of single threaded code.
If you go back and check my post history since 360/PS3 were announced you'll see that I've consistently stated that the CPU's in question were designed around stellar paper specifications, with no real eye to real world performance.
It shouldn't be about Terra Flops or MIPS, it should be about being able to build something that works well without having to micromanage instruction scheduling.
This is why I advocate approaches like the Cray XMT that try and address the real performance issues, which are largely about cache misses.
You'll also note my continued push for programmers to have a better grasp of hardware architecture and what their programs actually do, I'm also a big believer i do things right first time.
But games are 3 Million+ line applications costing 10's of millions of dollars. It's a different development environment than it was even 10 years ago.
How about the 32MB EDRAM IBM has in their latest designs? Change the structure from a cache to a scratch pad / local store and this would seem to be pretty useful in working on that latency issue for many use cases.
No?
I don't like the idea of local store outside of small set of specific problems.
Ignoring concurrency issues with scratch pads.
Scratch pads will never be "big enough" then your back to shuffling data around.
If you assume that writing large scale software is already challenging (and judging by some of the buggy crap that ships on all platforms I think we have sufficient evidence to back up that hypothesis) why force developers to manage a scratchpad to get decent performance?