no it isn't, again watch the hotchips presentation, IPC comes from more agressive front end ( aditional L2 predictor/fetch, more agressive core prefetch/predictor) . Improved scheduling and a big improvement in OOO Load and Store capabilities, they even go as far as giving overall IPC improvement for each area.
Okay, so if they go that far, do they specifically say there is a 15% IPC improvement in the SIMD units on top of the doubling in width? When hotchips talk about doubling throughput, do they talk about throughput being double or 2.3x greater?
It might be useful if you posted a link to the presentation so I know what you're referring to. I wasn't able to find it on Google.
No im talking about benchmarks which compare bulldozer AVX vs FMA compiled code. of corse in a SR console devs would take every chance to write code that could be FMA'd but really how offen is that going to be fessable.
And do those same benchmarks also show twice the performance when comparing roughly similar CPU's where one has twice the SIMD width (ala bobcat and Jaguar)? If you would share your links it might aid the discussion.
And yet games developed for a jaguar console wont be "normal" applications.
It's true that the benchmarks at Anand don't reflect the SIMD improvements of Jaguar over Bobcat and in consoles, that advantage will be apparent, but it certainly won't be a doubling of overall performance. The none SIMD performance of Jaguar is fairly well represented by Bobcat (once you add in the 15% or so) and it's quite clear that there, it's not all that hot. That's why my A10 comparison was a bit more relevant. Since they have fairly comparable theoretical SIMD throughput you can at best say when it comes to SIMD they'll be roughly comparable but in anything else the Jaguar will be a little slower in multithreaded code (8 cores vs 2 modules) and vastly slower in single threaded code.