Yeah, they asked for pentium cores, so they could reuse the 10-year old rasterizer they had on dusty floppies in the kitchen cupboard.
SCNR
Intel Blasterizer 1000 - it'll blow your balls off!
Intel Optimo 900 - smoke your enemies
Intel Hyper 3D - no joke punchline for this one, I think it's possible
Intel Clearvision - same
Intel Teraforce - same
any marketing departments want to hire me yet?
Dot products aren't really that useful for graphics on the CPU, because nearly every use of it can be parallelized into a MAD. It should require fewer instructions, too.The dot product instruction is particularly interesting to note, especially given that Intel never really seemed to show much interest for graphics in previous SSE instruction sets.
Hitachi probably would disagree with you as, IIRC, they have a DP instruction in the SH4.Dot products aren't really that useful for graphics on the CPU, because nearly every use of it can be parallelized into a MAD.
True, but for realtime rendering you'd be mad to use AOS. For any code that Intel is writing, e.g. DX9/DX10 emulation, RT code, or optimized libraries, this isn't a problem in the slightest.Personally I've never been very fond of Intels "you can just use SOA" attitude when it comes to SSE. The most natural way to write code is to keep related data together. You use a class/struct that contains everything for one instance of that object. That'll have better memory access pattern too in most cases than distributing all attributes of your objects into different arrays. The DPPS instruction is great because it'll easily plug right into any existing code and you can do localized optimizations without restructuring your entire codebase.
I'm confused about the 128k L2 cache per core part, the 2006 presentation stated 256KB per core.
will the amount of cores decide if this will be a 16 or 32 core part or is any "info" so far regarding the 4MB L2 cache BS?
Perhaps they've lowered the amount per-core to improve overall latency and/or density, who knows ?
I'm still betting on 65nm samples and 32nm production...