iMacmatician
Regular
I hadn't thought of 2x 256-bit per core in a future Xeon Phi, and the last sentence seems like a good reason to do so (the same theoretical throughput as the current Xeon Phi but more similar to Haswell). 2x 512-bit per core for Knights Landing, on second thoughts, may give too high a FP performance compared to the 3+ TFLOPS projection, at least if it uses Atom cores.I think that those new Atom could be a good building block for Intel. I've just answered Nick in another thread, and I think that may be Intel could sort of blend Haswell double FMA unit those Atom.
They would widen the data path on the atom core to 256bit to match the SIMD width (8 wide), and have 2 FMA units. If I get it right that is 16 DP FLOPS per cycle. They would introduce the same support for the gather instruction as in LBNx ISA (so better than what Haswell does).
[…]
I expect Intel to level out the playing field between its different product before possibly going wider the 256bit.