So RISC, RISC and probably RISC ...
Not so much RISC but more SISC, Simplified Instruction Set Computer
So RISC, RISC and probably RISC ...
Meh, ARM has way too much deadweight for truly lightweight cores ala Cell SPUs ... and once you hang some massive width SIMD unit off the side to compensate the differences are rather academic.Any one had a look at latest ARM 64 ISA ? At least twice as good as x64 I would say.
Meh, ARM has way too much deadweight for truly lightweight cores ala Cell SPUs ... and once you hang some massive width SIMD unit off the side to compensate the differences are rather academic.
I don't think ISA has much thing to do with register file as long as they have a same approach through decode unit.
Anyway, Decode could be a part that ARM has strength.
If you think narrow (ie. 4 way SIMD or better 4 way VLIW) cores are a dead end then what use is there for ARM? A slight improvement in area efficiency in what is only a small part of the core doesn't do you a lot of good ... and that is what ARM and x86 are in a Larrabee type architecture.Yes indeed, Cell is dead end.
Isn't that store in the middle part after decode?Don't you have to encode the number of instruction registers somewhere in a 32 bit word ?
Having more than 16 architectural registers is a waste of encoding space. The only reason to desire more would be to perform unrolling for latency hiding... But there's a more efficient way to achieve that, and it's likely going to be supported by AVX: It can be extended to support 1024-bit operations, which can be executed on 256-bit units in four cycles. This only takes one more bit of encoding space, but offers four times more register space for implicit 'unrolling'.
Well..that's what I mean, no decode unit can be a strength of ARM.Well, fixed instruction length allows hardwired, no need to decode.
I told you so.It makes no sense to me that Haswell would get a Larrabee-based IGP, considering the gather and FMA support for AVX2 and its future extendability to AVX-1024 which would lower the power consumption.
Told me what, precisely? Hats off if you got it right but let's not sell the skin before the bear is caught. Larrabee got cancelled and the plans to integrate it into Haswell may have gone down with it. Or it could just be the codename for the next evolutionary step in the GMA product line, without x86 compatibility.I told you so.
Are you referring to Thumb and AArch32/64?I got news for everyone. ARM has decoders, and they have ugly variable length instructions.
It's obviously cleaner than x86, but it's all a matter of degrees.
AVX2 isn't power efficient enough yet. We need AVX-1024 for that. But yes it could be part of a longer term convergence.One possibly tempting reason to include it would be if LRB3 is aligned with Haswell's new instructions, or is part of the alignment process.
According to Tom Piazza the theoretical scatter/gather performance of Ivy Bridge's IGP is 32 times higher than Sandy Bridge. He also reveals that in practice it's lower due to bank conflicts.It might also lead to questions on where the most aggressive gather implementation would be on the Haswell die, if LRB found its way there.