DavidGraham
Veteran
Is the A12X chip really equal to an i7 6700K in single threaded performance? Some even claim it's close to an i7 in multi core performance too!
I claimed that. By initial appearances the A12X is close enough to the 6700K that what benchmark you select is going to determine what outcome you get. Pick one that suits your agenda.Is the A12X chip really equal to an i7 6700K in single threaded performance? Some even claim it's close to an i7 in multi core performance too!
On average though, across multitude of benchmarks, how close is A12X to it?I claimed that. By initial appearances the A12X is close enough to the 6700K that what benchmark you select is going to determine what outcome you get. Pick one that suits your agenda.
On average though, across multitude of benchmarks, how close is A12X to it?
And how is that even possible? the difference in silicon real estate is massive!
I'd rather say that Apple has been doing wonders with its microarchitecture I'm still not sure it can scale frequency wise, it's nonetheless a great achievement.In any case it shows how far Intel has fallen back both in silicon process and architectural efficiency of x86.
I'm still not sure it can scale frequency wise, it's nonetheless a great achievement.
You mean you don't know one can increase L1 latency? Or reduce cache size? That's not the part of the design that makes me wonder about frequency scaling.With a 128KB L1 cache ? Not a chance!
You mean you don't know one can increase L1 latency? Or reduce cache size? That's not the part of the design that makes me wonder about frequency scaling.
Is there any in depth write up about A12 or A12X other than Anandtech? (sadly they are not deep enough).Both Skylake 6700K and A12X are about 120 mm2 die area.
The Skylake is 14nm and 1.75B transistors, the A12X at 7nm is 10B transistors.
What exactly are you expecting?....Is there any in depth write up about A12 or A12X other than Anandtech? (sadly they are not deep enough).
Some comparisons to X86 CPUs for starters, and a deep dive into the A12 ARM heritage.What exactly are you expecting?....
Feel free to send me devices, for starters. As for "heritage", do you want RTL changelogs with that?Some comparisons to X86 CPUs for starters, and a deep dive into the A12 ARM heritage.
Well, that is the question everyone would like to see answered, isn't it? It is also hugely complex.On average though, across multitude of benchmarks, how close is A12X to it?
And how is that even possible? the difference in silicon real estate is massive!
If the gain of getting back to 32KB (or 64KB) is compensated by the gain in frequency then it's a no brainer. And that's one of the most easy part of a design to change. Adding a cycle of latency to L1D decreases CPU bound benchmarks (with only L1 hits)speed by much less than 10%.You can't increase latency or decrease the size of the L1 without negatively impacting performance. So no, the CPU design as it is won't scale to significantly higher frequencies.
Scheduling engine, pipe stage latency of FP operations, forward paths of more complex operations, address translation come to mind. All of these can be more complex to fix than adding a cycle of latency to L1.Which other parts make you wonder about frequency scaling ?
If the gain of getting back to 32KB (or 64KB) is compensated by the gain in frequency then it's a no brainer. And that's one of the most easy part of a design to change.
Maybe true for 3 or 4 wide CPUs, Vortex is 6-wide; Increasing latency from four to five cycles means you have to schedule around 30 instructions instead of 24 at peak issue rate.Adding a cycle of latency to L1D decreases CPU bound benchmarks (with only L1 hits)speed by much less than 10%.
It's 7-wide. In the future if Apple will want to go higher frequency, they will have no issue in doing so.Maybe true for 3 or 4 wide CPUs, Vortex is 6-wide; Increasing latency from four to five cycles means you have to schedule around 30 instructions instead of 24 as peak issue.
As a personal speculation, I was always a bit doubtful when it came to the claims that the x86 tax was some 10% or so. The claim is based on the idea that the x86 ISA is translated to a lower level code that once the translation is done runs optimally, and that the conversion carries only a small cost (and perhaps a somewhat higher cost in terms of complexity).
Because I worked in teams that had to change L1 latency OK that's not really easy, but other parts of the designs are much more difficult to change to accommodate frequency increases.Why do you think it is easy ?