Thanks, interesting article! The part I found particularly interesting was the real-world performance of big.LITTLE and how it can actually be detrimental to power. Perhaps this is part of the reason why Nvidia chose cluster migration for Tegra X1 instead of HMP. And I think this is one of the big reasons for Qualcomm's dominance of the high-end SoC market for the last few years. They've demonstrated high energy efficiency at both high and low speeds with Krait. The fact that they've developed their own scheduler for the S810 shows that they were able to recognize the problem and mitigate it, which is very good to see. If ARM is indeed 18 months away from a proper fix, they've really dropped the ball on this one.
I have a few questions though. Let me try to list the ones I remember:
1. Any idea why the LPDDR3 speeds dropped from 933 MHz on the 5422 to 825 MHz on the 5430/5433?
The extra bandwidth surely would have been useful. Performance of the 7420 with LPDDR4 should give us a good idea if there are any bandwidth limitations.
2. Exynos 5420 vs Exynos 5430 block sizes.
Block        Exynos 5420   Exynos 5430
A7 core      0.58 mm²      0.4 mm²
A7 cluster   3.8 mm²       3.3 mm²
A15 core     2.74 mm²      1.67 mm²
A15 cluster  16.49 mm²     14.5 mm²
The numbers seemed a bit off to me so I did a bit of math. Extrapolating from those figures (cluster area minus four cores), for the A7s, the 512 KB of L2 cache on the 5420 is 1.48 mm² but on the 5430 it is 1.7 mm². And for the A15s, the 2 MB L2 on the 5420 is 5.53 mm² but on the 5430 it is 7.82 mm². So in both cases the size of the cache increased significantly, despite being on a smaller node. Wouldn't one expect exactly the opposite? I understand that there can be optimizations for area or power but this seems a bit extreme.
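For anyone who wants to check my arithmetic, here is a rough sketch of how I got those figures. It assumes four cores per cluster and treats everything left over in the cluster as L2 cache, which is my own simplification: the leftover area presumably also includes other shared cluster logic, so the "L2" label is approximate. The names in the script are just mine for illustration.

```python
# Block sizes in mm², copied from the table above: (single core, whole cluster).
BLOCKS = {
    "A7": {"Exynos 5420": (0.58, 3.8), "Exynos 5430": (0.40, 3.3)},
    "A15": {"Exynos 5420": (2.74, 16.49), "Exynos 5430": (1.67, 14.5)},
}

# Assumption: each cluster holds four cores.
CORES_PER_CLUSTER = 4


def non_core_area(core_mm2, cluster_mm2, cores=CORES_PER_CLUSTER):
    """Cluster area left after subtracting the cores (assumed to be mostly L2)."""
    return round(cluster_mm2 - cores * core_mm2, 2)


# Leftover area per CPU type and SoC.
RESIDUALS = {
    cpu: {soc: non_core_area(*areas) for soc, areas in socs.items()}
    for cpu, socs in BLOCKS.items()
}

for cpu, socs in RESIDUALS.items():
    for soc, area in socs.items():
        print(f"{cpu} cluster, {soc}: {area} mm² outside the cores")
```

If the core and cluster measurements do not draw the block boundaries the same way on both chips, the subtraction would move area between "core" and "cache", which might explain part of the difference.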
3. A53 vs. A7 power consumption (re: page 4, SoC Synthetic "Little" Load Power chart)
When looking at 1 core, the A53 consumes 27% more power than the A7. But with 4 cores, it consumes 87% more power. I do not see any explanation for this.
And we see the same thing with the A57 vs. the A15. One core consumes 17% more power but 4 cores consume 67% more power (comparing both at 1.8 GHz). I am at a loss to understand this.
Edit: Just a minor nitpick but on page 2 you mention that Qualcomm moved from 28nm HP in previous SoCs. AFAIK they were on 28LP earlier, and 28HP is what AMD and Nvidia use for GPUs.