Apple A11 SoC

Discussion in 'Mobile Devices and SoCs' started by iMacmatician, Aug 18, 2017.

Tags:
  1. Mingyu

    Joined:
    Oct 19, 2017
    Messages:
    2
    Likes Received:
    0
    Due to geekbench, iPhone 8 gets 2800 single thread and 3800 multiple threads in lower mode. It seems that A11 uses one Monsoon and one Mistral in lower mode. Would anyone like to test vfp benchmark in lower power mode?
     
  2. Mingyu

    Joined:
    Oct 19, 2017
    Messages:
    2
    Likes Received:
    0
    Another app named CPU DasherX can be used to check basic information of cpu
     

    Attached Files:

  3. mfaisalkemal

    Newcomer

    Joined:
    May 26, 2017
    Messages:
    45
    Likes Received:
    24
    checked iphone 7 plus cpu information with CPU DasherX and the result near Vitaly Vidmirov app.
    [​IMG] [​IMG]
    low power mode
    [​IMG] [​IMG]
     
  4. mfaisalkemal

    Newcomer

    Joined:
    May 26, 2017
    Messages:
    45
    Likes Received:
    24
    source:link(transcript section)
    if A10 GPU 6 cluster FP16 is equal to PowerVR 7XT(+) 6 Cluster FP16 = 768 FP16/Clock and 12 texels/clock,
    then A11 GPU have 1536 FP16/Clock and 24 texels/clock, same as A10X GPU but with more low clock because the texture fill rate
    is not same based on gfxbench texturing offscreen maximum value(17002 vs. 21261 MTexel/s)
    no information regarding FP32 and FP16 ratio. my guess A11 GPU clock is around 800MHz.
     
    #144 mfaisalkemal, Oct 24, 2017
    Last edited: Oct 24, 2017
  5. Vitaly Vidmirov

    Newcomer

    Joined:
    Jul 9, 2007
    Messages:
    105
    Likes Received:
    10
    Location:
    Russia
    Finally I've bought iPhone8, so benchmark iteration time will be shorter.
    I'm still struggling with stable autodetect. Threads are jumping between cores like crazy.

    Repo is updated with WIP commit, just in case.
    ~6 seconds measurement time should be enough even with high thread count (otherwise zeroes are displayed)
    "id" is a core signature based on issue width.

    Updated numbers for # of threads (1-6):
    2376*
    2304 2304
    2304 2304 1680
    2304 2304 1572 1572
    2304 2304 1572 1572 1572
    2304 2304 1572 1572 1572 1572
    (measured on iPhone8, iOS 11.1)

    * Single thread frequency result is significantly affected by measurement loop iteration count (CALIB_REPEAT)
    2376 +-0 variability with ~1/2 msec loop
    2385 +-5 variability with ~1/16 msec loop
     
    mfaisalkemal likes this.
  6. mfaisalkemal

    Newcomer

    Joined:
    May 26, 2017
    Messages:
    45
    Likes Received:
    24
    could you make some test of gfxbench low-level and high-level offscreen test on low power mode on? since iphone 6s, apple soc throttling a lot on graphics. for example, iphone 7 plus stable only 67% from initial score on manhattan 3.1 long term benchmark. with low power mode, iphone 7 plus will produce stable result from the beginning.
     
  7. Liandry

    Regular Newcomer

    Joined:
    Feb 26, 2011
    Messages:
    282
    Likes Received:
    35
    Is there some information about A11 GPU? ALU count, Texture block count, ROP block count, frequency, memory bandwith and shader model supported?
     
  8. loekf

    Regular

    Joined:
    Jun 29, 2003
    Messages:
    613
    Likes Received:
    61
    Location:
    Nijmegen, The Netherlands
    Is that even possible in a 10nm process ? Of course you could, because you can, because it's not very area efficient. Just ULPI with an off-the-shelf PHY is more efficient.
     
  9. mfaisalkemal

    Newcomer

    Joined:
    May 26, 2017
    Messages:
    45
    Likes Received:
    24
    geekbench battery test iphone 8 plus vs 7 plus

    #############Battery Runtime####Battery Score#######Battery Level
    iphone 8 plus#####16:12:20#########9723#############100% → 1%
    iphone 7 plus#####11:11:40#########6716#############100% → 0%


    any idea how to interpret that score?
    source:
    iphone 7 plus
    iphone 8 plus
     
    Lodix likes this.
  10. Grall

    Grall Invisible Member
    Legend

    Joined:
    Apr 14, 2002
    Messages:
    10,469
    Likes Received:
    1,800
    Location:
    La-la land
    Well, just from looking at the numbers it seems to have about 5 minutes additional battery life...which would be impressive, considering the (much) faster SoC and allegedly, a smaller capacity battery to begin with.
     
  11. iMacmatician

    Regular

    Joined:
    Jul 24, 2010
    Messages:
    720
    Likes Received:
    164
    From Ashraf Eassa at the Motley Fool:

     
    Grall likes this.
  12. Entropy

    Veteran

    Joined:
    Feb 8, 2002
    Messages:
    2,708
    Likes Received:
    528
    Interesting rumour, but hard to verify.
     
  13. Vitaly Vidmirov

    Newcomer

    Joined:
    Jul 9, 2007
    Messages:
    105
    Likes Received:
    10
    Location:
    Russia
    I finally uploaded iOS Spectre attack proof-of-concept to github, after all the hype has died down.
    Actually it was done a couple of weeks ago, but I was too lazy to prepare and submit it :-/
    https://github.com/vvid/ios-spectre-poc

    There is also disabled code to check for meltdown, but it doesn't work (at least with Spectre-V1-like speculation).
    I.e. lines 835/849
     
  14. mfaisalkemal

    Newcomer

    Joined:
    May 26, 2017
    Messages:
    45
    Likes Received:
    24
    CPU DasherX Benchmark latest update.
    FLOPS CPU
    [​IMG]
    INT8OPS
    [​IMG]
    FLOPS GPU
    [​IMG]
    HALF-FLOPS GPU
    [​IMG]

    IPhone X A11 GPU FP32 look weaker than A9 GPU but near FP16 A9X GPU
     
    Pressure and iMacmatician like this.
  15. Vitaly Vidmirov

    Newcomer

    Joined:
    Jul 9, 2007
    Messages:
    105
    Likes Received:
    10
    Location:
    Russia
    Seems like this is bullshit benchmark.

    For example they show 297 GFlops for A11.
    With NEON, each A11 monsoon core can theoretically achieve 57 GFlops.
    It think they just multiplied single-core measured result by 6.

    The only adequate peak FLOPS benchmarking utility in App Store is vfpbench.
    http://dench.flatlib.jp/app/vfpbench

    Measured numbers are a bit lower - 51GFlops single core and 197GFlops for six cores with a code like

    Code:
    op q0, q12, q13
    op q1, q12, q13
    op q2, q12, q13
    op q3, q12, q13
    op q4, q12, q13
    op q5, q12, q13
    op q6, q12, q13
    op q7, q12, q13
    op q8, q12, q13
    op q9, q12, q13
    op q10, q12, q13
    op q11, q12, q13
    where op is fmla.4s

    With 6 core layout the clock frequency of Monsoon is 3% less.
    So we have 197- 2*49,5 = 98GFlops for all 4 Mistral cores.
    98000(Gflops) / 4 (cores) / 1572(freq) = 15,58Flops per cycle.
    That means each Mistral has two 128bit pipelines. Too good, given it's size.

    So, I don't believe A11 GPU FP32 is slower.
     
    #155 Vitaly Vidmirov, Feb 19, 2018 at 11:27 AM
    Last edited: Feb 19, 2018 at 11:36 AM
    mfaisalkemal likes this.
  16. mfaisalkemal

    Newcomer

    Joined:
    May 26, 2017
    Messages:
    45
    Likes Received:
    24
    Maybe Still ongoing developement of code lol but atleast iphone A10 and older soc give gflop number like vfp benchmark.

    I ask developer about lower score of iPhone X GPU and he respond it.
    [​IMG]
     
  17. mfaisalkemal

    Newcomer

    Joined:
    May 26, 2017
    Messages:
    45
    Likes Received:
    24
    For GPU GFLOPS I think gfxbenchmark metal 3.0.3 ALU test can be an alternative to know how many GFLOPS FP32 of A11 GPU because my iPhone 7 plus ALU test score near DasherX score(around 300). But you need Charles proxy trick to download that old version of gfxbenchmark metal.
     
  18. Arun

    Arun Unknown.
    Moderator Veteran

    Joined:
    Aug 28, 2002
    Messages:
    4,996
    Likes Received:
    198
    Location:
    UK
    Any test suite which has a single "ALU" test doesn't seem very reliable to me. It's frankly complete nonsense to evaluate shader core performance with a *single* microbenchmark as if that was some kind of end-all-be-all of ALU performance. Just the fact someone is doing that indicates to me they probably don't understand how to test ALU performance in a useful way, and therefore I'd be very reluctant to believe any results.

    The original GFXBench 3.0 ALU1 test, for example, is basically testing nothing but trigonometric performance... GPUs that are slower at sin/cos (because they're bloody useless in most real workloads) are significantly slower at it. The GFXBench 3.1 ALU2 test is slightly better, but still just one biased datapoint amongst others (e.g. branch and divergence efficiency matters quite a bit more than in typical workloads).

    I honestly don't remember anything about any analysis of the A11, I've completely erased that from my brain for lack of interest... which tells me that it's probably *not* 2x slower than A10 for FP32 FMAs, because I'd hopefully have remembered that :p

    A few random thoughts on what might be going wrong: maybe they're using a very low resolution with lots of blended layers, and it's not enough tiles/pixels to fill all the shader cores? or maybe it's just one *massive* shader and the A11 either has lots of instruction cache misses, or it's a tiny loop with many iterations and the A11 compiler doesn't unroll it properly while the A10 did?

    There's literally a billion things that could go wrong with any microbenchmark, which is why if a result seems anomalous, you really want to iterate and modify the test to understand what's going on - or at least have a lot of knowledge of the trade-offs so you intuitively create a test which is unlikely to hit that kind of problem in the first place, and not a lot of people have that knowledge unfortunately. Kishonti for example didn't do that very well - they often relied on all the HW vendors telling them everything they did wrong in the beta versions until it kinda sorta worked in the final release...
     
    #158 Arun, Feb 21, 2018 at 11:49 AM
    Last edited: Feb 21, 2018 at 11:56 AM
    Ailuros, iMacmatician, Grall and 2 others like this.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...