Yes it does, but I guess the benchmark doesn't use it for its FP computations (and anyone using NEON for FP should be extremely cautious as it's not IEEE compliant).I thought A4 had NEON. Doesn't it?
Yes it does, but I guess the benchmark doesn't use it for its FP computations (and anyone using NEON for FP should be extremely cautious as it's not IEEE compliant).
I doubt most algorithms care. All we're talking about is subnormal support and exception catching in the cases of divide-by-zero, operations with infinity and NaN's. None of which really concern most things outside of scientific computing.
Maybe there is an integrated Thunderbolt controller? Apple could be holding off until the iPhone 5 by which time the complete Mac lineup should have been refreshed with Thunderbolt. It would probably need to be implemented to use the existing 30-pin connector so users can either use the traditional 30-pin to USB cable or a new 30-pin to Thunderbolt cable.The A5 is really quite big if you compare it with Tegra 2 and Tegra 3( anyone know the figure for OMAP 4 and QSD 8x60? Though Qualcomm has an integrated baseband so the size will not be directly comparable). The process does play a part as TSMC 40nm v/s Samsung 45nm will be at least 19% smaller (and apparently the density of TSMC's processes in general are higher than Samsung). I read one figure of 122 mm2 for A5, while chipworks say it is more than twice the size of A4, which means its at least a 100 mm2. Tegra 2 is only 49 mm2 and even Quad core Tegra 3 is only 80 mm2. What exactly has Apple put in the A5 to make it so big
Add to that a single rounding mode and only single precision. Does that still look good outside of scientific programming? I agree in most cases you don't care, but given how people poorly understand the limitations of FP numbers, adding some more constraints doesn't seem wise.I doubt most algorithms care. All we're talking about is subnormal support and exception catching in the cases of divide-by-zero, operations with infinity and NaN's. None of which really concern most things outside of scientific computing.
Add to that a single rounding mode and only single precision. Does that still look good outside of scientific programming? I agree in most cases you don't care, but given how people poorly understand the limitations of FP numbers, adding some more constraints doesn't seem wise.
Anyway, my point is not really about the limitations of running FP code on NEON, it's about being forced to use NEON to get decent performance due to a poor FP unit
The A5 is really quite big if you compare it with Tegra 2 and Tegra 3( anyone know the figure for OMAP 4 and QSD 8x60? Though Qualcomm has an integrated baseband so the size will not be directly comparable). The process does play a part as TSMC 40nm v/s Samsung 45nm will be at least 19% smaller (and apparently the density of TSMC's processes in general are higher than Samsung). I read one figure of 122 mm2 for A5, while chipworks say it is more than twice the size of A4, which means its at least a 100 mm2. Tegra 2 is only 49 mm2 and even Quad core Tegra 3 is only 80 mm2. What exactly has Apple put in the A5 to make it so big
I could give exact figures for that cost, but I'd be firedI can tell you the difference of cost in implementation and it may not seem all that "poor" anymore
I'm not asking for anything, I was just explaining some benchmark results and saying what I think of running some random FP code on the NEON unit due to a poorly performing FP unit.For now at least, you can't have your 2-way DP with IEEE compliance and 300mW peak power for FMA.
I'm not asking for anything, I was just explaining some benchmark results and saying what I think of running some random FP code on the NEON unit due to a poorly performing FP unit.
Implementing PowerVR cores at a larger size in exchange for lower power consumption and/or higher clocks/performance would probably be the choice of a lot of semis in the mobile space if available.
Finally iPad2 in the GLBenchmark2.0 database: http://www.glbenchmark.com/phonedetails.jsp?D=Apple+iPad+2&benchmark=glpro20
I suppose you missed the performance preview AnandTech did 5 days ago?
The nicely clocked Mali-400MP4 gives a seriously impressive performance in the Hardkernal ODROID-A at GLBench 2.0.
Its noticeable in the ipad2 results that FSAA has zero penalty on the SGX543 platform, whereas it had a 20-30% penalty on both SGX535 in iphone4 and SGX540 in the galaxy player.