GfxBench 3.0 just became available to iOS's App Store (it was only available to Android's Play Store when the benchmark first went live), so scores should marginally creep up as more people get to run it and also get to run it multiple times where the devices have been "warmed up" with more optimal memory accesses.
As mentioned before, too many uncontrolled variables for power consumption and lack of comparability among devices might not make for meaningful results in the battery test, but the performance stability test, hopefully doing a good job accounting for the performance degradation from thermal throttling, should really be a meaningful test by which to compare.
Funny to see the 550 MHz Adreno 330, "AB" binned MSM8974 version of the Xiaomi MI3 finally show up as the new benchmark hits. Scores really well, too; quite a lot higher than the earlier Tegra 4 version.