Make sure you consider the variable boost as well. Mostly, you cannot really be sure what clock rates are being compared and some people are really sloppy regarding their benchmarks' reproducibility and comparability.
It's true that we don't have all the data for many of these reviews, though I don't think the boost clock differential can account for this great of variability, unless a reviewer disabled boost on their 1070 sample altogether and did not do the same for their 1080. 33% is the maximum difference of any functional unit comparing 1070 to 1080, a few MHz here or there can't account for the remaining 17-18% seen in some of those other tests, it would need to be quite a large difference (several hundred MHz).