It'd be impossible to try and reproduce everyone's exact test conditions anyway. You'd have to build a machine with the same hardware, use the same OS, have the same drivers, etc etc. That's going to cost too much money, and take too much work. Instead, you could collect information about lots of different reviews, and create charts that show which ones used what drivers, systems, etc. and compare their clock speeds. You could comment on the quality settings they used, wether or not they accounted or checked for a certain cheat with the benchmark(s) that they ran.
The goal of this site wouldn't be to necessarily "prove" that other reviews are incorrect, but to collect information about many reviews and see deviations, find out who is doing their homework when testing (do they check for cheats? Do they use new drivers? Are their scores consistent accross reviews?) There is a lot of information out there that could be made useful.
Nite_Hawk