It's possible. DF classified the differences between upscalers into about a dozen items, and each item gets a score. They already did it in a very simple and elegant way.
What reviewers can do is pick 13 games, cover each one separately in a side piece, and compare the upscalers on image quality (they already do this for new games, but for performance only), then use the conclusions from these side pieces to establish the benchmarking rules for the next review.
For example, if they already established that DLSS Performance in Warhammer Darktide is better than FSR Quality, then all NVIDIA GPUs will be tested with DLSS Performance and compared against AMD GPUs running FSR Quality.
Another example: if they established that DLSS Quality is in fact better than native in Spider-Man, then all NVIDIA GPUs will be tested with DLSS Quality, and other GPUs will be tested at native.
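To make the idea concrete, here is a rough sketch of what such a rules table could look like in Python. The names (`RULES`, `pick_setting`) are made up for illustration, the two entries just mirror the "if" examples above rather than any measured result, and falling back to native when a game or vendor has no rule is my own assumption.

```python
# Hypothetical per-game benchmarking rules, as they might come out of the side pieces.
# The entries mirror the two "if" examples in the post; they are not measured results.
RULES = {
    "Warhammer Darktide": {"NVIDIA": "DLSS Performance", "AMD": "FSR Quality"},
    "Spider-Man": {"NVIDIA": "DLSS Quality"},
}

# Assumption: any game/vendor pair without an established rule is tested at native.
DEFAULT_SETTING = "Native"


def pick_setting(game: str, vendor: str) -> str:
    """Return the setting a GPU from `vendor` would be benchmarked with in `game`."""
    return RULES.get(game, {}).get(vendor, DEFAULT_SETTING)


if __name__ == "__main__":
    for game in RULES:
        for vendor in ("NVIDIA", "AMD", "Intel"):
            print(f"{game:20} {vendor:7} -> {pick_setting(game, vendor)}")
```

The point is that the quality comparison is done once per game in the side piece, and every later review just looks the rule up instead of arguing about it again.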
We need to cater for the actual user experience, not some idealized, far-from-reality outcome. Most UE5 titles now require upscaling to perform in a satisfying way. Most games have a terrible TAA implementation, and thus terrible native image quality, and need DLSS or TSR to cover the shortcomings of TAA. Most users with average hardware use upscaling to gain performance in heavy titles. That's the reality of the situation: native is no longer what users want in most cases. If reviews don't factor this into their process, then they are detached from reality and need to account for the new variables.