it does not align every time that's why it's weird and should be taken with caution, because if as some say here in photo mode only GPU power talks, why is that in the corridor scene the difference is negligible ?
And that's fine it doesn't; there are loads of factors in place that will change with each title.
A quick story: A coin has 2 sides. If I asked you what the probability of getting heads is, you'd say 50%. This is a reasonable assumption, why would anyone assume that it's greater >50%?
If I told you I will flip it 20 times, do you think you will get 10 heads and 10 tails? Probably, but it's not exact either. What's the probability that you get 15 heads and 5 tails? Probably not very high, but that probability is still present. Eventually if it continues to stay lop sided, you need to check for bias, that bias being that heads is >50% chance of landing.
This is about all we can do. Right now, people are making calls based on 10 coin flips. We're a far way from anything statistically significant. You need to flip 100-200 times to get that 50% landing really. 20 coin flips is still like 5.6% chance you'll get 14 or more heads out of 20 flips. Which is daunting if you had 3 girls and you want a boy or vice versa, It's still quite high you'll get 1 more girl/boy.
When people made claims on XSX outperforming PS5, this was a debate on the null hypothesis really; it's like saying heads should be 50% chance of landing. That's the null hypothesis, until we start getting real data from flipping.
12 > 10, and 560 > 448. 52 CUs > 36 CUs for RT workloads. That's the null hypothesis, XSX should be biased towards being better. Data right now is not reflecting that, sometimes it is, here and there, and most of the time it seems much closer to them being a draw based on how low the consoles dips (without knowing how high the consoles can ride). A good example is that everyone assumed based on the first video of Control that PS5 was outperforming XSX just looking at stutters, and with camera mode we see the opposite with the framerates unlocked. Now a clear winner is present. So are we basing claims on good information really? That being said, if you want to be real about it; if people are comparign PS5 and XSX, sample testing should at least deviate the 2 by 5% in performance to separate them. Otherwise, we'd consider it a draw. And those that have been advocating for 'draw' has been correct all this time. Though there are some actual 5% wins earlier on for PS5, and hitman and control camera modes are more than 5% wins.
It's clear that there are issues on the XSX side of things, what those issues are, are relatively unclear still. Many are quick to point to alpha transparency, ROP performance, fixed function performance, etc, but in this academic benchmark, XSX wins clearly in all scenarios. And while we don't play synthetic benchmarks, it does tell us that our null hypothesis seems still worth keeping, and that the issues with XSX needs to be observed as to why it's performing at draw with PS5. Yes it's entirely possible that because PS5 is the lead console, they will always choose a performance profile in which the PS5 will never dip, and cap the higher end so that XSX can not exceed further so our only measurement of a win is based on loss frame rates. This is reality and that is console business. However, it works both ways. We saw Hitman choose a performance profile that catered around maximizing the XSX and PS5 was ultimate had it's ceiling performance lowered. It had less dips sure, but in the grand scheme of things, it rendered a continual 44% less pixels for the entirety of the game.
And the corridor of doom, because it's RT limited. And RT is really bad on consoles.
So if we take a bias that XSX is supposed to be on paper 20% better than PS5.
Pure math here then
Probability of XSX = 54.45% beating PS5
In reverse
Probability of PS5 = 45.45% beating XSX
They are really much closer than people think: out of 100 flips XSX should win 54.45% of them and PS5 winning 45.45% of them. I know it doesn't actually work like that in practice, but something to consider while we are lacking information.
Let's assume that out of 20 titles, PS5 wins 14 of them for instance
(20 choose 14) * 0.45^14 * 0.54^6 = 1.6% chance that this would occur. So if this does happen, we'd have to look into re-formulating our null hypothesis.
Hope this helps.