Is there a description of what the benchmark contains versus a gameplay scenario?
The priorities for a canned benchmark and a consistent interactive experience are not the same, and the structure and effort put into the two can differ.
Stronger resource demands along certain axes might be avoided in the general game because it would lead to performance drops or responsiveness issues across varied hardware or unexpected peaks in engine demand, which a benchmark wouldn't care about.
It is possible there is optimization for the readily targeted built-in benchmark, but I also find it plausible that it could be a scenario where an architecture with generally inferior optimization in general could start falling down in less structured scenarios due to unexpected complexities.
The priorities for a canned benchmark and a consistent interactive experience are not the same, and the structure and effort put into the two can differ.
Stronger resource demands along certain axes might be avoided in the general game because it would lead to performance drops or responsiveness issues across varied hardware or unexpected peaks in engine demand, which a benchmark wouldn't care about.
It is possible there is optimization for the readily targeted built-in benchmark, but I also find it plausible that it could be a scenario where an architecture with generally inferior optimization in general could start falling down in less structured scenarios due to unexpected complexities.