Frenetic Pony
Veteran
So the Fable Legends Dx12 Benchmark is out: http://www.anandtech.com/show/9659/fable-legends-directx-12-benchmark-analysis
And quite usefully has what is essentially a GPU profiler in the results.
I know the "Dynamic Global Illumination" is in this title handled via compute shader. It's interesting to note which IHV wins what. Clearly for compute shader performance Nvidia's high end GPU has the performance advantage. Same with GBuffer creation, which I would assume is geometry bound.
For Pixel operations on the other hand we can see AMD has the advantage, eg Transparency/Effects, Dynamic Lighting, and Post Processing. Which is unsurprising as AMD GPUs have scaled better with resolution to begin with.
Another interesting thing to note is scaling between GPU's on the same, or rather similar, architecture
Here we see the 980ti scaling better compared to the 980 than the Fury X scales compared to the 390x.
Since it's hard to see I'll just post the numbers. Gbuffer Creation, 4.4ms/6.4ms for Ti/980. 6.5ms/7.0ms Fury/390. Dynamic lighting 7.1/9.6 Ti/980. 6.7/7.5 Fury/390. Gi is probably fighting for resources so irrelevant. Compute 1.2/1.9 Ti/980. 2.2/3.0 Fury/390. Transparency 6.0/8.0 Ti/980. 5.4/7.0 Fury/390.
As we can see, for GBuffer creation Ti scales 45% better, while the FuryX scales only 7% better. Dynamic Lighting Ti scales 35% better, FuryX scales 10% better. Compute scales almost 60% better for the Ti, while only scaling to 36% better for the FuryX. Transparency scales 33% better on Ti, while scaling 30% better on FuryX.
From this we can see that, as some proposed, the Fury X is highly bound by certain bottlenecks. While it scales for transparency just fine as does the 980Ti, other things don't scale much at all, at least at 4k on this benchmark, while the 980ti scales quite well across the board.
This quite possibly bodes well for AMD's 4xx series, assuming they get back to properly balanced designs like the 2/390 series rather than the Fury.
And quite usefully has what is essentially a GPU profiler in the results.
I know the "Dynamic Global Illumination" is in this title handled via compute shader. It's interesting to note which IHV wins what. Clearly for compute shader performance Nvidia's high end GPU has the performance advantage. Same with GBuffer creation, which I would assume is geometry bound.
For Pixel operations on the other hand we can see AMD has the advantage, eg Transparency/Effects, Dynamic Lighting, and Post Processing. Which is unsurprising as AMD GPUs have scaled better with resolution to begin with.
Another interesting thing to note is scaling between GPU's on the same, or rather similar, architecture
Here we see the 980ti scaling better compared to the 980 than the Fury X scales compared to the 390x.
Since it's hard to see I'll just post the numbers. Gbuffer Creation, 4.4ms/6.4ms for Ti/980. 6.5ms/7.0ms Fury/390. Dynamic lighting 7.1/9.6 Ti/980. 6.7/7.5 Fury/390. Gi is probably fighting for resources so irrelevant. Compute 1.2/1.9 Ti/980. 2.2/3.0 Fury/390. Transparency 6.0/8.0 Ti/980. 5.4/7.0 Fury/390.
As we can see, for GBuffer creation Ti scales 45% better, while the FuryX scales only 7% better. Dynamic Lighting Ti scales 35% better, FuryX scales 10% better. Compute scales almost 60% better for the Ti, while only scaling to 36% better for the FuryX. Transparency scales 33% better on Ti, while scaling 30% better on FuryX.
From this we can see that, as some proposed, the Fury X is highly bound by certain bottlenecks. While it scales for transparency just fine as does the 980Ti, other things don't scale much at all, at least at 4k on this benchmark, while the 980ti scales quite well across the board.
This quite possibly bodes well for AMD's 4xx series, assuming they get back to properly balanced designs like the 2/390 series rather than the Fury.