It would be interesting to see the reception if those leaks are true and the deltas trend over to general game performance. What would people value more? A noticeable bump in rasterization or one in RT?
The problem is, one may become indistinguishable from the other as more and more games become RT enabled thanks to the consoles. It'll be a shame if Big Navi does hold up to these performance rumours but is let down in RT performance. I'll be genuinely torn as to which to get.
For those wondering about the differences in Time Spy and Fire Strike performance, this is straight from the 3dmark technical guide:
[Attachment 4819]
My guess would be that on modern GPUs Fire Strike is going to be more of a fillrate test, or something.
Edit: Looking at clock speed, potential ROP count (128) and these Fire Strike results, AMD has the potential to be the ultimate "competitive settings" GPU for CS:GO, Valorant, Fortnite, Apex etc., where people tend to play on very low settings and lower resolutions. "Ultra settings" comparisons are where things are looking close. AMD still might win there too, but ray tracing may muddy those waters. I'm curious whether Igor's Lab's Port Royal scores included DLSS for the Nvidia GPU. I'd like to see native vs native numbers for ray tracing.
AMD is a polygon monster. It has 8 rasterizers at 2.3 GHz. When I look at the values between the two benchmarks, the old one (Fire Strike) was polygon bound, while the newer benchmark (Time Spy) is shader bound. Interesting that AMD can keep up with 5,000 shaders against Nvidia's 10,000 shaders.
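For a rough sense of scale, here is a back-of-the-envelope sketch of the peak rates those figures would imply. It takes the 8 rasterizers at 2.3 GHz from the post above and the rumoured 128 ROPs mentioned earlier in the thread. The one-triangle-per-rasterizer-per-clock and one-pixel-per-ROP-per-clock rates are assumptions for illustration, not confirmed specs, and `peak_rate` is just a hypothetical helper name.

```python
# Theoretical peak throughput = units * clock * work per unit per clock.
# The unit counts and clock come from the thread (rumoured, unconfirmed);
# the per-clock rates of 1.0 are assumptions made for this sketch.

CLOCK_GHZ = 2.3     # clock quoted in the thread
RASTERIZERS = 8     # rasterizer count quoted in the thread
ROPS = 128          # "potential" ROP count quoted in the thread


def peak_rate(units: int, clock_ghz: float, per_clock: float = 1.0) -> float:
    """Return a theoretical peak rate in billions of items per second."""
    return units * clock_ghz * per_clock


triangle_rate = peak_rate(RASTERIZERS, CLOCK_GHZ)  # Gtriangles/s (assumed 1 per clock)
pixel_rate = peak_rate(ROPS, CLOCK_GHZ)            # Gpixels/s (assumed 1 per clock)

print(f"peak triangle rate: {triangle_rate:.1f} Gtri/s")
print(f"peak pixel fillrate: {pixel_rate:.1f} Gpix/s")
```

These are ceilings, not measured numbers. Whether a benchmark is actually polygon bound or fillrate bound depends on the workload, so this only shows why those two unit counts matter for the argument.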
Anybody understand this:
[PATCH] drm/amdgpu: correct the cu and rb info for sienna cichlid
https://lists.freedesktop.org/archives/amd-gfx/2020-October/055069.html
This seems promising. But if the 6800XT alone is so fast, why did AMD show benchmarks at the Zen 3 launch where Navi is a decent margin slower than the 3080? I'd love it to be as fast as these rumours are suggesting, not least for the sake of pricing, but I think it might be a bit too soon to jump on this hype train.
https://www.3dcenter.org/news/hardware-und-nachrichten-links-des-22-oktober-2020
The recent report on the performance of "Navi 21 XT" in the 3DMark13 FireStrike Ultra benchmark has to be put into perspective somewhat: the RDNA advantage in this benchmark is not just a few percentage points but is actually very significant. According to comparative values, a Radeon RX 5700 XT scores about 19% better here than it normally should relative to a general performance index. Specifically, in this benchmark that AMD card even beats a GeForce RTX 2080, although the RTX 2080 is usually clearly faster than the Radeon RX 5700 XT. Whether the upcoming RDNA2 cards show the same affinity for this 3DMark13 test, and whether their performance scaling in the high-end field is as good as with the Radeon RX 5700 XT, can only be speculated at this time. But at least there is a clear indication that the 3DMark13 FireStrike Extreme is probably not particularly suitable for assessing "normal performance" between AMD and nVidia graphics cards.
3DCenter says Fire Strike already overperforms by 19% on RDNA1, so this might carry over to the next generation. They suggest not taking RX 6000 performance in this particular benchmark as indicative of general performance. That's what I read from the original German posting.
Don't seem to understand this. Is there an issue with the 3DMark13 FireStrike benchmark?
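To make the 3DCenter point concrete, here is a small sketch of how one might discount a Fire Strike result by that bias. It assumes the roughly 19% RDNA1 overperformance carries over unchanged to RDNA2, which 3DCenter itself says is only speculation. `discount_score` and the sample score are hypothetical and not taken from any leak.

```python
# Discount a benchmark score by a known overperformance factor.
# ASSUMPTION: the ~19% RDNA1 bias in Fire Strike applies equally to RDNA2.

FIRE_STRIKE_BIAS = 0.19  # RX 5700 XT overperformance reported by 3DCenter


def discount_score(score: float, bias: float = FIRE_STRIKE_BIAS) -> float:
    """Scale a score down to what a 'general performance' index would imply."""
    if bias <= -1.0:
        raise ValueError("bias must be greater than -100%")
    return score / (1.0 + bias)


sample_score = 11900.0  # placeholder value, NOT a leaked or measured score
print(f"raw: {sample_score:.0f}, discounted: {discount_score(sample_score):.0f}")
```

Note the division by 1.19 rather than a multiplication by 0.81: a card that scores 19% above its index sits at 119% of the baseline, so undoing that takes off about 16% of the raw score, not 19%.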
The original comparison was with a hypothetical RT block that only gave intersection results while not performing traversal, which would leave the SIMD in a position where determining the next node addresses would require explicit vector memory reads to data that would have been fetched and parsed by the RT unit already. AMD's method is at least less redundant than that.
AMD's patent doesn't clearly outline where the process resides for the intermediate work between node evaluations. It highlights that the SIMD and CU have substantial storage available at no additional cost versus the likely hardware footprint of implementing sufficient storage on an independent unit.
AMD's comparison is between its hybrid method and a dedicated unit that might be able to traverse a BVH to arbitrary depths without redoing traversal due to losing the full context of what had been traversed already.
Nvidia's scheme appears to have a traversal stack of finite depth that can lead to redundant node traversal, which makes it less expensive than what AMD was using as its baseline.
Whether AMD's method leverages registers, LDS, or possibly spills to memory isn't spelled out. Even if there were spills to memory, writing out data based on pointers and metadata from completed RT node evaluations to something like a stack seems like it could be less disruptive than the SIMD re-gathering node data on its own.
Compressed wide bounding volume hierarchies can significantly improve the performance of incoherent ray traversal, through a smaller working set of inner nodes and therefore a higher cache hit rate. While inner nodes in the hierarchy can be compressed, the size of the working set for a full traversal stack remains a significant overhead. In this paper we introduce an algorithm for wide bounding volume hierarchy (BVH) traversal that uses a short stack of just a few entries. This stack can be fully stored in scarce on-chip memory, which is especially important for GPUs and dedicated ray tracing hardware implementations. Our approach in particular generalizes the restart trail algorithm for binary BVHs to BVHs of arbitrary widths. Applying our algorithm to wide BVHs, we demonstrate that the number of traversal steps with just five stack entries is close to that of a full traversal stack. We also propose an extension to efficiently cull leaf nodes when a closer intersection has been found, which reduces ray primitive intersections by up to 14%.
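The trade-off the abstract describes (a tiny stack plus restarts versus a full stack) can be illustrated with a toy sketch. This is not the paper's algorithm: children are visited in fixed order rather than ray order, a boolean `hit` flag stands in for the ray/box test, and the trail is a plain list of child indices instead of the paper's compact encoding. All names here (`Node`, `build`, `traverse_short`, `next_from_trail`) are made up for illustration.

```python
from collections import deque
from dataclasses import dataclass, field
from itertools import count
from typing import List, Optional


@dataclass
class Node:
    children: List["Node"] = field(default_factory=list)
    leaf_id: Optional[int] = None  # set on leaves only
    hit: bool = True               # stand-in for the ray/box test


def build(depth, width, ids):
    """Complete tree of the given width; leaves are numbered in DFS order."""
    if depth == 0:
        return Node(leaf_id=next(ids))
    return Node(children=[build(depth - 1, width, ids) for _ in range(width)])


def traverse_full(root):
    """Reference traversal with an unbounded stack."""
    out, stack = [], [root]
    while stack:
        node = stack.pop()
        if not node.hit:
            continue
        if node.leaf_id is not None:
            out.append(node.leaf_id)
        stack.extend(reversed(node.children))
    return out


def next_from_trail(root, trail):
    """Re-walk the recorded path from the root, then step to the next sibling."""
    path = [root]
    for i in trail[:-1]:
        path.append(path[-1].children[i])
    for depth in range(len(trail) - 1, -1, -1):
        if trail[depth] + 1 < len(path[depth].children):
            del trail[depth + 1:]
            trail[depth] += 1
            return path[depth].children[trail[depth]]
    return None  # nothing left anywhere on the path: traversal is complete


def traverse_short(root, capacity):
    """Same visiting order as traverse_full, but with a bounded stack."""
    out, restarts = [], 0
    stack = deque(maxlen=capacity)  # a full deque silently drops its oldest entry
    trail = []                      # trail[d] = child index taken at depth d
    node = root
    while node is not None:
        if node.hit and node.leaf_id is not None:
            out.append(node.leaf_id)
        if node.hit and node.children:
            depth = len(trail)
            # Push later siblings so the nearest one ends up on top.
            for i in range(len(node.children) - 1, 0, -1):
                stack.append((node.children[i], depth, i))
            trail.append(0)
            node = node.children[0]
        elif stack:
            node, depth, i = stack.pop()
            del trail[depth:]
            trail.append(i)
        else:
            # Stack ran dry: recover the next node by restarting from the root.
            node = next_from_trail(root, trail)
            if node is not None:
                restarts += 1
    return out, restarts


root = build(3, 4, count())
root.children[1].hit = False  # pretend the ray misses one subtree
for capacity in (0, 2, 5, 16):
    leaves, restarts = traverse_short(root, capacity)
    print(capacity, leaves == traverse_full(root), restarts)
```

Every restart re-walks nodes that were already visited, which is the redundant traversal discussed above. The smaller the stack, the more restarts, and the paper's claim is that in its scheme a handful of entries already gets close to the full-stack step count.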
3DCenter cites their calculation basis in their linked article. According to that, it's not an outlier or cherrypicked result, but an average FS Ultra score vs. their calculated 4K performance index, which is based on the results from Kitguru, Overclockers Club and Tweakers.
Yes 5700XT does very well in FSU, though I don't think it's anywhere near 20%, that's probably a cherry picked AIB OC result. Average score is closer to 2070s. OTOH leaked score could be cherry picked AIB OC results too.
I'm running a Ryzen 3900X. If you look at the PSU recommendation for the RTX 3080, it is a 750W PSU. The RTX 3070 already recommends a 650W one for 220W. GPU power consumption has been increasing for a while from its usual 150W to 180W for Gx104 class models. My PSU is nothing special, a Corsair VS650 I think.
Seems to imply that Navi 21 is the only GPU where ROPs and shader arrays are disabled.
Also what is up with that update patch and CU count? Jawed redemption arc or just BS?
If Navi 2x GPUs have a monster last level cache, then console games will not be built to take advantage of it, because consoles don't have such a monster cache.
Since the consoles are based on RDNA2, the RT performance could actually be better on Big Navi in the long run as games could be optimized for it.