AMD: RDNA 3 Speculation, Rumours and Discussion

Discussion in 'Architecture and Products' started by Jawed, Oct 28, 2020.

Tags:
  1. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,109
    Location:
    New York
    You're probably using a different definition of "burst" than I am. Aside from RT there is rarely any single workload that occupies the GPU for more than 10% of total frame time. And during that time utilization is almost always poor.

    I haven't seen a math bound frame in any game that I've profiled. If you're lucky you'll have one or two workloads during the entire frame that keep the ALUs 50% busy.
     
  2. techuse

    Veteran

    Joined:
    Feb 19, 2013
    Messages:
    1,424
    Likes Received:
    908
    How do Doom and Control fare math wise?
     
  3. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,109
    Location:
    New York
    So one thing I'm not seeing people talk about is what features (aside from RT) will demand big boy performance in the next 2-3 years. In the same way that people are skeptical of RT's performance hit I have similar questions about why some games are so demanding. For example what are Borderlands 3, Control and Star Wars doing that's so heavy?

    [​IMG]

    No idea. I haven't played more than 30 seconds of Doom and don't have Control yet.
     
  4. troyan

    Regular

    Joined:
    Sep 1, 2015
    Messages:
    603
    Likes Received:
    1,122
    Most rasterizing games are not even trying to improve effects like shadows or reflections. So the performance impact of Raytracing should be normal. In certain games like Cyberpunk prior the latest patch RT reflections were faster on Ampere than the "psycho" setting for SSRs.
     
  5. Pinstripe

    Newcomer

    Joined:
    Feb 24, 2013
    Messages:
    153
    Likes Received:
    133
    Bad optimization?
     
    Rootax likes this.
  6. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,210
    In Gears 4, the Insane screen space reflection setting had a huge performance impact that was equal to adding true RT reflections, despite having little effect on visual quality in comparison.

    Similarly, in Gears 5 the software screen space GI setting also has a very massive performance impact despite adding very little to the final image, adding true hardware RT GI or reflections would have yielded a much better image quality outcome with similar or better performance profile.

    In Assassin's Creed Odyssey, Watch Dogs 2 and Borderlands 3, setting volumetric clouds/fog to max plummeted performance badly for little image quality improvement.

    In Watch Dogs 2, Arma 3, Crysis Remastered, using draw distance settings at their max values destroyed performance, because draw distance is CPU heavy, and our current CPUs are not fast enough single threaded wise. So we end up with horrific performance at max settings. Same thing applies to Flight Simulator games, whether 2010 or 2020.

    In Quantum Break, running the game on native resolution destroys performance, the advanced lighting of the game was designed to be performant only when upscaled from lower resolutions.

    Advanced non hardware RT methods for AO always end up costing massive performance, that remained true for VXAO (in Final Fantasy 15 and Rise of Tomb Raider) or Signed Distance Field AO (in Ark Survival Evolved), adding special shadowing techniques from the sun such HFTS (The Division, Watch Dogs 2, Battlefront 2) or PCSS (Assassin's Creed Syndicate) also cost massive performance.

    All of these (and many others) are examples of effects that reduce performance by a huge amount, that can be replaced easily with real RT effects for a massively better image quality gain and/or performance.
     
    #966 DavidGraham, Oct 2, 2021
    Last edited: Oct 2, 2021
    T2098, OlegSH, PSman1700 and 3 others like this.
  7. DegustatoR

    Veteran

    Joined:
    Mar 12, 2002
    Messages:
    3,240
    Likes Received:
    3,393
    Three different games doing different things I'm afraid.
    BL3 is how a game should not be using the GPU in general. (Aka "bad optimization".)
    Control is very shading heavy even without RT, it does a lot of stuff with SDFs and cone tracing in s/w.
    SW is a DX11 game and AMD's DX11 driver is still bad.

    Funnily enough, this one seem to gain a huge performance boost on Ampere - but I haven't seen any benchmarks of it on recent GPUs, going only from my own memory of how the game ran on Pascal/Turing.
     
    DavidGraham and PSman1700 like this.
  8. Malo

    Malo Yak Mechanicum
    Legend Subscriber

    Joined:
    Feb 9, 2002
    Messages:
    8,929
    Likes Received:
    5,528
    Location:
    Pennsylvania
    Singlet player games is where the RT efforts need to be, like Cyberpunk. But of course all the big publishers just want multiplayer live service crap everywhere to milk all the money.

    It also doesn't help adoption and perception of RT when it's so difficult to actually invest it in. My wife and I would have an Ampere by now if we could buy one.
     
  9. PSman1700

    Legend

    Joined:
    Mar 22, 2019
    Messages:
    7,118
    Likes Received:
    3,088
    The whole discussion on ray tracing being relevant or how much its used right now can be had on any new tech really. How many games do really make use of nvme ssd? It aint many. But we can guess that more and more will as time progresses.

    Ye thats a problem, nothings available or its scalped etc. Theres one way to get ahold of ray tracing capable hw though..... laptops if your into that kind of stuff (im not). Its possible to get a 3070 laptop for under 1500 dollars, still alot and too much money but atleast its in stock lol. With a 115w (boost to 130w) 3070m gpu, you'd be looking at 3060Ti or RTX2080 dgpu performance. Not bad in special considering the 1080p/1440p resolutions these tend to run.

    Edit: Ray tracing indeed matters more in SP games yes..... BFV had it though, but then the question comes if you actually had/have an advantage against people not running DXR enabled? Nowadays altering normal settings wont give you any advantage at all, which was the case 15 years ago with BF2, where having low setting for shadows ment you had a visibility advantage.
     
  10. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,210
    Yeah, on a 6900XT it is in the ~50s fps at native 4K, drops to ~40s fps during combat, and this is a five years old game.



    On a 3090 it is considerably better, sticks to ~
    60fps during combat.



    Still, the point being this game represents something like the pinnacle of rasterized lighting the industry can offer, and it runs badly on the monstrous GPUs of today, compare that to something like Metro Exodus which is doing RT GI + reflections and the difference is clear.
     
  11. no-X

    Veteran

    Joined:
    May 28, 2005
    Messages:
    2,451
    Likes Received:
    471
    That's RX 6800 XT. Not RX 6900 XT. Also the CPUs are different (4,7GHz Zen 2 vs. 5,3GHz Cascade Lake).
     
    pharma likes this.
  12. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,708
    Likes Received:
    2,132
    Location:
    London
    I think this is a problematic statement because it's outside of gaming where Ampere ALUs really get a workout. The same games on RNDA 2 (ignoring any ray tracing scenario) would be quite different I imagine. Similarly the same games on Turing and Pascal should be showing more.

    In the profiling tool you use, isn't there a metric for "shader pipe" utilisation? One level up in the hierarchy from ALU utilisation?
     
  13. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,109
    Location:
    New York
    It’s Nvidia’s Nsight profiler and it breaks down FP1/FP2/INT usage. It would be interesting to see a breakdown of a RDNA 2 frame.
     
  14. Leoneazzurro5

    Regular

    Joined:
    Aug 18, 2020
    Messages:
    335
    Likes Received:
    348
    According to this we have N31 for 2023 but something else in RDNA3 line (high end? That would be N32?) for 2022...


    But the same post says also Enthusiast Lovelace for 2023...
     
  15. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    who is that.
    why would anyone drag that here?
     
  16. Leoneazzurro5

    Regular

    Joined:
    Aug 18, 2020
    Messages:
    335
    Likes Received:
    348
    Well Greymon55 is also "confirming" that.



    About these leakers' reliability, IDK.
    But, starting a new architecture with the lower end card would be quite strange to me.
     
  17. Rootax

    Veteran

    Joined:
    Jan 2, 2006
    Messages:
    2,400
    Likes Received:
    1,845
    Location:
    France
    Honestly, given the semiconductor situation, everything planning for 2022 might as well slide to 2023...
     
  18. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    797
    Likes Received:
    1,622
    I already explained how. Besides, not all games are AAA titles on consoles, and they are not even the most popular or profitable.

    That's not half that bad as you're trying to imply.
    Ray-tracing introducing noise is a wrong assumption. Discretization, monte carlo integration and stochastic sampling cause noise, this stuff is required for physically based reflections/shadows/etc with both ray-tracing or rasterisation.
    Get rid of these concepts and treat all materials as perfect mirror or perfect diffuse and you're done with noise, but that's a derp solution.
    As for denoisers, they blur rough surfaces where noise happens, but that's not that critical because integrating 1000s of samples from all possible directions would still provide very blurry reflection on rough surfaces (because reflections on rough surfaces must be blurry, that's exactly what we are doing mathematically by integrating and averaging many samples)

    Another wrong assumption, guess who will pay one way or another for ever increasing development costs?
    Business will cover expenses with micro transactions, microservices, loot boxes, DLCs, you name it.

    I would cosider geometry draw calls a burst workload, these are usually way below 1 ms, cache flushes and state changes can be required in-between draw calls and other overheads are possible, these are usually small, burst and with low utilization (that's why async compute is usually overlapped with them)

    By math bound I simply mean that frame performance is limited by any computations on GPU die, doesn't matter whether it's fixed functions blocks or SIMDs, obviously, 100% SIMD ALU utilization is a very rare case.
    Following the roofline model, what I can say for sure is that most of frames are never bound by vram bandwidth.
    There are plenty of articles on memory frequencies performance impact, performance never scales linerly with memory frequencies for obvious reasons (the roofline model).
    Also, regression models usually converge to low coefficients for the bandwidth metric, way lower in comparison with other metrics (especially when you combine all GPU metrics together), which shows that games are rarely bandwidth bound in general.
     
    T2098, SpeedyGonzales and PSman1700 like this.
  19. yuri

    Regular

    Joined:
    Jun 2, 2010
    Messages:
    283
    Likes Received:
    296
    Assuming the previous RDNA3 leaks were true, the highend uses an expensive packaging tech. This makes it prone to delays unlike the lowend RDNA3 which uses the classic single die approach.

    Active bridge chip, TSMC's 3D IC stacking, TSMC 5nm node, new microarch debugging, difficulties in TDP management, etc. - any of those can make it slip.
     
  20. Leoneazzurro5

    Regular

    Joined:
    Aug 18, 2020
    Messages:
    335
    Likes Received:
    348
    This I know but generally is also true that the bigger dies are the ones usually coming earlier, and bigger dies are more difficult to manufacture, too. SO the reason they come first is the halo effect for the marketing. In this case we would have a cheaper RX6900XT, a good feat or a "midrange" card but without the halo.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...