AMD Radeon RDNA2 Navi (RX 6500, 6600, 6700, 6800, 6900 XT)

Discussion in 'Architecture and Products' started by BRiT, Oct 28, 2020.

  1. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,244
    Likes Received:
    4,465
    Location:
    Finland
    RTX 3080 with RT is about 78% of without, RTX 2080 Ti 75%, RX 6800 XT 80% and RX 6800 83%.
    The differences aren't big, but both Radeons lose relatively slightly less performance, if they were as bad as some seem to suggest that shouldn't happen. Yes, Ampere is stronger in RT, but the difference can't be worlds apart like suggested in this thread.
     
  2. Digidi

    Regular

    Joined:
    Sep 1, 2015
    Messages:
    428
    Likes Received:
    239
    News from hothardware:



    The answer of the rasterizer let me get more ??? This is teh answer from HotHardware from AMD Scott Herkelman:

     
    #1242 Digidi, Nov 21, 2020
    Last edited: Nov 21, 2020
  3. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,708
    Likes Received:
    2,132
    Location:
    London
    4K performance is not looking great. Being great at 1440p when your competitor is great at 4K doesn't make a halo product.

    If it's because of a lack of FLOPS or because of low cache hitrate, is that because the GPU is shading too many pixels that aren't seen?

    I don't know how we'll ever find out, when it comes to games.

    10 years ago those tessellation comparisons were interesting. Unigine Heaven was interesting too.

    AMD defaults to "AMD optimised tessellation" these days, as I understand it. It could be having an effect on performance and IQ in current games, but there's no tech journalism these days that goes that deep as far as I can tell.

    Another point of comparison is geometry shading. It turned out to be a dumb idea. NVidia scorched ahead in very specific synthetics - because the GS export data didn't leave the chip.

    I did watch and wasn't edified.

    In the end AMD beat NVidia by less than 5% in performance per watt (though there's more memory on 6800XT) and for the time being, 4K performance is not very good. 6900XT is not really going to make that better, either, since 20% more FLOPS (and other substantial advantages) in 6800XT versus 6800 brings about 11% more performance on average (varying from 4 to 19% based on Techspot).

    That chart is almost as scummy as the NVidia equivalent, both making 2x performance per watt claims by cherry picking places on the curves that don't relate to the best performing cards being compared.

    I'm glad to see that laptops is a place AMD can compete again, but they've plunked themselves back at Fury X versus 980 Ti in halo performance terms...
     
  4. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,111
    Location:
    New York
    Really interesting video thanks for sharing. Sharing within a workgroup is standard fare for compute APIs so it’s surprising that Turing prefers mesh shader workgroup size of just a single 32-wide wavefront. Is there some special sauce in RDNA that enables more efficient cross group data sharing beyond the usual LDS stuff?

    The presenter emphasized that it’s hard to write mesh shaders that beat dedicated geometry setup and culling hardware units at their own game. But mesh shaders came very close. Maybe those hardware units go away next generation and all the old geometry pipeline stuff will be emulated on general compute shaders.
     
  5. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    797
    Likes Received:
    1,623
    Or simply bugged AMD drivers, which CD Project didn't have time to debug for since the cards were just released?:roll:
     
    PSman1700 likes this.
  6. xEx

    xEx
    Veteran

    Joined:
    Feb 2, 2012
    Messages:
    1,060
    Likes Received:
    543
    Nvidia "second gen" RT have the same performance than the first one, less than 5% diff at best cases.

    In today's world I think it's better to have better rasterization than RT since the number of games with RT are very limited and the number of games with useful RT are like unicorns. And all of the "but future proof!" is no sense to me. Nvidia is barely making it in tern of performance and AMD is barely barely making so I don't think today's GPU will be able to use full RT in next gem games.

    With that said AMD have the advantage of games RT implementation being though with AMD capabilities and limitations in mind and then (maybe) added extra effect for the PC ports. So I think AMD RT will age better than Nvidia's.


    I really have no idea why we have RT GPUs...*at least* we are 1 gem away from truly usable perf. and the space on Die could be use to have faster and better GPUs capable of more "traditional" effect that would be more useful than slightly better shadows that eat 20-30-50% of the frame rate just so you can "admire" the difference in a static pic in a side by side comparative. Ridiculous...
     
  7. boipucci

    Newcomer

    Joined:
    May 31, 2019
    Messages:
    42
    Likes Received:
    22
    Yes anything that justifies disabling a whole SE but taking into account probability theres more chances for defects to spread across CUs of all 4SEs vs crippling defects on just 1SE while the other 3SE remain intact. Wouldn't 64CU & 56CU configurations be more likely?
     
  8. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,708
    Likes Received:
    2,132
    Location:
    London
    Frankly I was gobsmacked by this. If the video was from autumn 2018, just after Turing had arrived, I'd consider it an "early code" problem.

    Ampere might be a whole lot "better". Honestly, I'm suspicious, so while it's a useful example of wild divergence between two platforms, I am doubtful that it'll have a substantial impact in games.

    I'm not sure how mesh shading will come to PC games, because it looks like the fallback for older hardware isn't easy (whoops, performance sucks, never mind). I haven't spent much time on this subject. Maybe incompatible hardware is simply capped at "medium" geometry?

    My understanding of mesh shaders is that they are compute shaders with some connectivity in hardware to link up with the rasteriser. I don't really understand the hardware model of mesh shaders, though. It's useful to start with the mental model: "a mesh shader needs no geometry to consume", so it's an entirely compute-centric view of geometry.

    Also, we're now learning that RDNA 2 features heavy primitive shader usage ("NGG") in place of traditional old geometry pipeline stages. The driver is re-writing lots of code, it seems. It's unclear to me how much like "compute shader" that is.

    Will there ever be a D3D13? To remove things like VS?... Dunno.
     
  9. Rootax

    Veteran

    Joined:
    Jan 2, 2006
    Messages:
    2,400
    Likes Received:
    1,845
    Location:
    France
    You have to start somewhere. You can't allocate time and money on RT, for years without testing it "in the field" on a commercial product imo. First T&L implentation (in a gaming card) was not great, first shaders neither, etc.
     
    HLJ, tinokun, DegustatoR and 4 others like this.
  10. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,708
    Likes Received:
    2,132
    Location:
    London
    It's quite a bit better in absolute frametimes. I think the "percentage performance loss when activated" metric is junk. This isn't MSAA, this is a whole new ballgame.

    I was hopeful that Ampere would be 50-100% faster than it turned out. It seems that bandwidth killed the ray (dio) star.

    Apparently AMD decided to spend very little die space on ray acceleration... Do we even know how much space NVidia has spent?

    If bandwidth/latency really are the killers (demonstrated by Ampere?) then we're going nowhere without substantially cleverer algorithms. We're not going to get 2TB/s bandwidth in consumer cards any time soon.

    Using yet more bandwidth by exporting ray query results to memory (DXR 1.0, as I understand it) to be consumed by the next shading pass seems painfully like the failed experiment that was GS.
     
    Lightman likes this.
  11. xEx

    xEx
    Veteran

    Joined:
    Feb 2, 2012
    Messages:
    1,060
    Likes Received:
    543
    Yes you can and it's exactly what they did. and both decided to use their user base and beta tester and making them pay extra premium price for that "privilege".

    No. It's only better because it starts higher. We are talking about the performance on the RT cores here. With Ampere at best they are less than 5% faster.

    I prefer AMD approached. for today's games. If they can get a DLSS competitor then their RT will be "usable enough" for at least 1440p. But that is a big if.
     
  12. Rootax

    Veteran

    Joined:
    Jan 2, 2006
    Messages:
    2,400
    Likes Received:
    1,845
    Location:
    France
    You quoted me for the "Apparently AMD decided to spend very little die space on ray acceleration... Do we even know how much space NVidia has spent?", but I never said that :D

    I don't get it. If you "could", then neither amd or nvidia would have RT right now. Maybe my first sentence was wrongly formulated...

    What I said was you can't have a perfect solution from the start when it's a new field. And if you decide to wait for ... The right process node, another big leap in the theoretical field, idk, then you will never release it... Devs/games will eat what you give them, it will never be enough anyway...

    Now, as a consumer yeah you can say "it's not fast enough yet, I won't weight RT in the balance", and that's perfectly fine. But nVidia, AMD, Intel, Imgtech whoever ... have to start somewhere...
     
    HLJ and PSman1700 like this.
  13. LordEC911

    Regular

    Joined:
    Nov 25, 2007
    Messages:
    877
    Likes Received:
    208
    Location:
    'Zona
    Not when you also take into account binning for performance.
    Edit- Or in this case, binning for performance differentials.
     
  14. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,111
    Location:
    New York
    No matter when they chose to bring hardware accelerated RT to market the fact is that there would be zero game support on day one and it would be many years before there is wide market adoption. So given those facts what percentage of die area would you suggest be dedicated to the first RT implementation? 25%, 50%?
     
    tinokun, Lightman, PSman1700 and 2 others like this.
  15. Man from Atlantis

    Regular

    Joined:
    Jul 31, 2010
    Messages:
    960
    Likes Received:
    853
    This may be not best the place but comparing TU102 and TU116;

    TU102
    a single Dispatch and BVH transversal area is ~1.4065mm2, TU102 has 36 TPC of which Dispatch and BVH transversal area in total is ~50.6325mm2 , 6.68% of total die size(754mm2).

    TU116 Doesn't have RT accelerators,
    a single Dispatch area is around ~0.5873mm2, TU116 has 12 TPC of which Dispatch area in total is ~7.0471mm2 , 2.4814% of total die size(284mm2).

    Minus TU102's Dispatch and BVH transversal area and TU116's Dispatch area is ~0.8192mm2.

    Finally, for TU102 that means ~29.4912mm2 BVH transversal in total, which is 3.9% of total die size. This is only a comparison of BVH area there're probably other regions that needs to be beefed up for RT acceleration.

    TU102 die shot with annotations
    TU116 die shot with annotations
     
    Jawed, Lightman and T2098 like this.
  16. Digidi

    Regular

    Joined:
    Sep 1, 2015
    Messages:
    428
    Likes Received:
    239
    Anybody know what Scott Herkelman wants to say with this sentence?

    P.s. please share and like the interview. Only with these interviews we get contact to persons inside amd.

     
    Lightman, PSman1700 and pharma like this.
  17. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,211
    Nothing is suggested in this thread, game benchmaks, synthetics, 3D Mark, word of developers and actual technical details point to that fact exactly.

    Dirt 5 uses very low amount of RT, that means the game is bound by rasterization not by RT, which won't make it the optimal case of showing RT differences, which leads to me believe that AMD will use that tactic a lot in their sponsored games, fill them with low effort RT effects just to claim they support DXR with good enough performance.

    Even their marketing appears to be aware of their modest RT solution when they stated they target 1440p, which was the target of Turing GPUs 2 years ago.
     
    Cuthalu, Picao84, PSman1700 and 2 others like this.
  18. boipucci

    Newcomer

    Joined:
    May 31, 2019
    Messages:
    42
    Likes Received:
    22
    Can you elaborate on this? and how it would be more profitable than 64/56CU configurations
     
  19. pharma

    Veteran

    Joined:
    Mar 29, 2004
    Messages:
    4,887
    Likes Received:
    4,534
    You also have to remember that Nvidia brought RT to the market to be used with DLSS. Performance with DLSS 2.1 and ease of implementation has improved the RT experience considerably.
     
    PSman1700 and Rootax like this.
  20. manux

    Veteran

    Joined:
    Sep 7, 2002
    Messages:
    3,034
    Likes Received:
    2,276
    Location:
    Self Imposed Exhile
    Difficult to believe this statement. For example blender in benchmark below is 3080 11.9s versus 2080ti 19.9s or 2080 super 26.1s. Pure ray tracing perf in ampere is much improved. Also minecraft or quake2 rtx would show how pure raytracing perf has advanced. I'm not 100% sure where 6800xt falls, but I believe it's somewhere around 38s for same bmw blender benchmark.

    https://www.phoronix.com/scan.php?page=article&item=blender-290-rtx3080&num=3
     
    #1260 manux, Nov 21, 2020
    Last edited: Nov 21, 2020
    tinokun, DegustatoR, Cuthalu and 6 others like this.
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...