AMD: Navi Speculation, Rumours and Discussion [2019-2020]

Discussion in 'Architecture and Products' started by Kaotik, Jan 2, 2019.

Thread Status:
Not open for further replies.
  1. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    I disagree about the 4K thing, but let's leave it at and not derail here any further.
     
  2. Erinyes

    Regular

    Joined:
    Mar 25, 2010
    Messages:
    808
    Likes Received:
    276
    I don't think the cache would require games to be programmed a different way. If that was the case Navi 2x would not perform well on older games at all, which dosen't seem to be the case from the leaks/benchmarks we have seen so far. Just the fact that the console games would be optimized for the RDNA2 architecture should help performance in future games. The purpose of the cache is perhaps more to increase performance/W? And which also allows them to reduce external memory bandwidth and costs. The die cost for the cache may have been too large on the 7nm process for the consoles, which are already pushing the envelope in terms of new tech and costs. RDNA2 and Zen 2 by themselves are large enough of a leap in terms of performance and performance/W compared to Jaguar and GCN.
     
  3. LordEC911

    Regular

    Joined:
    Nov 25, 2007
    Messages:
    877
    Likes Received:
    208
    Location:
    'Zona
    Why wouldn't it be reliant on RDNA2's memory subsystem or HBCC like unit to realize that it has available cache and can keep the data closer?
    I realize that modern APIs have given more low level access to devs but after all the features that AMD tries to add that are never fully supported, I would expect them to not make that same mistake.
     
  4. kresek

    Newcomer

    Joined:
    Apr 4, 2007
    Messages:
    14
    Likes Received:
    4
    The patch seems to disable both the RBs and the CUs for corresponding shader arrays marked as being disabled.
     
  5. itsmydamnation

    Veteran

    Joined:
    Apr 29, 2007
    Messages:
    1,349
    Likes Received:
    470
    Location:
    Australia
    bigly claims are bigly



    Im really not sure on the rumoured die sizes, a 40cu die taking 340mm when RDNA1 40cu GPU takes 250 and the consoles also taking about the same area as RDNA 1.
     
    Lightman likes this.
  6. DDH

    DDH
    Newcomer

    Joined:
    Jun 9, 2016
    Messages:
    36
    Likes Received:
    39
    If the rumoured size is accurate, has to be more. 64 perhaps
     
  7. Erinyes

    Regular

    Joined:
    Mar 25, 2010
    Messages:
    808
    Likes Received:
    276
    Higher IPC and clocks don't come for free obviously. If all these rumours are true, then they've exceeded their 50% perf/W target by a mile and the numbers teased at the Zen even are almost certainly from a 72CU or even 64CU SKU if we're being really optimistic. I'm still a bit skeptical, though I'd love to be pleasantly surprised. 3 more days to go anyway, we'll find out soon enough.
     
    Lightman likes this.
  8. SimBy

    Regular

    Joined:
    Jun 21, 2008
    Messages:
    700
    Likes Received:
    391
    Depends on how well it scales.
     
  9. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    He also added another 10% to the 6800xt scores to account for a new driver. it is a bit odd that all the leaks are for 3dmark. Maybe AMD did like Nvidia and whitelisted only certain apps for AIB drivers. There’s also talk of AIBs having artificially crippled drivers. Wednesday can’t come soon enough.
     
    Cuthalu and Lightman like this.
  10. Scott_Arm

    Legend

    Joined:
    Jun 16, 2004
    Messages:
    15,134
    Likes Received:
    7,680
    I have a stack of games waiting for a new video card, so this whole thing is making me very impatient. Lol
     
    Silent_Buddha, PSman1700 and Lightman like this.
  11. Leoneazzurro5

    Regular

    Joined:
    Aug 18, 2020
    Messages:
    335
    Likes Received:
    348
    This is not so clear, the way he words that seems to mean AMD supplied a new driver which brought a 10% increase in scores, and the scores he was showing were achieved with that driver.
     
    Lightman likes this.
  12. Erinyes

    Regular

    Joined:
    Mar 25, 2010
    Messages:
    808
    Likes Received:
    276
    Yea that's my understanding as well, that the driver brought an increase.
     
  13. Jubei

    Regular

    Joined:
    Dec 10, 2011
    Messages:
    559
    Likes Received:
    198
    So in RTGs latest video, he claims he was shown slides that compare gaming performance between 6800XT and 3080. 6800XT wins 5, loses 3 and ties in 2 out of 10 at 4K resolution. And wins 8/10 in 1440p

    I wonder if this is a controlled leak by AMD
     
  14. chris1515

    Legend

    Joined:
    Jul 24, 2005
    Messages:
    7,157
    Likes Received:
    7,966
    Location:
    Barcelona Spain
    He was principal engineer at Playstation for the PS5 and he seems to talk about NX gamer video about the rumored infinity cache

     
    Lightman and Jubei like this.
  15. SimBy

    Regular

    Joined:
    Jun 21, 2008
    Messages:
    700
    Likes Received:
    391
    It's interesting that the biggest change in both Zen3 and RDNA2 seems to be massively reworked cache.
     
    neckthrough and Lightman like this.
  16. Frenetic Pony

    Regular

    Joined:
    Nov 12, 2011
    Messages:
    808
    Likes Received:
    478
    Not totally surprised. Apple has been leading in mobile CPU performance for years now by concentrating on their cache hierarchy. Having idling compute resources waiting around means adding more silicon other than getting instructions to the idling hardware doesn't make a lot of sense, and modern archs generally rip through things while sram has utterly and completely failed to scale as well as logic.

    Makes me wonder whether AMD, Nvidia, (and... possibly Intel?) are lining up for Samsung's 3nm. They'll not only be years ahead of others if they stick to their schedule, but their stacked s-ram tech could cut die space by a hell of a lot, as sram is a ballooning on most dies versus logic. If they can get their yield numbers towards anything reasonable they'll be in a great position. After all TSMC's upcoming chiplet interconnects can technically connect dies from other foundries as well as their own.
     
  17. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    Yeah I’m using a translator which says:

    “In addition, it must also be added that AMD is said to have distributed another performance driver yesterday, which then brought the last 10% increase, which is already included in the above result.”

    I took the above result to mean his calculated numbers as that graph was right above the quote. But it could also mean it was included in the original percentages he got from the AIB.
     
  18. tsa1

    Newcomer

    Joined:
    Oct 8, 2020
    Messages:
    89
    Likes Received:
    97
    I don't really get Igor, why so much obfuscation. If he just named the figures in the ballpark of tens of thousands with granularity of 500, no one would guess his leak's source (unless AMD provides bioses with very specific power targets and v/f curves that can help them to root out the leaker AIBs). Also, it seems most of the people doing that have no clue about overall / graphic score and the fact that overall score basically says nothing if you don't know the specific CPU it was tested on. Anyways, only three days to go and then this madness will end (probably)
     
  19. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    I for one am very pleasantly surprised. Most of the GPU cache research I’ve seen seems to favor HPC. I always figured if big caches are useful for HPC and Nvidia still hasn’t bothered to invest there on its high margin HPC chips then we probably won’t see it on gaming cards either.

    My guess is if there is some kind of huge cache then AMD is pinning entire buffers in that cache to avoid thrashing. Probably via heuristics of which render targets & UAVs are accessed most frequently.

    I haven’t given up on the option that they simply went with a 512-bit bus though.
     
    Lightman likes this.
  20. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    I've seen patent documents talk about the problem of dealing with the overheads of small cachelines (e.g. 64 bytes) while using monster caches. The solution is based upon regions (from/to addresses). Which would map nicely to things like render targets and UAVs.

    256-bit MC + PHY is about 64mm².
     
    Lightman likes this.
Loading...
Thread Status:
Not open for further replies.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...