AMD: RDNA 3 Speculation, Rumours and Discussion

Discussion in 'Architecture and Products' started by Jawed, Oct 28, 2020.

Tags:
  1. Well if there's a 160CU ~50 TFLOPs part coming then I can't imagine it being cheap.
    Especially considering it's coming right after a crypto craze.

    Here's hoping the slightly cut-down version with 2×60 or 2×72CUs activated don't break the bank.
     
  2. dskneo

    Regular

    Joined:
    Jul 25, 2005
    Messages:
    816
    Likes Received:
    298
    A 160CU RDNA GPU does sound surreal. But we now have two precedents. The ryzen model in the CPU space, and the new pricing paradigm of 2021, which demonstrates that you could make a halo procut for 3000€ and still be able to sell it.

    If anything, the most viable argument to justify how such a GPU will not be made, is the memory bandwidth requirement to allow for linear scalling across the board.
     
  3. RDNA2's memory bandwidth requirements are relatively low because of Infinity Cache. Assuming there's a third I/O chip (large but cheap 12nm GlobalFoundries ASIC?) that takes in most of the non-compute related parts like PCIe, display output, video codecs, GDDR6 PHYs, etc. then at 5nm the main GPU chips could even increase the relative amount of L3.
    I wouldn't be surprised if 160CU Navi 31 got away with a 384 or 320bit wide GDDR6 bus, especially if it uses 18Gbps chips.


    They could also adopt HBM3, where the lower chip in the stack could be that same I/O chip along with die-to-die communication, which would forego the necessity of an interposer. 2x HBM3 stacks at the predicted 512GB/s each would provide 1TB/s total external bandwidth.
     
    #303 Deleted member 13524, May 24, 2021
    Last edited by a moderator: May 24, 2021
  4. dskneo

    Regular

    Joined:
    Jul 25, 2005
    Messages:
    816
    Likes Received:
    298
    Its precisely because of Infinity Cache that you know the memory requirements are very high. I dont see how just doubling the infinity cache size would provide linear scalling in the current RDNA2 gpu, let alone in a 160CU monster.

    I believe the 160CU is possible. But i'm more intrigued by the memory solution to feed it.
     
    Krteq likes this.
  5. Latest patent seems to indicate the PHYs to be located on the Chiplet, and the Bridge chiplet is hosting the L3 and acts as interconnect


    upload_2021-5-24_13-14-56.png

    Also it looks like SoIC so they would need to be codesigned and therefore cannot use GF 12nm for example.
    I think they are doing it mainly for yield
     
    Jawed and Deleted member 13524 like this.
  6. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Arguably not the hard part here.
    Packaging is.
    Granted, technically N31 has two parents (MI200 and the shortcake) for trial duties, but still.
    MCD is N6.
    2022 AMD is nearly exclusively a TSMC shop.
    That one is pretty away.
     
  7. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    The ultimate fancy in all things fancy, yes.
    Lower end parts like N33 are also real shit tbh, and they're still single die and N6.
     
  8. madhatter

    Newcomer

    Joined:
    Jul 23, 2020
    Messages:
    32
    Likes Received:
    25
    What's that? Is it a CPU?
    I'm curious about N33, not sure about some of the current rumors. It being N6 is to be expected, but is it really an 80 CU part?
     
  9. Globalisateur

    Globalisateur Globby
    Veteran Subscriber

    Joined:
    Nov 6, 2013
    Messages:
    4,592
    Likes Received:
    3,411
    Location:
    France
    Looks like RDNA2 GPUs with their infinite cache (L3) was actually a prototype GPU for RDNA3 chiplet + L3 design. Logically those chiplet GPUs could have same functionning as RDNA2 + L3 GPUs.
     
  10. PSman1700

    Legend

    Joined:
    Mar 22, 2019
    Messages:
    7,118
    Likes Received:
    3,090
    Well, lol. Ray tracing hardware does absolutely nothing for RT, except for speed. You can run RT on pascal hardware (or any hardware) aswell. Its just that, speed is the differerinator.
    RDNA3 gpus probably will perform much better in all tasks as opposed to RDNA2 and earlier ones.

    Infinity Cache is a really good addition to the RDNA2 gpus, without it would be much less performant.
     
    Remij likes this.
  11. dskneo

    Regular

    Joined:
    Jul 25, 2005
    Messages:
    816
    Likes Received:
    298
    That is what I said.
     
    Globalisateur and PSman1700 like this.
  12. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Yes.
    Seems so.
    Granted, 2022 GPU mobility stuff from AMD is currently tbd thus can't answer which segment and how much ass does it kick.
     
    madhatter likes this.
  13. Aldebaran should launch along side Trento within a quarter so we will find out.
    Both of them have exotic packaging, should provide a good base for RDNA3
     
  14. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Trento is actually very simple purpose-built stick (Milan + IF3).
     
    Deleted member 90741 likes this.
  15. MCD for Navi 31 is N6, that's the Memory Complex Die, the chiplet with L3 that guarantees coherency between the Graphics Complex Dies (the ones with the CUs).
    Unless @Bondrewd confirms the GCDs are also N6, I'm assuming those are N5.

    I also don't think @Bondrewd suggested N33 is a 80CU part. Considering the power and performance requirements of the previous Navi x3 parts, as well as it being a N6 monolithic chip, I think it's more likely that Navi 33 is closer to 40CUs.

    Are we looking at the MCD being stacked on top / below the GCDs then?
    Or is it just using an incredibly dense substrate? Or even an interposer?
     
  16. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Compute tiles are that, yes.
    N5p, really.
    The latter; now imagine the poor thing going thru thermal cycles with many many watts radiating from thingies above it.
     
    Deleted member 13524 likes this.
  17. iroboto

    iroboto Daft Funk
    Legend Subscriber

    Joined:
    Mar 6, 2014
    Messages:
    14,833
    Likes Received:
    18,633
    Location:
    The North
    Hard for me to believe we could feed such a wide GPU without it. You sure HBM3 isn’t in the cards for consumers somewhat within the next 5-7 years? We have to be hitting an inflection point soon.
     
    BRiT likes this.
  18. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    AMD went from Vega10 to Navi21 perf at nearly the same offchip bandwidth.
    Maybe but 2022/2023 are still HBM2e milking.
     
  19. iroboto

    iroboto Daft Funk
    Legend Subscriber

    Joined:
    Mar 6, 2014
    Messages:
    14,833
    Likes Received:
    18,633
    Location:
    The North
    Imo; that just signals that another bottleneck was present that failed to allow compute and to saturate the amount of bandwidth available; thus bandwidth was negligible in its performance.

    I’m not confident that 160CUs would be able to operate the same way. Increasing L3 cache has its limitations. You will need to hit slow ram eventually. And if all 160CUs want to hit slow ram often, bandwidth needs to be part of that equation
     
    BRiT likes this.
  20. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    You're really not supposed to.
    Those things are brand spanking new and no one really did GPUs like that before.
    Honestly, yes, we need another huge LLC GPU in the wild (Ponte Vecchio) to see and compare.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...