AMD CDNA Discussion Thread

Discussion in 'Architecture and Products' started by Frenetic Pony, Nov 16, 2020.

  1. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,244
    Likes Received:
    4,465
    Location:
    Finland
    CDNA2 does supposedly Full rate FP64 and packed FP32 (so some FP32 can run at twice the speed, but not all) and doubles the CU count compared to MI100 (due both chiplets having 128 CU like MI100).
    AMD also confirmed 128 GB of HBM2e already.
     
    Kidra, Lightman and no-X like this.
  2. xpea

    Regular

    Joined:
    Jun 4, 2013
    Messages:
    551
    Likes Received:
    783
    Location:
    EU-China
    No one denies the importance of these deals. As I said, it's well done by AMD. But for now they only appear as one time politically-driven opportunity. You need more than that to flip the market. The proof is that one year later, US government will put online a Grace-Hopper supercomputer in Los Alamos lab:
    https://www.lanl.gov/discover/news-release-archive/2021/April/0412-nvidia.php
    And 3 more government deals in the USA for Grace-Hopper supercomputers are nearly closed (announcement soon). Same for CSCS in Switzerland that won't use AMD but Grace-Hopper:
    https://www.cscs.ch/science/compute...orlds-most-powerful-ai-capable-supercomputer/
    Why go Nvidia in 2023 if AMD will rule it all as the one liner is suggesting ? The answer is simple. Nvidia will still be highly competitive and CUDA won't die soon.

    Yes Nvidia is pushing hard in academia and it works as numerous startups are betting on CUDA. For AMD to succeed, great hardware is not enough. You must provide a commercial path after academia, otherwise no one will waste their time on unsupported hardware and/or without a widely accepted ecosystem. I said it many times, NVIDIA is a software company first and AMD must quadruple their software effort to get a chance of changing the market...
     
  3. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    LARP again, not funny.
    Oh should I count all the MI300 systems or nah yet?
    That's gotta be a loooooong laundry list of stuff.
    Yeah they make even better one.
     
    #103 Bondrewd, Jul 31, 2021
    Last edited: Jul 31, 2021
  4. troyan

    Regular

    Joined:
    Sep 1, 2015
    Messages:
    603
    Likes Received:
    1,123
    DavidGraham, DegustatoR and pharma like this.
  5. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Bait again; quoting GEMM numbers again (dawg they explicitly banned GEMM acc for Top500 HPL, see Perlmutter going /2 in rmax).

    Is this circlestrafing or what
     
  6. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    How so? Perlmutter Phase 1 is 1536 nodes of 4x A100 and 1x Epyc 7763.
     
    pharma likes this.
  7. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    [​IMG]
    >120PF target went POOF.
    The GPU partition aka where HPL bang-bang comes from is here and it's only 90PF.
     
    #107 Bondrewd, Jul 31, 2021
    Last edited: Jul 31, 2021
  8. pharma

    Veteran

    Joined:
    Mar 29, 2004
    Messages:
    4,887
    Likes Received:
    4,534
    Perlmutter Debuts in the Top 5 of the Top500 (nersc.gov)
    June 29, 2021
     
    Lightman likes this.
  9. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    And how is it halved going to RPeak to 90?
    How is A100 with 6144 x 9.7 TFLOPS FP64 and 7763 with 1536 x 2.5-ish TFLOPS with a combined theoretical peak of 63.4 PFLOPS reaching an RMax of 64.6 PFLOPS?

    How, if not including Tensor math? I don't get it.

    edit:
    And how did your source not know about this half a year after A100's launch?
     
    #109 CarstenS, Jul 31, 2021
    Last edited: Jul 31, 2021
  10. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Cuz the planned one was close to Summit. Duh.
    List parts aren't the HPC ones here.
    Gotta pamp some watts in.

    You see, Rpeak for 400W A100 at 6k GPUs alone is >120PF.
     
    #110 Bondrewd, Jul 31, 2021
    Last edited: Jul 31, 2021
  11. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Makes total sense to lowball to ">120" when I really mean >180. While others are down to a decimal digit... *doh*
    LOL - still it's RPeak vs. RMax. Even if i give around 20% more power/more TFLOPs just for shits and giggles, this is a ridiculously high efficiency.

    Speaking of which: Why do they say on your sheet it's 6 MW when on it's RMax-run it was only 2.5? Axed some nodes?
     
    DavidGraham, pharma and xpea like this.
  12. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Major hopium it was, yeah.
    Just like Aurora.
    YES.
    A100 is unironically more efficient when you pump more watts into it.
    Yep, CPU ones aren't here.
    Nor they're good at HPL pumping anyway.
     
  13. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Nah, that'd be if they wrote >180 TFLOPS. "<120" is just lowballing.

    In large scale clusters - of course. Losses through networking diminish.
    But I'm talking Blue-Gene league of efficiency of 85% RMax of RPeak. Doesn't work that way except for maybe very small clusters.

    Altogether around 19.2 TFLOPS RPeak - yes, not much compared to Accelerators, but 20% of the whole thing nevertheless.

    edit:
    So, to conclude: I don't see, why you should not use your compute resources for HPL. I don't see any proof, whether or not Top500 has explicitly banned GEMM Engines from their ranking. Can we please get back to the topic of CDNA/CDNA2ß
    .
     
  14. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    Oh come on it's the same slide that lists Aurora power for <60MW.
    Yeah but the GPU numbers are still off and we're still under target.
     
  15. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    We're not debating Aurora. Or did you just discredit your own source?
    Ever occured to you, that this slide of yours might be incorrect in more than one places?

    You know, I'm asking because your slide says it's dated November 20th, 2020, right?
    And then there's this PDF here, dated June 2020, 5 months earlier and just after A100 launch:
    https://www.energy.gov/sites/default/files/2020/06/f75/fy-2021-sc-ascr-cong-budget.pdf

    Funnily enough, it says on p.23:
    "...and begin operations of the 75 petaflop NERSC-9 system, named Perlmutter after LBNL Nobel Laureate Saul Perlmutter."
     
    #115 CarstenS, Jul 31, 2021
    Last edited: Jul 31, 2021
    pharma likes this.
  16. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
  17. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
  18. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,244
    Likes Received:
    4,465
    Location:
    Finland
  19. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    For revenue or for bring-up? I'm asking because, you know, Intels 10nm products have been "shipping to customers since 2017" (maybe they used a sailboat for that and had strong headwind). So, there clearly is a difference between shipping (samples for qualifications and bring-up) and shipping (for actual market introduction).

    Maybe AMD just replaced SPOCK systems at ORNL with some real ones for now.
     
    #119 CarstenS, Aug 7, 2021
    Last edited: Aug 7, 2021
  20. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    The former.
    Ehhhh yea but also no...
    Also can be Polaris shipment.
    Dunno.
    SC'21 to be pricise.
    I may even be there in-person just for kicks.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...