AMD CDNA Discussion Thread

Discussion in 'Architecture and Products' started by Frenetic Pony, Nov 16, 2020.

  1. pTmdfx

    Regular

    Joined:
    May 27, 2014
    Messages:
    415
    Likes Received:
    379
    https://github.com/llvm/llvm-project/commit/47bac63d3f6b9e64fdf997aff1f145bc948f02d9
    It seems like cache coherence in GFX940 (CDNA 3?) is achieved by:

    * memory local to L2: reads & writes are cacheable
    * remote memory (other L2s or CPU): reads are uncached; writes are write through and send an invalidation to the home L2.
    * Each L2 keeps a probe filter for CPU cached lines from its local memory, and forces CPU invalidation or writeback as appropriate.

    It appears that GFX940 no longer has a unified L2 cache shared by all CUs. It is configurable:

    1. from many smaller agents/virtual devices, each having their own private L2 cache;

    2. to one single agent having multiple L2 caches.

    Each L2 owns a disjoint(?) region of the device memory now, while they appear to still have internal interleaved “channel” partitions.

    Makes perfect sense in multi-agent mode where each small agent gets a fixed contiguous region, which can be owned outright by a single L2. But I am not sure how more “monolithic” configurations where one agent sees multiple standalone L2s would work effectively. Page level interleaving, eh?
     
    #402 pTmdfx, Mar 31, 2022
    Last edited: Mar 31, 2022
    pharma, T2098, Lightman and 1 other person like this.
  2. xpea

    Regular

    Joined:
    Jun 4, 2013
    Messages:
    551
    Likes Received:
    783
    Location:
    EU-China
    A100 compared to MI250 in HPC workloads:
     
    Lightman, Qesa, Jensen Krage and 2 others like this.
  3. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Does that "MI250X/1" indicate, that it's running on half a MI250X, i.e. only on one of the two devices?
     
    Lightman likes this.
  4. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,112
    Location:
    New York
    Footnote in the first pic says "single GPU of Crusher/MI250X".
     
    Lightman likes this.
  5. Krteq

    Newcomer

    Joined:
    May 5, 2020
    Messages:
    148
    Likes Received:
    261
    Yes, same as for SVD. They clearly stated "single GPU" in figure description and "MI250X/1" in graphs legends and "Single GCD" for SVD figure desc.

    Also, some comparisons are for V100 vs MI100 with old ROCm etc.
     
  6. pharma

    Veteran

    Joined:
    Mar 29, 2004
    Messages:
    4,887
    Likes Received:
    4,534
    #408 pharma, May 3, 2022
    Last edited: May 3, 2022
  7. Granath

    Newcomer

    Joined:
    Jul 26, 2021
    Messages:
    80
    Likes Received:
    82
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...