AMD CDNA Discussion Thread

Discussion in 'Architecture and Products' started by Frenetic Pony, Nov 16, 2020.

  1. Granath

    Granath Newcomer

  2. pTmdfx

    pTmdfx Regular

    https://github.com/llvm/llvm-project/commit/47bac63d3f6b9e64fdf997aff1f145bc948f02d9
    It seems like cache coherence in GFX940 (CDNA 3?) is achieved by:

    * memory local to L2: reads & writes are cacheable
    * remote memory (other L2s or CPU): reads are uncached; writes are write through and send an invalidation to the home L2.
    * Each L2 keeps a probe filter for CPU cached lines from its local memory, and forces CPU invalidation or writeback as appropriate.

    It appears that GFX940 no longer has a unified L2 cache shared by all CUs. It is configurable:

    1. from many smaller agents/virtual devices, each having their own private L2 cache;

    2. to one single agent having multiple L2 caches.

    Each L2 owns a disjoint(?) region of the device memory now, while they appear to still have internal interleaved “channel” partitions.

    Makes perfect sense in multi-agent mode where each small agent gets a fixed contiguous region, which can be owned outright by a single L2. But I am not sure how more “monolithic” configurations where one agent sees multiple standalone L2s would work effectively. Page level interleaving, eh?
     
    Last edited: Mar 31, 2022
    pharma, T2098, Lightman and 1 other person like this.
  3. Granath

    Granath Newcomer

  4. xpea

    xpea Regular

    A100 compared to MI250 in HPC workloads:
     
    Lightman, Qesa, Jensen Krage and 2 others like this.
  5. CarstenS

    CarstenS Legend Subscriber

    Does that "MI250X/1" indicate, that it's running on half a MI250X, i.e. only on one of the two devices?
     
    Lightman likes this.
  6. trinibwoy

    trinibwoy Meh Legend

    Footnote in the first pic says "single GPU of Crusher/MI250X".
     
    Lightman likes this.
  7. Krteq

    Krteq Newcomer

    Yes, same as for SVD. They clearly stated "single GPU" in figure description and "MI250X/1" in graphs legends and "Single GCD" for SVD figure desc.

    Also, some comparisons are for V100 vs MI100 with old ROCm etc.
     
  8. pharma

    pharma Veteran

    Last edited: May 3, 2022
  9. Granath

    Granath Newcomer

     
    Lightman and Krteq like this.
Loading...

Share This Page

Loading...