SIMD & Wave execution:
GCN: CU has 4 x SIMD16; a Wave64 executes on one SIMD16 over 4 cycles.
RDNA: CU has 2 x SIMD32; a Wave32 executes on one SIMD32 in 1 cycle.
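
The cycle counts above follow from dividing the wave size by the SIMD width. A minimal sketch of that arithmetic, assuming one pass per cycle and ignoring any other issue latency; `issue_cycles` is a hypothetical helper name used only for illustration, not part of any AMD tool or API:

```python
def issue_cycles(wave_size: int, simd_width: int) -> int:
    """Passes needed to push one wave through a SIMD of the given width."""
    # Ceiling division: a wave wider than the SIMD takes several passes.
    return -(-wave_size // simd_width)


# Figures taken from the notes above.
gcn_cycles = issue_cycles(wave_size=64, simd_width=16)   # Wave64 on SIMD16
rdna_cycles = issue_cycles(wave_size=32, simd_width=32)  # Wave32 on SIMD32

# Total ALU lanes per CU: number of SIMDs times SIMD width.
gcn_lanes_per_cu = 4 * 16
rdna_lanes_per_cu = 2 * 32

print("GCN cycles per wave instruction:", gcn_cycles)
print("RDNA cycles per wave instruction:", rdna_cycles)
print("Lanes per CU (GCN, RDNA):", gcn_lanes_per_cu, rdna_lanes_per_cu)
```

Both layouts give the same number of lanes per CU; the difference is that the narrower wave on the wider SIMD completes an instruction in a single pass.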
LDS:
GCN: 10 Wave64 on each SIMD16, so 4 x 10 x 64 = 2560 threads per CU. These 2560 threads (1 CU) share 64 KB of LDS.
RDNA: 20 Wave32 on each SIMD32, so 2 x 20 x 32 = 1280 threads per CU. 2560...