I don't currently have the hardware to see what's going on, but I noticed that the Shark library for ML, which had some publicity for a bit for accelerating Stable Diffusion on AMD hardware, was supposed to support WMMA on RDNA3 according to AMD. Which is strange because it compiles to Vulkan/SpirV and there's supposedly no WMMA support in there (only through various ways in RocM).
The Shark library seems to think the RDNA3 driver supports the VK_NV_cooperative_matrix extension but the driver doesn't seem to advertise it ... what's going on here?
The Shark library seems to think the RDNA3 driver supports the VK_NV_cooperative_matrix extension but the driver doesn't seem to advertise it ... what's going on here?