We have. Initial version, 128 queues, GCN crashed, Kepler/Maxwell basically serialized all of them. Which I'd say is what happens with graphics + compute queue anyway.We also have not tested a multi-queue state, and we do not have ready visibility on what queues are actually being exercised relative to the API-visible queues.
I think NV drivers are just not there yet. That said there's one more thing we can try. D3D12 for graphics + CUDA instead of D3D12 compute shaders. So this version is now NV only.