Deleted member 13524
Guest
Which is part of my point, the packed FP16 is possibly only going to be available for Vega and SoCs-consoles.
As mentioned above, GCN3 and up already pack 2*FP16 for load/store, which saves bandwidth and latency. This means Tonga, Fiji, Polaris 10 and Polaris 11 already do it.
I think you're confusing 2*FP16 packing with processing FP16 at twice the rate of FP32 in the same ALUs. The latter is the feature that is present in the PS4 Pro and probably in Vega (and in TX1 + GP100 on NVIDIA's side).
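To make the distinction concrete, here is a rough Python sketch of the "packing only" case: two FP16 values share one 32-bit word for storage, but the math is still done one lane at a time at full width. The function names (`pack_half2`, `unpack_half2`, `fma_on_packed`) are made up for illustration, and this is only a model of the idea, not how any particular GPU implements it.

```python
import struct
from typing import Tuple


def pack_half2(a: float, b: float) -> int:
    """Store two values as FP16 in one 32-bit word (a in the low 16 bits)."""
    return struct.unpack("<I", struct.pack("<2e", a, b))[0]


def unpack_half2(word: int) -> Tuple[float, float]:
    """Split a 32-bit word back into its two FP16 lanes."""
    return struct.unpack("<2e", struct.pack("<I", word))


def fma_on_packed(wx: int, wy: int, wz: int) -> int:
    """Packing-only model: operands are stored packed, but each lane is
    unpacked and computed separately, so two operations are still issued.
    Double-rate FP16 hardware would do both lanes in a single operation."""
    x0, x1 = unpack_half2(wx)
    y0, y1 = unpack_half2(wy)
    z0, z1 = unpack_half2(wz)
    r0 = x0 * y0 + z0  # lane 0, computed at full width
    r1 = x1 * y1 + z1  # lane 1, a second, separate operation
    return pack_half2(r0, r1)  # rounded back to FP16 for storage


x = pack_half2(1.5, 2.0)
y = pack_half2(2.0, 0.5)
z = pack_half2(0.25, 1.0)
print(hex(pack_half2(1.0, 2.0)))
print(unpack_half2(fma_on_packed(x, y, z)))
```

The storage side (half the register/memory footprint per value) is what the packing buys you. The two separate multiply-adds inside `fma_on_packed` are the part that double-rate FP16 would collapse into one.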