For Turing? Like Xavier Int8 because Int8 is enough for inference. You don't need high accuracy for inference, Ampere as next gen training architecture will get higher precision cores as Volta is using. These are 2 different workloads with 2 different requirements.
We are speculating here that Turing is the next gaming GPU architecture. (Ampere already more or less has been declared a false/changed rumour).
Say Turing is the next generation of Pascal, it doesn't need hardware for NN training, just inferencing.
What will be called the next generation (next Volta) of HPC GPU (ie NN training/ double precision/ HMB2/NV link etc), we don't know or have not speculated.
It might also be called Turing or something else.
if you think the next generation of Volta HPC GPU will be called Ampere, how do you get to this and why do you think it needs higher precision training hardware (Volta is FP16/FP32 mixed precision) ?
Last edited: