Since Kepler and Maxwell already have instruction-level preemption, there's likely something more to it. Perhaps it's just a software change that now exposes it to the graphics driver, not just CUDA? Or, thinking about it, it's probably the ability to reorder the kernel execution stack instead of just doing a push/pop like Kepler, which would explain the "real-time" example. http://www.theregister.co.uk/2016/04/06/nvidia_gtc_2016/
Specifically, they say: "Software running on the P100 can be preempted on instruction boundaries, rather than at the end of a draw call."