Jawed
Legend
I think this comes back to MfA's desire for lockable lines, e.g. wanting to keep atomic variables in-cache. But obviously there's a scalability problem there, too, as the count of concurrent atomics balloons (e.g. millions). Sorting the domain so that all the atomics come packed together in time and space, like Larrabee's tiled forward rendering, works around that scaling problem.

These hints appear to be orthogonal to what would be needed for an atomic operation. Hints don't prevent some inopportune memory traffic pattern from interfering.
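To make the "sort the domain" idea concrete, here is a rough single-threaded sketch in C++. It is only an illustration under my own assumptions, not how Larrabee or any real hardware does it: the `Update` record, the `apply_binned` function and the `tile_size` parameter are all hypothetical names. Updates scattered across a large counter array are first binned by tile, then each tile is processed in one go against a small local buffer, so the working set at any moment is one tile rather than the whole array.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical update record: add `value` to counter number `index`.
struct Update {
    std::uint32_t index;
    std::uint32_t value;
};

// Apply all updates tile by tile. `tile_size` is an assumed number of
// counters small enough to stay resident in cache while a tile is processed.
// Every update's index is assumed to be less than `counter_count`.
std::vector<std::uint32_t> apply_binned(const std::vector<Update>& updates,
                                        std::size_t counter_count,
                                        std::size_t tile_size) {
    assert(tile_size > 0);
    const std::size_t tile_count = (counter_count + tile_size - 1) / tile_size;

    // Pass 1: sort the domain by binning each update into the tile it touches.
    std::vector<std::vector<Update>> bins(tile_count);
    for (const Update& u : updates) {
        assert(u.index < counter_count);
        bins[u.index / tile_size].push_back(u);
    }

    // Pass 2: handle one tile at a time, so all updates to nearby counters
    // are packed together in time and space.
    std::vector<std::uint32_t> counters(counter_count, 0);
    std::vector<std::uint32_t> local(tile_size);
    for (std::size_t t = 0; t < tile_count; ++t) {
        const std::size_t base = t * tile_size;
        std::fill(local.begin(), local.end(), 0u);

        // Accumulate into the small local buffer for this tile.
        for (const Update& u : bins[t]) {
            local[u.index - base] += u.value;
        }

        // Write the finished tile back to the full counter array once.
        const std::size_t end = std::min(base + tile_size, counter_count);
        for (std::size_t i = base; i < end; ++i) {
            counters[i] = local[i - base];
        }
    }
    return counters;
}
```

In a parallel version one would presumably hand each tile to a single core, so the accumulation inside a tile needs no cross-core atomics at all. The cost is the extra binning pass and the storage for the bins, which is the trade the tiled approach makes.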
Jawed