Ext3h
Regular
4. Zero coverage culling, especially after tesselation1. Backface culling : Solved
2. View volume culling : Solved
3. Hidden surface culling : Solved in TBDR
Depends on which type of stall you refer to, for the generic stall on memory transactions: Batched prefetch / read ahead and explicit use of the LDS.Ignorant question of the day: on today's non-Polaris GPUs, how are pipeline stalls visible to the programmer and how does one typically have to deal with it?
Preferably not done explicitly in software, but rather implicitly by the hardware itself.
You can stall not only on memory transactions, but (especially for larger shader programs) also on the instruction cache. Which is much harder to account for, since you don't have explicit control over the corresponding cache.