Does anybody here know for certain how Fermi executes warp instructions? I'm not sure if I understand the white paper exactly, but is a warp of 32 threads really finished in two clock cycles by a group of 16 cores?
There is a difference between "raster rate" and "dual rasterizers" and you know thatThere is no "sorta" about it. There is 2x the raster rate there.