So, after staring at this for a while, it seems like G80 has 128 scalar shader processors which do 1 MADD and 1 MUL per clock. And then each shader processor can also do a special op every 4 clock cycles (sin, cos, log, exp, ?).
R600 seems to have 64 5-way superscalar shader processors that...