nAo, their flop count also includes both the 4-way MAD and the 2 flops from the SFU/scalar op in the mini-FPU, which they can't schedule simultaneously because the unit is only 4D. Unless maybe they are counting bias/scale, which is just more non-programmable flops. So I'm only coming up with 16 programmable flops per PS ALU combo.
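
One way to arrive at the 16 figure, as a rough sketch only: it assumes a "PS ALU combo" means two 4-wide MAD units and that a MAD counts as 2 flops per lane (multiply plus add). The two-unit reading and all names below are assumptions for illustration, not something the post states outright.

```python
# Rough flop accounting for one pixel-shader "ALU combo".
# ASSUMPTIONS (hypothetical, for illustration only):
#   - a combo is two ALUs, each 4 lanes wide ("the unit is only 4D")
#   - a MAD counts as 2 flops per lane (one multiply, one add)
#   - the mini-FPU's SFU/scalar flops are left out because they cannot
#     issue in the same slot as the 4-way MAD

LANES_PER_ALU = 4    # 4D unit
FLOPS_PER_MAD = 2    # multiply + add
ALUS_PER_COMBO = 2   # assumed meaning of "PS ALU combo"


def programmable_flops_per_combo(lanes=LANES_PER_ALU,
                                 flops_per_mad=FLOPS_PER_MAD,
                                 alus=ALUS_PER_COMBO):
    """Count only the flops a shader program can actually issue per clock."""
    return lanes * flops_per_mad * alus


if __name__ == "__main__":
    print(programmable_flops_per_combo())
```

Any bias/scale or other fixed-function flops would sit on top of this number, which is why a quoted count can come out higher than the programmable one.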