Luminescent
Veteran
Here is a post of mine from around June which extrapolates the whole two shaders per clock shabang from the results obtained by MDolnec's benchmark on AnteP's test system, a 5800 and 5900.DaveBauman said:Luminescent, have you seen any evidence that actually backs up that it has two FP shaders?
NV35:
Maximum fillrate=1772.702026M pixels/sec
Per-pixel fillrate=105.561607M pixels/sec (at fp16 for the sake of neglecting fp32's register overhead)
Maximum fillrate/per-pixel fillrate=~16.79
Considering there are 21 instructions:
21/16.79=~1.25 instructions/cycle per pipeline
Now, compare this to NV30:
Maximum fillrate=1957.946899M pixels/sec
Per-pixel fillrate=67.032890M pixels/sec
(at fp16 for the sake of neglecting fp32's register overhead)
Maximum fillrate/per-pixel fillrate=~29.209
Considering there are 21 instructions:
21/29.209=~.7189 instructions/cycle per pipeline
Clock for clock, the improvement between NV35 and NV30 in number of instructions executed per clock, for the per pixel shader test is:
1.25*1.10/.7189=1.91 or almost 91%, which is close a two-fold improvement.