Magnum_Force
Newcomer
Or to break it down some more:
3870 = 4 arrays of 16 shaders totaling 64 shaders, each shader with 4+1 MADDs, or 80 MADDs per array = 320 SPs total
4870 = 5 arrays of 32 shaders totaling 160 shaders, each shader with 4+1 MADDs, or 160 MADDs per array = 800 SPs total
With G92 you'd be talking:
128 shaders, each shader having 2+1 (MADD+MUL) = 384 units.
Of course, that's in layman's terms and doesn't take into account that the shader clock speed is higher (~double) on an Nvidia part, or that the MUL is not used in general shading but rather for special functions, which (IIRC) ATi does with their MADDs. I would say look at G92 as 256 MADDs, but that wouldn't give it fair credit for the work the MUL is doing, or for the fact that it's used in CUDA (and perhaps will be used for PhysX).
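For anyone who wants to check the unit counts above, here is a rough Python sketch. The array and shader figures are the ones quoted in this post (the 4870 numbers are speculation, not confirmed specs), and the function name `total_units` is just made up for illustration.

```python
# Unit-count arithmetic from the post above.
# All inputs are the poster's figures, not confirmed hardware specs.

def total_units(arrays, shaders_per_array, units_per_shader):
    """Return the total number of ALU units across all arrays."""
    return arrays * shaders_per_array * units_per_shader

# 3870: 4 arrays x 16 shaders, each shader counted as 4+1 = 5 units
units_3870 = total_units(4, 16, 4 + 1)

# 4870 (speculated): 5 arrays x 32 shaders, each shader counted as 4+1 = 5 units
units_4870 = total_units(5, 32, 4 + 1)

# G92: 128 shaders treated as a single array, each counted as 2+1 (MADD+MUL)
units_g92 = total_units(1, 128, 2 + 1)

print(units_3870, units_4870, units_g92)
```

This only counts units. It says nothing about clock speed or how often the MUL is actually usable, which is the caveat raised above.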
At any rate, it's a serious boost.
So 5 arrays... are we thinking 20 ROPs?
Just off-topic(ish) for a second...
I always thought that G80/G92 did 1 MADD (plus a MUL) per stream processor, not 2?
I worked out in another thread that RV670 at 775 MHz x 320 stream processors puts out 248000 a second (I don't know what you'd call the number: shader calcs per second, times a million for the MHz conversion?).
I figured G92 for only 1 calc per stream processor, so that puts it at: 1650 MHz (8800 GTS 512 MB shader clock) x 128 = 211200.
If it does 2 shader calcs per SP, that is 422400!
That's almost double RV670. That can't be right, can it?
If it is, then what is the bottleneck of the architecture? The ROPs? Bandwidth?
It surely isn't the TMUs.
I'm most likely wrong on this, so can someone please explain it to a simpleton like myself?
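The clock-times-unit-count figures in this post can be reproduced with a short Python sketch. The clocks and stream processor counts are the ones quoted above; the function name and the `calcs_per_sp` parameter are assumptions added for illustration. Because the clock is entered in MHz, the result is in millions of calcs per second.

```python
# Rough throughput arithmetic from the post above.
# Result is in millions of shader calcs per second, since the clock is in MHz.

def shader_calcs_millions(clock_mhz, stream_processors, calcs_per_sp=1):
    """Clock (MHz) x stream processors x calcs per processor per clock."""
    return clock_mhz * stream_processors * calcs_per_sp

# RV670 at 775 MHz with 320 stream processors
rv670 = shader_calcs_millions(775, 320)

# G92 (8800 GTS 512 MB) at 1650 MHz shader clock with 128 stream processors
g92_one_calc = shader_calcs_millions(1650, 128, calcs_per_sp=1)
g92_two_calcs = shader_calcs_millions(1650, 128, calcs_per_sp=2)

# How the 2-calc G92 figure compares with RV670
ratio = g92_two_calcs / rv670

print(rv670, g92_one_calc, g92_two_calcs, round(ratio, 2))
```

On these numbers the 2-calc G92 figure comes out at roughly 1.7x the RV670 figure. This is peak arithmetic only and assumes every unit does useful work every clock, so it doesn't answer the bottleneck question by itself.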