Glonk said:
IIRC, the EDRAM chip is 90M.
IMO that's too low. The memory array itself is at least 84M, perhaps some redundancy to improve yields adds even more. And there's quite a bit of logic packed on that chip, too.
ERP said:
Oh I'll agree with that.
All I'm saying is that MS's 256GB/s isn't some made up best case number based on variable compression, it's as real a number as any of them are. With 4xAA alpha blending and Z/Stencil compare/update that is the number you will get, which happens to match nicely the exact requirements of an 8 pipeline part with 32 bit
As long as there are no edge pixels involved, which would result in additional (external) bandwidth required.
The problem with such numbers, however, is that nowhere actually is this amount of data really transmitted. It's always some compressed representation. If you want to compare that figure, you have to compare it to other "effective bandwidth" figures. I mean, even an NV40 can do in excess of 2GPix/s in theoretical benchmarks with the mentioned settings (alpha blending, z test and update, 4xAA).
More than half of R500 with only 1/7 of the bandwidth?
btw, NV40's theoretical peak for that situation, leaving bandwidth out of the picture, is even beyond that of R500 (8 blended pixels per clock at 550MHz, ROPs running at mem clock)
DemoCoder, R500 presumably has the bandwidth (and ROPs) to output 4 quads with Z only, 2 quads with 32bpp color, or 1 quad with 64bpp color, all with 4xAA enabled. Half speed for FP16 rendering looks quite good to me.