YeuEmMaiMai
Regular
OH I forgot where Carmack said that there are IQ trade offs for speed with the NV30 I guess you missed that part. JC was very carfull when he coded Doom III and he is doing his best to minimize the IQ has be has pointed out many times
"The R200 path has a slight speed advantage over the ARB2 path on the R300, but only by a small margin, so it defaults to using the ARB2 path for the quality improvements. The NV30 runs the ARB2 path MUCH slower than the NV30 path. Half the speed at the moment. This is unfortunate, because when you do an exact, apples-to-apples comparison using exactly the same API, the R300 looks twice as fast, but when you use the vendor-specific paths, the NV30 wins."
so while nVIdia claims higher percision (rightfully so) they cannot really use it due to lack of required speed
In terms of Performance :
NV30+NV30-path is faster than NV30+ARB2
NV30+NV30-path is faster than R300+ARB2
R300+ARB2 is faster than NV30+ARB2
R300+R200-path is faster than R300+ARB2
In terms of Quality :
NV30+ARB2 is better than NV30+NV30-path
NV30+ARB2 is better than R300+ARB2
R300+ARB2 is better than NV30+NV30-path
R300+ARB2 is better than R300+R200-path
so I concede the fact that NV30 is better looking at ARB2 but the problem is NV30 and ARB2 = half of the performance of R300+arb2....
please note that JC stated that "R300+ARB2 is better than NV30+NV30-path" is correct.
Looking at the NV3X performance in current games where it has a hard time keeping up with a slower clocked part from a competitor makes me wonder how badly it will do once the DX9 games that do make use of all of the advanced PS 2.0 features comes out
I am definately glad I did bot buy one of Nvidia's cards in my price range as I would have been royally screwed...
"The R200 path has a slight speed advantage over the ARB2 path on the R300, but only by a small margin, so it defaults to using the ARB2 path for the quality improvements. The NV30 runs the ARB2 path MUCH slower than the NV30 path. Half the speed at the moment. This is unfortunate, because when you do an exact, apples-to-apples comparison using exactly the same API, the R300 looks twice as fast, but when you use the vendor-specific paths, the NV30 wins."
so while nVIdia claims higher percision (rightfully so) they cannot really use it due to lack of required speed
In terms of Performance :
NV30+NV30-path is faster than NV30+ARB2
NV30+NV30-path is faster than R300+ARB2
R300+ARB2 is faster than NV30+ARB2
R300+R200-path is faster than R300+ARB2
In terms of Quality :
NV30+ARB2 is better than NV30+NV30-path
NV30+ARB2 is better than R300+ARB2
R300+ARB2 is better than NV30+NV30-path
R300+ARB2 is better than R300+R200-path
so I concede the fact that NV30 is better looking at ARB2 but the problem is NV30 and ARB2 = half of the performance of R300+arb2....
please note that JC stated that "R300+ARB2 is better than NV30+NV30-path" is correct.
Looking at the NV3X performance in current games where it has a hard time keeping up with a slower clocked part from a competitor makes me wonder how badly it will do once the DX9 games that do make use of all of the advanced PS 2.0 features comes out
I am definately glad I did bot buy one of Nvidia's cards in my price range as I would have been royally screwed...
Ailuros said:YeuEmMaiMai,
A simple example where you obviously want to see only half of the picture would be here:
John carmack states that when you force NV30 to run the STANDARD ARB2 path the R300 appears to be twice as fase.
Carmack's specific statement wasn't concentrated on performance alone; he also commented on image quality differences between different modes. I'd urge you to re-read the full statement.
If the differences in a game like Doom3 are miniscule between different accuracy depths, then it's senseless to torture a specific accelerators performance for nothing. Riding over it makes to me as much sense as the R3xx 5bit LOD precision.
FXs are by the way yielding better performance due to stencil op performance with the special NV30 path.
----------------------------------------------------------------
I think it's time that someone sits down and writes an educated article about different floating point formats, implementations and what not. Ideally even with an attempt to analyze where and why each format is required and what the differences would look like.
I don't think that there's anyone with half a brain that cannot acknowledge that the R3xx family has an advantage in terms of arithmetic efficiency; yet that doesn't mean that the FXs are completely worthless either. In fact wouldn't anti-aliasing quality take such a high persentage in my own preferences I'm not so certain I'd own a R3xx today.