ATi is chtg in Filtering

Quitch · May 26, 2004

Clear advice of this article would be that anyone in need of strict OpenGL conformance should not use ATI cards because they have reduced precision in logarithm calculation and because they do not follow OpenGL recommendations (Rho .vs. Lambda).

LOL! So we should avoid something because it doesn't follow a *reccomendation*? How stupid is that? If it's optional then it's optional, if the OpenGL body wanted it followed by everyone it wouldn't be a mere reccomendation.

Sounds like someone finding a way to back their favourite IHV.

Clootie · May 26, 2004

Suspicious said:
Maybe I can help a bit with somewhat better Russian translation, I used to learn it in school (it was a long time ago though)

Pretty good translation, thank you for spending time on it

Pete · May 26, 2004

Agreed, thanks for the very readable translation, Suspicious.

It would appear that nV still offers more refined blends, and ATi still leans toward sharper (and potentially more shimmery) textures. Brandon and Lars have noted this, but not in very much depth (in all fairness, Lars appeared to mention this in one sentence, and Brandon noted it throughout his screenshot analysis). ATi's more detailed texturing is evident in Mike's AM3 screenshots, too.

Now I just have to figure out what it means that nV bases their linear interpolation on Lambda, and ATi on Rho (beyond potential patent issues)....

FUDie · May 27, 2004

Pete said:
Now I just have to figure out what it means that nV bases their linear interpolation on Lambda, and ATi on Rho (beyond potential patent issues)....

If you look at the OpenGL spec (www.opengl.org), you'll find that lambda = log base 2 of rho(x,y). rho(x,y) = max { sqrt( (du/dx)^2 + (dv/dx)^2 ), sqrt( (du/dy)^2 + (dv/dy)^2 ) }. From the results at iXBT, it looks like ATI uses a linear approximation to log base 2. (natural log of x = ln x = 1 + x + x^2 / 2 + x^3 / 3 + ..., so 1 + x would be a linear approximation to ln x. For log base 2, just take ln x and divide by ln 2). However, since the D3D refrast does the same, I see no basis for complaints.

-FUDie

bloodbob · May 27, 2004

This is where Nvidia's LOD calculation are better then refrast. It is a slight improvement but in the big picture the devations are pretty small.

KimB · May 27, 2004

FUDie said:
(natural log of x = ln x = 1 + x + x^2 / 2 + x^3 / 3 + ..., so 1 + x would be a linear approximation to ln x. For log base 2, just take ln x and divide by ln 2). However, since the D3D refrast does the same, I see no basis for complaints.

Well, that's not true.

A correct formula is:
ln (1+x) = x - x^2 / 2 + x^3 / 3 - x^4 / 4 + ...

So, to first order, one can approximate the log by simply using ln(1+x) = x, or ln(x) = (x-1)/x.

As a side note, if you better-approximate the log, it can become easier to properly do the calculation:
sqrt(x^2 + y^2) that is necessary for LOD calculations, as the square root becomes trivial (a simple divide by two after the log).

FUDie · May 27, 2004

Chalnoth said:
FUDie said:

(natural log of x = ln x = 1 + x + x^2 / 2 + x^3 / 3 + ..., so 1 + x would be a linear approximation to ln x. For log base 2, just take ln x and divide by ln 2). However, since the D3D refrast does the same, I see no basis for complaints.

Click to expand...

Well, that's not true.

A correct formula is:
ln (1+x) = x - x^2 / 2 + x^3 / 3 - x^4 / 4 + ...

So, to first order, one can approximate the log by simply using ln(1+x) = x, or ln(x) = (x-1)/x.

You're right, I typed the above in haste.

-FUDie

Pete · May 27, 2004

Thanks again for taking the time to explain things, FUDie (and Chalnoth). I just need some time to understand rho as "scale," and from that, lamba. (Tau, I think I get, simple as it is.) Give me a few days before you waste more time on me, impatient as I am.

Simon F · May 27, 2004

FUDie said:
(natural log of x = ln x = 1 + x + x^2 / 2 + x^3 / 3 + ..., so 1 + x would be a linear approximation to ln x.

For log base 2, just take ln x and divide by ln 2).

ACK! No one in their right mind would do it that way in computer HW! As an exercise to the reader think of floating point numbers.

KimB · May 27, 2004

Simon F said:
ACK! No one in their right mind would do it that way in computer HW! As an exercise to the reader think of floating point numbers.

Well, obviously with floating point numbers, to a first approximation, the log would just be the exponent. Then you could do successive approximations on the mantissa. But the real question is: How useful is this in the case of texture LOD calculations? Do they encompass enough dynamic range? Is the calculation even done in floating point?

Using this method would basically be useful to make the geometric series converge (as the log of a number between 1 and 2, which would be the mantissa, is pretty stable), and, I'm sure, could be used similarly on integers.

nAo · May 27, 2004

Simon F said:
ACK! No one in their right mind would do it that way in computer HW! As an exercise to the reader think of floating point numbers.

I just implemented my own LOD formula on the PS2 thinking in that way..

Simon F · May 27, 2004

nAo said:
Simon F said:

ACK! No one in their right mind would do it that way in computer HW! As an exercise to the reader think of floating point numbers.

Click to expand...

I just implemented my own LOD formula on the PS2 thinking in that way..

I'm sorry, I didn't understand what you meant. Do you mean that you are using bit tricks to extract the exponent and mantissa or that you are using the C "math.h" library "log" function and multiplying by 1/log(2) ?

If it's the latter, remember I said "in computer HW"

nAo · May 27, 2004

Simon F said:
Do you mean that you are using bit tricks to extract the exponent and mantissa

Yes, I mean that

I just use a VU instruction to convert an integer number to a float representation on a floating point number.
Another couple instructions to unbias and to fix the result and I have a good (at least in my case..) log2 approximation

ciao,
Marco

KimB · May 27, 2004

Did you use:
(mantissa) - 1 + (exponent) ?
...because that would be a little bit better than just using the exponent.

nAo · May 27, 2004

Chalnoth said:
Did you use:
(mantissa) - 1 + (exponent) ?
...because that would be a little bit better than just using the exponent.

yeah. In fact you don't need to subtract 1 at all, cause it's implicit in the FP representation, but the integer to float conversion instruction doesn't 'know' that

Demirug · May 27, 2004

Images with and without R420 "AF-optimization"

Sorry, the text is only in german but pictures should show the differences.

ChrisRay · May 27, 2004

Demirug said:
Images with and without R420 "AF-optimization"

Sorry, the text is only in german but pictures should show the differences.

It's amazing how the German/French, Asian sites figure this stuff out before us.

Very Interesting find. Thank you. Reading up

Ailuros · May 27, 2004

I haven't followed this thread and I'm not going to read 54 pages anyway. Those who can't understand german, just pop it into a online translator:

Deutlich ist zu erkennen, dass die R420-Karte inklusive der trilinearen Optimierung zusÃ¤tzlich, und das konnten wir auf unserer Radeon 9600 XT bislang nicht entdecken (!), das LOD leicht anhebt, so dass ein auf Standbildern schÃ¤rferes Bild erzeugt wird, welches in der Bewegung allerdings eher zum Flimmern neigen dÃ¼rfte, denn umsonst wird der Nullpunkt beim LOD im allgemeinen nicht eingehalten. Durch diesen angehobenen LOD filtert man lÃ¤nger aus der hochauflÃ¶senderen Mip-Stufe, kommt dabei aber auch lÃ¤nger mit einer, im VerhÃ¤ltnis zum korrekten LOD gesehen, niedrigeren Anzahl an Textursamples bsw. bei anisotroper Filterung aus. Wenn man jetzt noch zwei und zwei zusammenzÃ¤hlt und bedenkt, dass die Catalyst-Treiber per default nur auf Textur-Stage 1 trilineares AF, auf allen anderen sieben dagegen nur bilineares AF bieten, erklÃ¤rt sich sicherlich ein groÃŸer Teil der erstaunlichen FPS-Raten, die die X800-Serie bietet, wenn man alle Optimierungen in Aktion belÃ¤sst.
Weiterhin interessant zu beobachten ist zudem, dass diese LOD-Anhebung nur in Bereichen wirksam ist, in denen die WinkelabhÃ¤ngigkeit des AF nicht schon einen groÃŸen Teil der zu leistenden Arbeit einspart.

That's far from an ideal situation. Very interesting article (kudos to whoever wrote it) and I'd also like to applause the closing comment of the specific article. Cliff notes: a plead to both IHVs to get rid of the optimisations when it comes to texture filtering.

ChrisRay · May 27, 2004

I dont paticularly agree that there should be no filtering optimisations. I like having the choices between bilinear, Trilinear, and texture stage optimisations.

But they need to have the option to shut them off.

Richthofen · May 28, 2004

ChrisRay said:
Demirug said:

Images with and without R420 "AF-optimization"

Sorry, the text is only in german but pictures should show the differences.

Click to expand...

It's amazing how the German/French, Asian sites figure this stuff out before us.

Very Interesting find. Thank you. Reading up

well the simple reason for that is that those sites are not in the IHVs pockets. That's why they often have to try hard to get any review samples unlike other big sites who just write down what is trendy and what the majority wants to hear. 1 1/2 years ago and the time before it was praising Nvidia.
The last 1 1/2 year it was all about praising ATI and bashing "evil" Nvidia.
I trust them as much as i would trust PR or marketing departments.

ATi is chtg in Filtering

Quitch

Clootie

Pete

Moderate Nuisance

FUDie

bloodbob

Trollipop

KimB

FUDie

Pete

Moderate Nuisance

Simon F

Tea maker

KimB

nAo

Nutella Nutellae

Simon F

Tea maker

nAo

Nutella Nutellae

KimB

nAo

Nutella Nutellae

Demirug

ChrisRay

<span style="color: rgb(124, 197, 0)">R.I.P. 1983-

Ailuros

Epsilon plus three

ChrisRay

<span style="color: rgb(124, 197, 0)">R.I.P. 1983-

Richthofen

Similar threads

ATi is ch**t**g in Filtering

Moderate Nuisance

Trollipop

Moderate Nuisance

Tea maker

Nutella Nutellae

Tea maker

Nutella Nutellae

Nutella Nutellae

<span style="color: rgb(124, 197, 0)">R.I.P. 1983-

Epsilon plus three

<span style="color: rgb(124, 197, 0)">R.I.P. 1983-

Similar threads

ATi is chtg in Filtering