Bigus Dickus
Regular
Just what is the expected performance hit for enabling FP color on the R300? I know there probably aren't benchmarks available to show that yet, but I'm asking more from a theoretical point of view.
What will that affect? Do the pixel shaders take longer to perform a given operation on a FP32 texel than they do an int 32 texel? Does it take more bandwidth to move texture data? Does it take a larger footprint in memory?
What is the difference between the way the R300 handles FP color vs. the way the NV30 does it? I know the NV30 has dedicated hardware for int 32, FP16 and FP32, per the B3D interview. I've seen speculation that the NV30 would be two or three times faster than the R300 using 128 bit color (or 96, as the case might be). Is there any rational for such claims?
What governs performance with FP color? Bandwidth, shaders, otherwise...
Lots of questions, yes, but this is one area I'm not too informed in. It's well known what the impact of enabling anisotropic filtering is on different architectures, or SSAA, or MSAA, but little is known (to me at least) about the impact of enabling high precision color.
I almost want to think that if the architecture was designed to natively handle 128/96 bit internal FP precision, then there would not be a tremendous performance penalty for using it. Is this hunch correct?
What will that affect? Do the pixel shaders take longer to perform a given operation on a FP32 texel than they do an int 32 texel? Does it take more bandwidth to move texture data? Does it take a larger footprint in memory?
What is the difference between the way the R300 handles FP color vs. the way the NV30 does it? I know the NV30 has dedicated hardware for int 32, FP16 and FP32, per the B3D interview. I've seen speculation that the NV30 would be two or three times faster than the R300 using 128 bit color (or 96, as the case might be). Is there any rational for such claims?
What governs performance with FP color? Bandwidth, shaders, otherwise...
Lots of questions, yes, but this is one area I'm not too informed in. It's well known what the impact of enabling anisotropic filtering is on different architectures, or SSAA, or MSAA, but little is known (to me at least) about the impact of enabling high precision color.
I almost want to think that if the architecture was designed to natively handle 128/96 bit internal FP precision, then there would not be a tremendous performance penalty for using it. Is this hunch correct?