RingWraith
Newcomer
Considering the R3xx handled T&L through the Vertex shader, couldn't they boost the performance of the R420 by adding a dedicated T&L unit?
Well, not really. . . but in any case, the speed boost would be negligible when compared to the already huge increase of speed the games will get from the new card's raw power.K.I.L.E.R said:Popular games that use a hardwired T&L unit are nearly dead with maybe some small exceptions.
I don't think you would have to worry about this. GPU's are fast because they can work on many parallel streams of data, and nearly all processing can be broken down into a few simple mathematical operations that are done repeatedly. The GPU programming algorithms enforce this independence of vertex and pixel processing. Any special hardware that must be added to the vertex shader for fixed-function T&L would not slow the vertex shader down, because it is still working on the same type of data. Still, I'm pretty sure that today's vertex shaders are more than general enough to fully emulate fixed-function T&L without any extra hardware dedicated to fixed function.RingWraith said:T&L is still being used, because EVERYONE has T&L capable cards. I wasn't referring to a boost in T&L performance, I was referring to the increased VS performance for not having to emulate a T&L unit.
The rule that seems to be universal is the fact that a dedicated unit is much faster than a multi-pupose unit.
trinibwoy said:Is the T&L implementation in hardware transparent to the application? Do you have to specifically code for a vertex shader implementation? I thought it was just a driver/hardware consideration.
AndrewM said:trinibwoy said:Is the T&L implementation in hardware transparent to the application? Do you have to specifically code for a vertex shader implementation? I thought it was just a driver/hardware consideration.
You have to specifically program vertex shaders. HW T&L is transparent.
RingWraith said:T&L is still being used, because EVERYONE has T&L capable cards. I wasn't referring to a boost in T&L performance, I was referring to the increased VS performance for not having to emulate a T&L unit.
The rule that seems to be universal is the fact that a dedicated unit is much faster than a multi-pupose unit.