AFAIK AF was never broken - as well as AA/Z-Fill, according to AMD. But that's another story.AA was fixed in RV770. Z-fill rate was fixed in RV770. AF may have been fixed too, although that's less clear.
AFAIK AF was never broken - as well as AA/Z-Fill, according to AMD. But that's another story.AA was fixed in RV770. Z-fill rate was fixed in RV770. AF may have been fixed too, although that's less clear.
DUH!
R600 was severely short on texturing capability, so short that it became bottleneck, so the relatively good shader-speed was unobservant.
Invcreasing Tex power the bottleneck was removed - so now shading power can be better utilized.
And fixed z-speed & AA/AF.
Thats what SH is implying imho.
Anyone claiming R600 has "enough" tex power should take a look at ATi's market&mind share.
Regardless of how underpowered one aspect of it was, if everything is basically increased in a 1:1 fashion, nothing is actually being "fixed". While it may be coincidence that texturing power is now much more robust, it was not singled out as a main source of "fixing" because both it, and the shading power, has increased at the same rate. I do not understand how that can be so easily construed as fixing.
It would be like saying that the r700 is somehow fixing something in the rv770 because it is now 100% more of both shading and texturing elements.
If shaders were unused on RV670, they are still unused on RV770, so nothing was fixed from this POV.
Anyway, G94 had twice as many TMUs as RV670, but it performance was about the same, so I don't think RV670 was bottlenecked by it's texturing core.
And here, again, is where it all falls down, because this situation is entirely unchanged in RV770 due to the fact that it has exactly the same number of textures to ALU's; there is just more of all.Shaders were going unused because there simply were not enough texture units to handle the workload.
And? This is no different in RV770. But everything has bottlenecks on different workloads. Its easy to find shaders that are bottlenecked to ALU's and textures are underutilized.You're missing the point though. Let's say the current frame being rendered has a large amount of texture lookups per pixel in the shader core. What happens if you have too few TMUs to meet these requests? Your shader core goes idle because of the texturing bottleneck!
And here, again, is where it all falls down, because this situation is entirely unchanged in RV770 due to the fact that it has exactly the same number of textures to ALU's; there is just more of all.
Again this line of thought has merit if the texture:ALU ratio went up at different rates, but simply it didn't.
ShaidarHaran said:I believe R7xx is a "correction" to the mistake that was R6xx and it's horrible lack of texturing/z-fill/and AA sample rates.
ShaidarHaran said:Why then did your engineers increase texture filtering/sampling performance by 250% this generation?
willardjuice said:Yeah they also increased their shading power by 250% too. Is SH suggesting the R6x0 was "severely short" on shading power?
And, assuming no other bottlenecks, RV770 would show exactly the same untilization but just be faster to the degree that that the overall engine scaled up.If I were a programmer (where's jawed and humus when you need them?) I'd write an app to run on both RV670 and RV770 to demonstrate this fact. All I can do in the meantime is refer back to my suggestion to use Pix.
And, assuming no other bottlenecks, RV770 would show exactly the same untilization but just be faster to the degree that that the overall engine scaled up.
In terms of hardware units, RV770 is no more texture limited than RV670.
When referenced with actual workloads, however, it appears there is a return on investment curve when it comes to having X number of units of any type.
We can see diminishing returns for adding tons of extra texture units. Nvidia's higher TMU counts didn't translate into linear increases in performance.
On the other hand, it could be argued that R6xx's base level of texturing capability--irrespective of ALU:TEX--was pathologically small compared to the texturing threshold many game worloads would need before other bottlenecks (bandwidth, setup, etc.) started to take precedence.
Diminishing returns can occur with an excess of capability, but escalating costs can be associated with falling below the baseline that many workloads would consider adequate.
In that regard, R600 could be considered too small for some workloads, and RV770 is what happens when a design grows up to match its tasks.
And you've just said no different from: the new generation is better than the prior because it has more engine to cope with the workloads.In that regard, R600 could be considered too small for some workloads, and RV770 is what happens when a design grows up to match its tasks.
And you've just said no different from: the new generation is better than the prior because it has more engine to cope with the workloads.
In that regard, R600 could be considered too small for some workloads, and RV770 is what happens when a design grows up to match its tasks.
All that means is the RV770 improved performance over R6x0. How is that evidence that AMD specifically made (went out of their way) a "correction" to their "texturing problems"?
In realtion to R600 the base ability does not change, other than the fact the entire engine has scaled upto a different baseline - this is not a correction in any principles adopted with R600, this is taking advantage of newer processes and engineering to increase the performance of the new architecture, much like other arhictectural generations have before.Again, it's about base ability. You're hung up on the ratio because that's all ATi's preached for years now
Why is this topic so hard for some to grasp?
Someone needs to do some testing here, because the answer seems blatantly obvious to me but words obviously aren't enough to convince everyone.
In realtion to R600 the base ability does not change, other than the fact the entire engine has scaled upto a different baseline - this is not a correction in any principles adopted with R600, this is taking advantage of newer processes and engineering to increase the performance of the new architecture, much like other arhictectural generations have before.