Chalnoth said: Oh, I'd be willing to bet that nVidia's texture units take up more total die area than ATI's. Remember that nVidia has 24 of 'em.
That wasn't my point; imagine how the transistor count would have risen in G7x if it had kept the same texture samplers as NV2x/3x. In my mind, the higher angle dependency since NV4x is a clear transistor-saving design decision, even more so since the design was set to scale beyond 4 quads.
The dynamic branching optimizations are likely the most costly. It is for this reason that I suspect that nVidia's next part isn't going to be as good as the R5xx at dynamic branching. The memory controller is also likely a big part.
But still, ATI's R580 is sitting at roughly twice the die area for a part with similar performance and feature set when compared to nVidia's G71. I don't buy that you can account for all of this just by the G71's relatively fewer features.
Don't you think that the gap between 160M on R4x0 and 384M on R580 is a tad too wide to attribute the majority of it to "just" SM3.0 + dynamic branching + memory controller, for example?