Quote:
If it's really "highly accessible", no doubt about it, yeah. But AFAICT this accessibility is shown only in very arithmetic-heavy environments, aka CUDA-style apps. Games in general, even newer ones, do not utilize the ALUs as heavily as e.g. AMD would like them to. So G86's changes have barely had a chance to shine yet.

I'm not sure I really agree with that. The problem IMO is that people expect entire games to be ALU-limited, and conclude that unless performance scales basically as fast as the number of GFlops, the ALUs are not the bottleneck. That's absolute nonsense!
It's really not hard to see how only half a frame or less might be ALU bottlenecked. You've got Z/Stencil/Shadow passes, and some pixels requiring high levels of AF. There also are potential ROP and triangle setup bottlenecks even in color passes... In the end, what shocks me is reasoning such as this:
http://www.anandtech.com/video/showdoc.aspx?i=2931&p=5 said:
F.E.A.R. does respond a little better to shader overclocking than Oblivion, but even at 8.3% improvement at 1600 MHz, F.E.A.R. performance doesn't even improve at half the rate shader clock speed is increased. Like Oblivion, F.E.A.R. benefits much more from increasing core clock speed.

Performance increases by 8.3% for a 15% shader clock increase (1350->1550) and they aren't happy?! Pff, whiney bitches!
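For what it's worth, here is a back-of-the-envelope way to turn those numbers into a share of frame time that is shader-clock bound. It is only a sketch under a simple Amdahl-style assumption: the frame splits cleanly into one part that scales perfectly with shader clock and one part that doesn't scale at all, which real hardware won't strictly obey. The function names are mine, purely for illustration. With an 8.3% gain, 1350->1550 gives a fraction of roughly 0.59, and if the overclock was really 1600 MHz as the quoted article says, roughly 0.49. Either way that is in the ballpark of "about half a frame" being ALU bound.

```python
def alu_bound_fraction(clock_before_mhz, clock_after_mhz, fps_gain):
    """Estimate the share of frame time that scales with shader clock.

    Assumes an Amdahl-style split: frame time = p * (ALU-bound part)
    + (1 - p) * (everything else), where only the first part speeds up
    with the shader clock. Solves 1/S = (1 - p) + p/s for p.
    """
    clock_speedup = clock_after_mhz / clock_before_mhz   # s
    frame_speedup = 1.0 + fps_gain                       # S
    return (1.0 - 1.0 / frame_speedup) / (1.0 - 1.0 / clock_speedup)


def predicted_fps_gain(clock_before_mhz, clock_after_mhz, alu_fraction):
    """Inverse of the above: expected fps gain for a given ALU-bound share."""
    clock_speedup = clock_after_mhz / clock_before_mhz
    new_frame_time = (1.0 - alu_fraction) + alu_fraction / clock_speedup
    return 1.0 / new_frame_time - 1.0


if __name__ == "__main__":
    # Numbers from the post: 8.3% gain, shader clock 1350 -> 1550 MHz.
    print(round(alu_bound_fraction(1350, 1550, 0.083), 2))
    # Same gain, but with the 1600 MHz figure from the quoted article.
    print(round(alu_bound_fraction(1350, 1600, 0.083), 2))
```

Under this model a game that is fully ALU bound (fraction 1.0) would be the only case where fps tracks the shader clock one-to-one, so scaling at "half the rate" is not evidence that the ALUs don't matter.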
Quote:
So, if the coming generation is not going to stay in the market unchanged, I think it'd be better off investing those transistors in a larger register file, or the ominous thing which keeps the GS from being competitive with AMD, or maybe even a higher triangle rate.

I'm not sure how (much) a larger register file would help G80. But it might come in handy in G92 if they've got both a higher clock rate and higher memory latency (if they are indeed using GDDR4)...
As for triangle setup and GS performance, I completely agree. The former will already be helped a bit by the clock speed I suppose, but I'm not sure if that's really enough. As for the latter, we'll see. I'd also like to see INT8 blending improved a bit, since G8x is rather weak there compared to R6xx right now.