Then we agree on this, don't we?
There were always rumours about companies counting transistors a bit differently. Also, it doesn't tell you that much about chip complexity anyway - transistors in caches can be packed way more densely usually than those in logic circuits. But most importantly, die area is a more direct indicator of cost than the amount of transistors - what matters is how big your die is (and thus how many dies you can fit on a wafer), not how many transistors are in the chip. So sure G94 looks better with its 500 million transistors vs. rv670 with its 666 million transistors considering the performance - but it doesn't make it cheaper to produce. Why there is this quite large discrepancy in transistor count vs. die size is of course interesting on its own - maybe G94 has relatively less cache and more logic, counted differently, or AMD was just able to pack them more tightly together for some other unknown reasons...If you don't mind my asking, what's wrong with comparing transistor counts?
I have no idea if that's the case, but that'd seem like a pretty smart idea if you don't support DX10.1, doesn't it?Does this mean that they intend to push CUDA even further, while diluting DX10/OpenGL marketing features, or was it merely an inconsequential decision ?
Yep -- it's like $0.99 against $1.00.That 933 GFlops just keeps hounding me, mainly because it is so close to the teraflop mark. Wonder if the chips couldnt be clocked higher or the yeilds were really that bad?
Cheers, guess it was a typo on the other forum then
I have no idea if that's the case, but that'd seem like a pretty smart idea if you don't support DX10.1, doesn't it?
Basically, you're saying that performance increases more than linearly with transistor count increase. And that higher-end chips are more effective (because the fixed parts that always have to be there take up relatively less portion of the total number of transistors) - of course, excluding the cases where the chips are bandwidth-limited, like G92, but I don't suppose this will be the case with RV770XT or GT200. So, theoretically, GT200 should have more perf/tran. count than RV770.But anyway, comparing chips born for different markets is difficult when coming at performance per transistor, as i.e. even in the same architecture and process node, if you compare i.e. the HD3870 and HD3650, you have the first being much more than 2 times as powerful than the second, with an increase of only 60% of the die area.
Sure I was comparing official figures which may be counted differently. But it's absolutely irrelevant for the sake of the argument! We do know that 505 million in G94 is comparable to 666 million in RV670. As was already said, G92 is limited by memory bandwidth, so it's not as effective. GT200 has a higher ALU:tex ratio, that should give it even more perf/transistor than if if was just a bigger G80/G92. GT200 has 1,4 billion transistors, that is an official figure. If we were to compare its effectiveness against other GPUs, we need to factor in the NVIO which is about 100 million transistors. So we have a 1,5 billion transistor beast. RV770 won't come anywhere near 75% of this number. I know that the mathematics involved are not exactly correct, but hey, it works.There were always rumours about companies counting transistors a bit differently.
. If we were to compare its effectiveness against other GPUs, we need to factor in the NVIO which is about 100 million transistors.
Back to NVIO:
Just some weeks ago I heard a rumor in Asia, that Nvidia is planing to ship in Q2 2009 GeForce only without display-outputs, while open H-SLI to AMD, with the target to stay in Intel Chipset market, even in Nehalem times.
Maybe NVIO is part of this strategy and NVIO can then only ordered for Quadro.
I thought nvio was quite small in terms of transistor count - maybe not even two-digit million number. However, I'd certainly expect those to be "large transistors"... G92 had compared to G80 nvio integrated, VP2 engine, twice the texture addressing capability. Conversely, it certainly saved on ROPs (though while there only 4 quad-rops instead of 6 each one probably is slightly more complex since they support better compression - still those 4 rops should of course use less transistors than the old 6).=>AnarchX: Well apparently the 100 million figure is overstated. Google can't find any info on this, so I don't know. But if you just compare G80 and G92, the difference is 73 million and G92 has only two thirds the ROPs and memory controller channels. But frankly I don't know whether those >73 million came from NVIO or the modified texturing units.
Well you're right, but I still don't like using these numbers for anything. Though I admit you could make the same argument if you'd use die size for comparison too...Sure I was comparing official figures which may be counted differently. But it's absolutely irrelevant for the sake of the argument!
But you forget that current rumours suggest it's clocked quite a bit lower - so I don't really expect it to have a higher "per-transistor performance" than G92. Also, if you think G92 is really limited by memory bandwidth (which certainly seems to be a factor), then G200 would be just the same if not worse if you assume it's at least twice as fast...We do know that 505 million in G94 is comparable to 666 million in RV670. As was already said, G92 is limited by memory bandwidth, so it's not as effective. GT200 has a higher ALU:tex ratio, that should give it even more perf/transistor than if if was just a bigger G80/G92. GT200 has 1,4 billion transistors, that is an official figure. RV770 won't come anywhere near 75% of this number.
Where do you get vastly? Even at 4.5GHz, that is 144GB/s with a 256bit memory interface.mczak said:If R700 is going to use GDDR5 ram, it will also have vastly more total memory bandwidth
240 * 2 + 1 (MADD+ GPGPU/General Shading enabled MUL) * 1.296GHz = 933GFLOPs.