Man from Atlantis
Veteran
not sure why you need more rops when you are already bottlenecked by Front End..Maybe some secret ROPs, though? My layman's eyes can't tell.
not sure why you need more rops when you are already bottlenecked by Front End..Maybe some secret ROPs, though? My layman's eyes can't tell.
Those aren't GeForce boards and are far more expensive.
off-line 3D rendering : then yes a 7970 6GB looks awesome.
Actually there isn't, see my thread on this topic. It's just marketing.the ECC feature is a must if you're going to do serious HPC, there's maybe quite a risk of bits flipping on your gas guzzling gaming card.
You're not just bottlenecked by the front end. It would be a rare chip to have a single bottleneck.not sure why you need more rops when you are already bottlenecked by Front End..
I'm sure HD7970 wouldn't have ended up much slower if Tahiti had been designed with a 256bit interface and "just" 28 CUs: HD7950 scales to an average of 95% of HD7970 performance if clocked to 925/2750, and cutting the memory speed of HD7970 by 33% results in about 10% less performance (a little more or less depending on settings and games).Lots too much silicon is required for "just marketing".
I'm sure HD7970 wouldn't have ended up much slower if Tahiti had been designed with a 256bit interface and "just" 28 CUs
I didn't intend to take away any credit from the architects and other tech people at AMD. I actually think they did a pretty amazing job with the entire HD7000 line. Marketing and psychology are pretty important, though.Let's try and give architects and other people involved slightly more credit, eh?
I didn't intend to take away any credit from the architects and other tech people at AMD. I actually think they did a pretty amazing job with the entire HD7000 line. Marketing and psychology are pretty important, though.
That's not to say that there's more marketing than mathematics involved in laying out a chip's specs - but I'd be surprised to hear that all that TeraScale, GHZ Edition, and frame buffer talk didn't influence the design goals of a chip at all. Not to speak of making a card actually fit into a specific market segment ...
Concealing faulty hardware and flattering overclockers is marketing.Lots too much silicon is required for "just marketing".
I don't now, but ROPs are the odd one out:
HD 7870: 2.560 GFLOPs; 32.000 MPix/s; 80.000 MTex/s; 153.600 MB/s
HD 7970: 3.789 GFLOPs; 29.600 MPix/s; 118.400 MTex/s; 264.000 MB/s
GCN clearly had the room to double the depth/stencil rate, given the generous BW.
Due to the pixel export limitation of the SMs (14 pixel/clock, a GTX560Ti increases that to 16 pixel/clock, clock means the base clock here, not the hot clock, and it is actually only that much for RGBA8 and FP32, all other pixel formats half that value), the GTX460 is nowhere even close to a HD7970 and comparing the older HD5800/6900 series to a GTX480/GTX580 with blending, the nV GPUs just profit from the higher memory bandwidth, not the higher ROP count (besides in some AA cases) due to the same export limitation. NVidia was obviously just too lazy (better: it judged it as an unnecessary cost) to decouple the ROP count from the width of the memory controller, the higher number of ROPs buys them basically no performance.Fellix, whats your take on these pixel fillrates? The 7970 has similar to a GTX460, while a GTX580 has 37.8.
That's the export limitation of the SMs I mentioned. ROP count is no bottleneck with Fermi. It makes no sense to complain that Tahiti should have more ROPs while it decisively beats a GTX580 in all fillrate benchmarks (and the GTX460 jaredpace mentioned is simply no contest).Note the colour write rates for GTX 580. The count stop short before the scan-out capacity of 32 fragments per clock and far below of what the 48 ROPs are capable.
Top to bottom GHZ lineup (for GCN). They were so close too.
I wonder if we will ever know why not?