what about a GTX270? they can keep it at 28 ROPs/448bit if needed and up a bit the core, but enable further 24 SPs.
Wouldn't the card have to be 512bit if they enabled the remaining cluster?
Wouldn't the card have to be 512bit if they enabled the remaining cluster?
Obviously some like to think that way, yes. Me, I'd rather believe, Nvidia vastly underestimated RV770.Heh, so RV770 has people believing in miracles now eh?
Can't say I blame them. ATI's engineering didn't have much to brag about with their previous DX10 efforts.I'd rather believe, Nvidia vastly underestimated RV770.
NVidia may be in a tough spot now due to their commitment to CUDA. I think Jawed pointed this out, but it seems like they're stuck with 8-wide SIMDs. ATI basically has 16x5 SIMDs right now and there's no pressing need to go for better granularity. Even after 55nm scaling, the former is more than half the size of latter, and despite increased utilization and clock speed, that's not even close to being small enough.
NVidia may be in a tough spot now due to their commitment to CUDA. I think Jawed pointed this out, but it seems like they're stuck with 8-wide SIMDs. ATI basically has 16x5 SIMDs right now and there's no pressing need to go for better granularity. Even after 55nm scaling, the former is more than half the size of latter, and despite increased utilization and clock speed, that's not even close to being small enough.
I think computational speed is starting to matter less, though. Games are probably using a bit more math, but it's not increasing as fast as GPU ability. We'll see if GT300 has some innovations there.
NVidia's connecting 10 clusters to 8 ROP partitions - whereas ATI's connecting 10 clusters to 4 MCs. The interconnection logic scales faster than either side being connected- it's a combinatorial explosion.Since a lot of the arbitration logic is part of the clusters what is all the extra stuff on the GT200 die?
IMO they really don't want to move away from 8-wide SIMDs, as that is something they want to keep consistent in their GPU computing framework. ATI hasn't made any such commitment.Can you elaborate as to why you think NV is in a tough spot due to their commitment to CUDA?
Sure, but not at the same cost as ATI. NVidia has a long history of designing as optimally as possible for a given set of design constraints (NV3x aside). I'm pretty sure their current design can't get any smaller.In terms of math capability I think Nvidia can keep up even with the more expensive 8-way SIMD approach.
I don't think you're right about that. The ALU space is ~25% on both, and while TEX space is about the same for NV's DX10 chips, the TEX area is a lot smaller than the ALUs for ATI. It looks like ~40% ALU+TEX on RV770, and 50% ALU+TEX on GT200 and G92/G80.What I don't get is why GT200 seems to have a lot more supporting logic than RV770. For example, ALU+TEX on RV770 seems to be a larger percentage of the die than ALU+TEX on GT200 even with NVIO parceled out to a separate chip. Since a lot of the arbitration logic is part of the clusters what is all the extra stuff on the GT200 die?
If only CUDA was not so close to the hardware and a little bit more abstract..IMO they really don't want to move away from 8-wide SIMDs, as that is something they want to keep consistent in their GPU computing framework. ATI hasn't made any such commitment.
Thanks for the link, I just finished reading it. It looks like a very interesting model and implementation, and the fact that they developed a non-trivial application on it (the X3D parser) makes it a lot more credible.Slight tangent: what do you guys think of the bulk synchronous parallel processing model? There's a paper from microsoft research at siggraph 08 on it: BSGP: Bulk-Synchronous GPU Programming, scroll down a bit for the paper.
We said a few months ago that Nvidia drives two projects in parallel. GT200 65nm that got launched and branded as GTX 280 / 260 is out and there is a GT200 55nm chip that should be launched shortly.
Our sources are telling us that 55nm version of the chip should be ready either in late August or in September which means that Radeon HD 4870 X2 will get some competition.
We believe that Shaders and clock of 55nm GT200 are definitely going to be higher than the 65nm and the chip itself should be a bit cooler.
This means R700 will get some better competition and the fact that this ATI’s dual card is going to end of faster than GTX 280, doesn’t mean ATI has already won the war.
"We believe that Shaders and clock of 55nm GT200 are definitely going to be higher than the 65nm and the chip itself should be a bit cooler."Fudo says GT200b will be here in September, maybe even August:
http://www.fudzilla.com/index.php?option=com_content&task=view&id=8515&Itemid=1