It's 1:8 (704 GFLOPS DP).
Really?
Middle-End - 7970/280X -- 1:4 DP
Hi-End - 290X -- 1:8 DP
It's 1:8 (704 GFLOPS DP).
But I'm sure the AMD cares: this GPU seems to be hitting thermal limits quite often. It needs a very noisy cooler and it inevitably has voltage regulator cost and complexity consequences too.
The possibility seems real to me that growing the die to the same size as Titan might improve power efficiency without necessarily increasing the performance by the same amount.Improve the power efficiency by 22% and it'd blow the Titan completely out of the water with a die that's ~15% smaller too. (Or they could have grown the die for an even bigger lead.)
The tenor of the discussion around 20nm planar is that it isn't that great. This is relative to a modern scaling situation where good scaling isn't all that great on an absolute scale, and being planar means the foundries are again one node too late on improving their device properties like they were with HiKMG.My understanding is that 20nm won't give a huge improvement in terms of power (can somebody confirm?), so in the pathological case of no power related architecture fixes, all they can do is reduce die size.
So, yes, some people care: the AMD engineers. And they're probably working very hard right now to fix it.
This is a victory for AMD no doubt. Fast card at the right price.
However, NVIDIA has enough TDP headroom to easily beat the 290X with a "Titan Ultra". Perhaps that is what the 780Ti will be.
Still, thank you AMD for the kind pricing. :smile:
I do not know if it is a victory. Sure it the fastest card, but it also needs the most energy and the reference cooler is just bad and the rpice is also very low. NV has been selling that performance level for $1000 and AMD starts at nearly half.
Sure. But we're allowed here to talk about the engineering aspect, right?
And, as I mentioned, is 20nm better at perf/W than 28nm? If not, then there's work to be done.
Source? Most reviews still claim 1:4 (but with nothing to back it up neither).
Really?
Middle-End - 7970/280X -- 1:4 DP
Hi-End - 290X -- 1:8 DP
GF110 -> Geforce GTX580: 1:8 DP - 200 DP FLOPS
GK104 -> Geforce GTX680: 1:24 DP - 128 DP FLOPS
Did you also facepalm then?
Tahiti was an all-around flagship for all-around performance, with special commitment to computing performance. Hawaii is a chip made for gaming (at high resolutions) above all.
You can see that through the architectural changes AMD made from Tahiti to Hawaii:
- Doubled the front-end
- Significantly increased memory bandwidth
- Trimmed down on DP performance for area/power/heat savings
- Marginally increased ALU performance
Zero. As mentioned in the initial GCN architecture introductions back with Tahiti, DP is just a scalable aspect of the architecture that can and will be tuned dependent on product needs and focus.I wonder how much the console connection influenced the decreased focus on DP?
Zero. As mentioned in the initial GCN architecture introductions back with Tahiti, DP is just a scalable aspect of the architecture that can and will be tuned dependent on product needs and focus.
Zero. As mentioned in the initial GCN architecture introductions back with Tahiti, DP is just a scalable aspect of the architecture that can and will be tuned dependent on product needs and focus.
Can you say if the 1:8 rate is what the Hawaii silicon is really capable of or if a possible FirePro version could be faster? Some testers said that Mike Mantor indicated that possibility to them.Physical implementation in logic I have no clue about. But likewise, there is a maximum physical DPFP rate and a register controlled rate, if need be.
Zero. As mentioned in the initial GCN architecture introductions back with Tahiti, DP is just a scalable aspect of the architecture that can and will be tuned dependent on product needs and focus.
Can you say if the 1:8 rate is what the Hawaii silicon is really capable of or if a possible FirePro version could be faster? Some testers said that Mike Mantor indicated that possibility to them.
So why does Hawaii have reduced DP? It's very surprising for a flagship with more ACEs than Tahiti, a lot of bandwidth, and a 512-bit bus that could accommodate 8GB of RAM. I mean, apart from DP, Hawaii looks like a great GPGPU chip.
It could also be a strategy to hold out in that market with Tahiti until 20nm arrives and focus on the gaming market with Hawaii sacrificing DP performance (and possibly ECC) in the process. You have to weigh the saved silicon and therefore incrementally higher profit for each sold consumer GPU against the possible additional revenue generated from the relatively low volume (but high margin) FirePro sales. I don't have the numbers to say how this would calculate, of course.I would be notably surprised if the FirePro (or some other SKU / line that would try again where Firestream failed epically) did not expose more DP throughput.
Hawaii offers more FLOPS per square mm than Tahiti (regardless its 64 ROPs).I disagree. The ALU-to-ROP ratio is the lowest we've seen in AMD flagships for quite a while.
Want better than a Tahiti for GPGPU? Get another Tahiti. It's a more efficient GPU for that.
But at least the R9 290X incarnation offers just a bit more than half the DP flops per mm² .Hawaii offers more FLOPS per square mm than Tahiti (regardless its 64 ROPs).