Did you calculate it from the rumored die-size and Tahitis transistor-density?Kepler GK104 has 1536 ALUs using 4.1B Transistors
Did you calculate it from the rumored die-size and Tahitis transistor-density?Kepler GK104 has 1536 ALUs using 4.1B Transistors
Did you calculate it from the rumored die-size and Tahitis transistor-density?
And if anyone was asking for an example of the blind man and elephant, it'd be hard to top this...whitetiger said:Quantifying:
Cayman has 1536 ALUs using 2.64B Transistors
Tahiti has 2048 ALUs using 4.31B Transistors
(substitute your own transistor counts if you disagree with these)
Comparing VLIW4 with GCN, then
--> 63% more transistors yields 33% more ALUs
--> therefore the overhead to support GCN vs VLIW4 is 22%
Fermi GF114 has 384 ALUs using 1.95B Transistors
Kepler GK104 has 1536 ALUs using 4.1B Transistors
(substitute your own transistor counts if you disagree with these)
Comparing Fermi to Kepler, then
--> 2.1x transistors yield 4x the ALUs
--> therefore no increase in GPGPU burden, but ditching the hot-clock gives massive benefit in terms of ALU density....
for comparison sake, i dig nvidias pwm designs what nvidia used similar pwm for their cards..This looks like a ddr3 equipped version so probably for GK107-200. Wouldn't be surprising then power draw would be below HD7750 level. Maybe for the other versions need more pwm circuitry?
And if anyone was asking for an example of the blind man and elephant, it'd be hard to top this...
Take a crude number from a marketing slide, don't bother to apply any reasonable correction factors for known chip differences, add a division or two and throw it out to the world as proof. Pointless.
But, hey, numbers don't lie: http://dilbert.com/strips/comic/2007-08-08/
It's just a guess...
- it could be lower if NV's densities aren't as high as AMDs...
I meant that for GK104.It's not THAT close to billion... If you compare GF114 vs. Cypress you get 1950M vs. 2150M (based on http://techreport.com/articles.x/20126) for aproximatly the same die area. So the difference is about 10%.
It is an interesting question, but you're counting much more than just the transistors required for the raw flops and don't take into account the increased TEX and MC logic compared to previous generation. And then make a sweeping statement about just the shaders. So you're mixing apples and oranges to calculate grapes and then compare it to apples (I'm sure there is a better car based metaphor for this.)whitetiger said:So, I was asked to quantify what the GCN vs VLIW4 cost AMD
- i.e. how much did the decision to go for a more flexible architecture GPGPU approach
- and the answer is 22% less raw FLOPS per tranny.
(but raw FLOPs doesn't mean actual performance, obviously)
it would definitely have to require something about engine sizes........ thats been double confirmed.(I'm sure there is a better car based metaphor for this.)
Welcome to my new sig.So you're mixing apples and oranges to calculate grapes and then compare it to apples.
One can make a very elaborate case with the numbers you have and then make what seems like a well reasoned conclusion, but chances are high that the whole exercise resembles the story of blind men describing an elephant based on the particular body part they're touching.
It is an interesting question, but you're counting much more than just the transistors required for the raw flops and don't take into account the increased TEX and MC logic compared to previous generation. And then make a sweeping statement about just the shaders. So you're mixing apples and oranges to calculate grapes and then compare it to apples (I'm sure there is a better car based metaphor for this.)
I think silent_guy's basic point is that a bunch of us are picking necessarily non-unique solutions to a severely under-constrained set of equations that involve a lot of hidden (to us) variables, with error bars of unknown magnitude on coefficients we only have wild-ass-guesses for. This is not the way to make a convincing argument .