Nvidia BigK GK110 Kepler Speculation Thread

Discussion in 'Architecture and Products' started by A1xLLcqAgt0qc2RyMz0y, Apr 21, 2012.

Tags:
  1. Homeles

    Newcomer

    Joined:
    May 25, 2012
    Messages:
    234
    Likes Received:
    0
    So Nvidia's 551mm2 die manages to use roughly the same power as AMD's 365mm2 die, all while outperforming it by 27%? And now they're getting charge $1000 a board, while making presumably hefty margins on the chip itself?

    What this tells me is that small dies are not the way to go, and Nvidia's efficiency lead is rather massive. I know this doesn't take volume into account, which is a very important part of the equation, but your average consumer doesn't care about volume. The market will see that Nvidia has a massive lead in performance over AMD, and that perception will trickle down and make their cheaper cards seem like the superior alternative, regardless of whether they actually are or aren't.

    So this is a marketing stunt card, really. Where's AMD's marketing stunt card? They're missing out on the limelight.
     
  2. A1xLLcqAgt0qc2RyMz0y

    Veteran

    Joined:
    Feb 6, 2010
    Messages:
    1,589
    Likes Received:
    1,490
    Not if they sell every one they make at $999.
     
    #1222 A1xLLcqAgt0qc2RyMz0y, Feb 21, 2013
    Last edited by a moderator: Feb 21, 2013
  3. lanek

    Veteran

    Joined:
    Mar 7, 2012
    Messages:
    2,469
    Likes Received:
    315
    Location:
    Switzerland
    Huum marketing wise, i find it is the other way around, anyone who look reviews on the research of a new card, who fall on titan review, but will never put 1000$ on this one cards, fall directly on the 7970Ghz ... who cost half the price ( without even include the 3 games Bundle ).
    Its not the poor review made with the first Never Settle drivers who will have make much publicity for AMD cards. I dont really see what AMD should have done with a 1 year card in term of marketing, outside playing on the price.
     
    #1223 lanek, Feb 21, 2013
    Last edited by a moderator: Feb 21, 2013
  4. Homeles

    Newcomer

    Joined:
    May 25, 2012
    Messages:
    234
    Likes Received:
    0
    This isn't about the 7970. It's about Nvidia competing in the razzle-dazzle market segment, while AMD is not.
     
  5. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    It's not going to be any worth-mentioning volume either, so it's rather a moot point.
     
  6. DSC

    DSC
    Banned

    Joined:
    Jul 12, 2003
    Messages:
    689
    Likes Received:
    3
    http://www.hardocp.com/article/2013/02/21/nvidia_geforce_gtx_titan_video_card_review/5

    .
     
  7. DuckThor Evil

    Legend

    Joined:
    Jul 9, 2004
    Messages:
    5,996
    Likes Received:
    1,062
    Location:
    Finland
    The problem I have with these $999 nVidia cards is that they leave too much stuff on the table. 690 only having 2GB per GPU and Titan only allowing 6% increase in power target. I guess it leaves them room to release a 15 SMX part with higher ceiling later on. This price point shouldn't have such compromises imo.
     
  8. UniversalTruth

    Veteran

    Joined:
    Sep 5, 2010
    Messages:
    1,747
    Likes Received:
    22
    #1228 UniversalTruth, Feb 21, 2013
    Last edited by a moderator: Feb 21, 2013
  9. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,245
    Likes Received:
    4,465
    Location:
    Finland
    There's surely a mistake in the graph, there's simply no way Radeons would gain performance going from 2560x1600 > 5760x1080 when the settings are the same like they claim
     
  10. UniversalTruth

    Veteran

    Joined:
    Sep 5, 2010
    Messages:
    1,747
    Likes Received:
    22
  11. DSC

    DSC
    Banned

    Joined:
    Jul 12, 2003
    Messages:
    689
    Likes Received:
    3
  12. jimbo75

    Veteran

    Joined:
    Jan 17, 2010
    Messages:
    1,211
    Likes Received:
    0
    Well by the looks of it 10% of the performance appears to be due to their crafty boost 2 implementation. Let's be generous and say 25% more performance though for 51% larger area.

    Obviously the more shaders you have at lower clocks, the better the TDP will be. I'm not saying AMD would be equal with more shaders at lower clocks but clearly the perf/watt disadvantage would be narrowed.

    To be honest, in the current environment I believe a $1000 card would sell more by being faster and power hungry. I'm quite certain that given the much smaller than anticipated gap in performance, the 7970 will likely have another record breaking February.

    Don't forget this is 8 months after the GHz edition was released. AMD probably killed a part with 10% better performance and maybe 5-10% better power characteristics because it was pointless. Sure Nvidia has a perf/watt lead but it's not massive by any means.
     
  13. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Rather convenient that you should only look at CUDA results where you can't compare it to competitor's card no? Fact is it seems to lose in both OpenCL and OpenCompute often which frankly is a bit disappointing - for some workloads it looks like it doesn't even win with doubles (sisoft sandra results for example). Optimized drivers or not.
    (At least it looks like there's quite some improvement if you're running cryptography-like workloads (more than twice as fast as GTX 680) but it's still only roughly half as fast there as a 7970 GE so still definitely not the card for bitmining...)
    When GK104 was sort of weak at compute, everybody was saying well it's not optimized for that. But I don't really see GK110 doing much better there relatively (with the exclusion of the already mentioned cryptography stuff, I'm curious actually why that's running faster), and given the SMX look nearly identical this can't be a surprise.
     
  14. UniversalTruth

    Veteran

    Joined:
    Sep 5, 2010
    Messages:
    1,747
    Likes Received:
    22
    Have you thought about driver-related phenomenon?

    If they killed it, then what would they launch in Q4? Another chip with +10% performance gain on top the former +5-10%?
     
  15. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    Register bandwidth per flop and register count per work-item are more useful measures though. Register count per flop doesn't really tell you much about the architecture's ability to achieve peak utilization.
     
  16. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    As far as I can tell, single precision efficiency is not improved at all on GK110 vs GK104. Double precision gets a massive boost and of course there's the 50% bandwidth increase.

    However, there's no reason to believe GK110 will fare better than GK104 in single-precision, math-intensive workloads.
     
  17. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Since I now can talk about it: GK110 is even slightly worse. Best case is about equal, but in medium instruction sequences, there's a slight loss. My numbers are not verified yet, so take them with a grain of salt.
     
  18. Alexko

    Veteran Subscriber

    Joined:
    Aug 31, 2009
    Messages:
    4,541
    Likes Received:
    964
    Well, once again, Kepler sacrifices registers/cache for flops, and sometimes that hurts performance a good bit. If you rewrite programs to take this under account, you can achieve very good results. Example: Understanding the Efficiency of Ray Traversal on GPUs – Kepler and Fermi Addendum. Clearly, this requires extra work and may not always be possible, but Kepler does have the potential for very high GPGPU performance.

    GCN is probably easier to use, but since Teslas are outselling FirePros (to be best of my knowledge) the general feeling must be that NVIDIA's software is better. Whether things will remain that way is another story.

    But you can't just look at register count per work-item either, you have to look at the total number of work-items as well; and that didn't increase much, not nearly as much as flop power.

    Besides, it's just an upper bound, in practice you're often better off trying to increase IPC per work-item rather than increasing the number of work-items (it's not as straightforward, but yields better results). See Vasily Volkov's work, e.g. this: Better Performance at Lower Occupancy. And doing this is not easy if you're short on registers.
     
  19. jimbo75

    Veteran

    Joined:
    Jan 17, 2010
    Messages:
    1,211
    Likes Received:
    0
    Tech Report had Titan at 33W more than the GHz Edition -

    [​IMG]

    It's really almost impossible to tell tbh. Anandtech had Titan almost 50W higher...is anyone doing decent power tests on these?
     
  20. Tridam

    Regular Subscriber

    Joined:
    Apr 14, 2003
    Messages:
    541
    Likes Received:
    47
    Location:
    Louvain-la-Neuve, Belgium
    Nope it's not a golden sample, it's actually linked to the way boost behaves on Titan.

    If I measure power in Anno after 30s I get 220W but the cooling system (including the way it is calibrated) is not able to maintain 80°C with such a power draw (unless the system is in the fridge of course). After 5 minutes power drops to 180W. If I add extra cooling around the board power goes up to 200W.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...