NVIDIA Kepler speculation thread

Discussion in 'Architecture and Products' started by Kaotik, Sep 21, 2010.

Tags:
  1. A1xLLcqAgt0qc2RyMz0y

    Veteran

    Joined:
    Feb 6, 2010
    Messages:
    1,589
    Likes Received:
    1,490
    Yet many posters here have stated that Nvidia could never improve performance/watt on the same process node.

    Just wanted to show that they can and did.
     
  2. PSU-failure

    Newcomer

    Joined:
    May 3, 2007
    Messages:
    249
    Likes Received:
    0
    Nobody with half a brain could ever say that.

    nVidia couldn't improve perf/watt and perf/mm² to the point GF110 went past Cypress (well... they could, with some magic), but with such a low performance they were bound to improve.

    Kepler will probably be Fermi's last iteration, just like GT200 was G80's one, so I don't think they will do considerably better than GF110 (which is in fact a fixed GF100, with redesigned TMUs and power grid). Maxwell should be the real next arch and improve these, but it should be noted that perf/mm² will stay low if they continue to follow the half-rate DP + ECC route (assuming AMD doesn't improve DP rate and/or DP doesn't get over-used).

    About the ASCII art, I see nVidia doing incremental improvements and periodical overhauls: G8x gave birth to GT200, then GF100 to Kepler, then Maxwell to its successor. G9x are in fact G8x, just like GF104 and GF110 are GF100 (well, GF104 is already further from GF100 than G92 to G80, it's almost as far as GT200 was).

    It's safe to assume AMD went the same route, they just have one/two more iteration(s) of incremental improvements: R580 => (R500 ->) R600 -> R700 -> R800 => new arch.
     
    #142 PSU-failure, Nov 11, 2010
    Last edited by a moderator: Nov 11, 2010
  3. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Well it's on 28nm which should help some. Of course it will help AMD too :)
    If you compare GF104 (which has neither half-rate DP nor ECC) with GF110, it doesn't look that to me like these features cost them that much, neither in perf/area nor perf/power - I'm sure it does make SOME difference, just not that much.
     
  4. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    If nVidia is to be believed it was the parallelization of geometry processing that cost the most transistors in Fermi. I figure Kepler's and Maxwell's success hinge on:

    1. Developers buying into nVidia's "geometry is king" message.
    2. AMD being unable to match their geometry performance in a more efficient way.

    Cayman hopefully can answer #2 in a few weeks.
     
  5. A1xLLcqAgt0qc2RyMz0y

    Veteran

    Joined:
    Feb 6, 2010
    Messages:
    1,589
    Likes Received:
    1,490
    Well try rereading this thread from the beginning there are more than a few 1/2 brains posts.

    nVidia's Kepler is going after the HPC and Professional market where other features such as ECC and Cache are as important as DP throughput. Those take transistors and thus power. Comparing AMD's gaming GPU ppw against a HPC GPU ppw is not valid.
     
  6. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    It's a valid comparison because that's the product nVidia chose to offer to the gaming community. They don't get a free pass because useless transistors are burning power - that was their decision not ours. However, you're right that we need to take that into consideration when talking about ppw.
     
  7. PSU-failure

    Newcomer

    Joined:
    May 3, 2007
    Messages:
    249
    Likes Received:
    0
    Well, as I said, GF104 is already (almost?) as far from GF100 as GT200 was from G80, so I don't know if we could compare directly.

    Scheduling/dispatch logic is different, there are 33% more TMUs and some areas (mainly L2) simply don't scale with anything else than the removing of ECC.


    Then compare GF100/110 and GF104, and laugh.

    GF104 isn't better than GF110 despite not having these "useless" elements.
     
  8. A1xLLcqAgt0qc2RyMz0y

    Veteran

    Joined:
    Feb 6, 2010
    Messages:
    1,589
    Likes Received:
    1,490
    No it is not a valid comparison . When AMD has a HPC GPU then you can compare. And as of now AMD has officially stated they will enter the HPC market when they see it as viable. Who knows when that will be.
     
  9. A1xLLcqAgt0qc2RyMz0y

    Veteran

    Joined:
    Feb 6, 2010
    Messages:
    1,589
    Likes Received:
    1,490
    Why should I laugh. I will smile though as the GF110 is much improved in ppw over the GF100. It will be interesting the see the new Tesla that will come out next year based on the GF110. When that product arrives ppw will be very much improved over the current Tesla's.
     
  10. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    What do you mean? AMD not having an HPC GPU doesn't have anything to do with Geforce v Radeon or the power consumption of Geforce cards. We can all understand why Geforces pull more power but that doesn't change the fact that they do.
     
  11. Sxotty

    Legend

    Joined:
    Dec 11, 2002
    Messages:
    5,496
    Likes Received:
    866
    Location:
    PA USA
    PPW is still not that important for most folks unless you run into HVAC problems b.c of it. PP$ is much more important as evidenced by buyers habits.
     
  12. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    This is true though you'd think that without a need to cater for anything compute, those changes would benefit graphic workloads (that is, perf/area would be higher for these workloads). Though I can't really see much of that. Well certainly gt2xx failed on that front too.
    So the questions still unanswered I have are:
    - How big would a GF110 be without the "unnecessary" stuff, i.e. without ECC and slower DP mostly?
    - How would a "3/4" GF110 (that is, 3 GPC, 12SM, 384 "cores", 256bit/32 ROPs, and without the "useless" stuff) compare to a (fully enabled) GF104, in terms of die size (or transistor count) and performance?
     
  13. PSU-failure

    Newcomer

    Joined:
    May 3, 2007
    Messages:
    249
    Likes Received:
    0
    Who told you nVidia will release "GTX580" Tesla 2.5 versions?

    HPC is a SLOW market, nothing to do with the gaming market and so the Tesla line hasn't to be updated before the next step (probably Kepler, just like with Tesla 0.8 and 1). There will be GF110-based Tesla 2 for sure, but I don't see any reason to do such an incremental upgrade.


    Look at the entire lineup, you'll see only one chip is barely competitive (perf, power and size), and you'll probably find nVidia's issue is the chips' being too big for their perf level. Static power seems ultra-low (awesome work there), but then come clock distribution and other fun issues still related to die size.
     
  14. MDolenc

    Regular

    Joined:
    May 26, 2002
    Messages:
    696
    Likes Received:
    446
    Location:
    Slovenia
    GT200 had a seperate DP unit. Fermi doesn't have that. Yet DP implementation on Fermi is cheap compared to GT200. Not all SP-s are created equal on Fermi, and that's also the reason behind GF104 DP performance.

    "3/4" GF110 would win geometry wise and lose pixel fillrate wise. Die size would probably be so so. And generally for such a chip you should not be that worried about geometry throughput.
     
  15. Megadrive1988

    Veteran

    Joined:
    May 30, 2002
    Messages:
    4,723
    Likes Received:
    242


    Me too.

    G80 = NV50
    GT200 = NV55
    Fermi = NV60
    Kepler = NV65
    Maxwell = NV70
     
  16. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    G80=NV50? So what's G70/1?
     
  17. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,245
    Likes Received:
    4,465
    Location:
    Finland
  18. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    No! :D
     
  19. no-X

    Veteran

    Joined:
    May 28, 2005
    Messages:
    2,455
    Likes Received:
    471
    NV47. NV45 was NV40+PCIe bridge.
     
  20. Megadrive1988

    Veteran

    Joined:
    May 30, 2002
    Messages:
    4,723
    Likes Received:
    242
    G70 / G71 / and PS3's RSX = NV47
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...