NVIDIA Maxwell Speculation Thread

Discussion in 'Architecture and Products' started by Arun, Feb 9, 2011.

Tags:
  1. Picao84

    Veteran

    Joined:
    Feb 15, 2010
    Messages:
    2,109
    Likes Received:
    1,196
    If it is, can we stop the babbling about GM107 being a Kepler derivative? Is the change in the number of CUDA cores per SMX enough to call it a Maxwell now?
     
  2. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    960 vs 640SPs don't make much difference?

    Exactly why I consider a 640 config heaven sent; now we'll of course see theories in the wild that they cut out SPs last minute :p
     
  3. iMacmatician

    Regular

    Joined:
    Jul 24, 2010
    Messages:
    797
    Likes Received:
    223
    Probably not, after all, the GF100 and GF104 have different numbers of CCs per SM.

    [Clarification: I think the GM107 a legitimate Maxwell architecture chip. I don't think that those who believe the GM107 is a Kepler refresh will change their mind even if the GM107 ends up having 640 CCs, for the reason above.]

    Also, PedantOne in the XS thread has posted some purported pictures including one showing Hynix H5GC4H24MFR-T2C memory. According to this data sheet the memory could be rated 6.0 Gbps or 5.0 Gbps, which doesn't really tell us much (but is good to know).
     
    #823 iMacmatician, Feb 11, 2014
    Last edited by a moderator: Feb 11, 2014
  4. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    So let me get this right: someone else had to challenge the bloke in order to check the supposed relevant documentation and find out that there are all of the sudden 33% less SPs in the real chip? *raises eyebrow*
     
  5. Picao84

    Veteran

    Joined:
    Feb 15, 2010
    Messages:
    2,109
    Likes Received:
    1,196
    LOL!

    :runaway:

    Of course 960 vs 640 SPs makes a difference... Given the alleged performance of these 640 SPs, faster than the 768 SPs GTX650Ti, they increased performance per SP. Although not by much? We still need to know how much power consumption it consumes exactly though...
     
  6. Picao84

    Veteran

    Joined:
    Feb 15, 2010
    Messages:
    2,109
    Likes Received:
    1,196
    True, but then again all Kepler had this number constant throughout the family including GK208A. Why would they changed it just for one chip?
     
  7. tviceman

    Newcomer

    Joined:
    Mar 6, 2012
    Messages:
    191
    Likes Received:
    0
    Could have sworn that I've read from multiple reputable sites that TK1's GPU is "up to 1ghz." And after you move past the memory bus, which is 1/2 GM107 and not 1/5, memory bandwidth is entirely based on the power constraints of the ram. I think you're trying to create correlations to match up product features, when there really is no real correlation.

    Nvidia bifurcated Kepler more so than they did with Fermi. GK104 and it's derivatives were more graphics focused than GF104 and it's derivatives and stripped of more HPC-oriented functionality. Consequently, GK104 and GK106 had a die size reduction over their predecessors. GK110 brought new, exclusive compute features with it, and as a result increased in die size over it's predecessor. From what I'm deducing, Nvidia is continuing this strategy. I have no idea if Nvidia plans to implement ARM cores into anything other than the flagship Maxwell die, but I am fairly confident they will continue to bifurcate their product line, like they started doing with Fermi and even more so with Kepler.

    Okay it's not stripped to the bone, but it's definitely stripped down.
    Anand Li Shimpi says:
    I'm not saying you are definitley wrong, I just don't think you are right. :p I'm standing by theory that Nvidia was ready with Maxwell significantly before TSMC was able to deliver 20nm at reasonable costs, so Nvidia
     
  8. AnarchX

    Veteran

    Joined:
    Apr 19, 2007
    Messages:
    1,559
    Likes Received:
    34
    GM107 and GM108 are no Keplers. If they have compute capability 5.0 there are probably some radical changes (4.x is missing) in the architecture.
     
  9. A1xLLcqAgt0qc2RyMz0y

    Veteran

    Joined:
    Feb 6, 2010
    Messages:
    1,589
    Likes Received:
    1,490
    It is the same memory but runs at different speeds depending on the supply voltage.

    1.5V = 6.0Gbps

    or

    1.35V = 5.0Gbps

    Factory overclocked boards will probably run at 1.5V
     
  10. iMacmatician

    Regular

    Joined:
    Jul 24, 2010
    Messages:
    797
    Likes Received:
    223
    The "roadmap feature" of Kepler, dynamic parallelism, didn't even show up until GK110. I wouldn't be surprised if GM107/GM108 have "significantly" fewer features than the GM200 (the "through and through" Maxwell?) or even the other GM20x chips, but that doesn't make the GM10x chips "just" a Kepler refresh.
     
  11. Picao84

    Veteran

    Joined:
    Feb 15, 2010
    Messages:
    2,109
    Likes Received:
    1,196
    Hey, I am not saying they are keplers, quite the contrary.:razz:
     
  12. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    We all know that what really counts is what comes out at the other end and that any unit != any unit.

    It's just strange that those in the "know" so far didn't notice that there's something different.
     
  13. Picao84

    Veteran

    Joined:
    Feb 15, 2010
    Messages:
    2,109
    Likes Received:
    1,196
    [pararel topic]
    Hence my doubt about Videocardz having any kind of source other than chinese forums...
    [/paralel topic]
     
  14. ams

    ams
    Regular

    Joined:
    Jul 14, 2012
    Messages:
    914
    Likes Received:
    0
    The math is what it is. Based on the leaked rumors, 750 Ti has 5x more CUDA cores, 5x more pixel fill rate [ROP throughput], and 5x more mem. bandwidth compared to the TK1 GPU, period. That is undeniable. Unit counts obviously don't need to increase by 5x across the board to achieve this, nor would it be area efficient to do so. Now, if you think that the leaked specs are wrong, you are entitled to your opinion, but that is all we have to analyze at the moment.

    FWIW, the TK1 GPU has been specified at up to 365 GFLOPS throughput which would imply up to 951-952MHz GPU clock operating frequency.

    Most GPU's are "stripped down" in one form or another. On a fundamental level, the TK1 GPU is a Kepler GPU, and halving TMU count per SMX and halving ROP count per 32-bit mem. channel doesn't change that. For it's power envelope, the TK1 GPU would be just as well balanced as the purported GTX 750 Ti.

    My comments are related to the purported leaked specs only. If the leaked specs are incorrect, then obviously anything goes.
     
    #834 ams, Feb 11, 2014
    Last edited by a moderator: Feb 11, 2014
  15. tviceman

    Newcomer

    Joined:
    Mar 6, 2012
    Messages:
    191
    Likes Received:
    0
    Well looks like there is mounting evidence that the smx is considerably different, and configured with 128 CC's instead of 192. I am going with Maxwell. ;)
     
  16. dnavas

    Regular

    Joined:
    Apr 12, 2004
    Messages:
    375
    Likes Received:
    7
    Didn't anandtech have a rumor that part of Maxwell was that dp could also be used for sp instructions?

    http://forums.anandtech.com/showthread.php?t=2346062
    Maybe they removed a third of the sps, so there's 128 normal sp alus, and 64 dp alus which can be used as sp alus? They could also choose to bifurcate and not include dp at all. Does the rumored performance align with 640 alus?
     
  17. DSC

    DSC
    Banned

    Joined:
    Jul 12, 2003
    Messages:
    689
    Likes Received:
    3
    Since Fermi, all GPUs have 64bit units, just enough to debug and run 64bit code. Since Nvidia wants to sell a lot of Teslas, they made it so you can run and debug 64bit code on GeForce with capped performance of course.
     
  18. ams

    ams
    Regular

    Joined:
    Jul 14, 2012
    Messages:
    914
    Likes Received:
    0
    If that is truly the case, then I would be really surprised, because I didn't expect any heavily rearchitected Maxwell GPU's to be announced until GTC 2014 at the end of March. Note that each rearchitected CUDA core for 750 Ti would need to be capable of much more work (at least 50% more) than a Kepler CUDA core in order for that GPU to be a worthy successor to 650 Ti.
     
    #838 ams, Feb 11, 2014
    Last edited by a moderator: Feb 11, 2014
  19. dnavas

    Regular

    Joined:
    Apr 12, 2004
    Messages:
    375
    Likes Received:
    7
    Makes sense. What if the difference between Tesla and commodity is Denver? If the area difference for 64 dp vs sp alus per smx is small enough, there'd be no reason to castrate 64bit performance. Aren't they at a competitive disadvantage to amd as things currently stand (wrt dp)?

    50% more? Huh. ;^/
     
    #839 dnavas, Feb 11, 2014
    Last edited by a moderator: Feb 11, 2014
  20. DSC

    DSC
    Banned

    Joined:
    Jul 12, 2003
    Messages:
    689
    Likes Received:
    3
    Nvidia doesn't really care, Quadro and Tesla get uncapped 64bit performance since they have high margins, GeForce gets capped 64bit performance, TITAN being an exception.

    This of course only affects parallel 64bit compute performance, I don't think the Denver core does much for 64bit performance since CPUs do serial workloads best, GPUs parallel workloads.
     
    #840 DSC, Feb 11, 2014
    Last edited by a moderator: Feb 11, 2014
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...