Nvidia Volta Speculation Thread

Discussion in 'Architecture and Products' started by DSC, Mar 19, 2013.

  1. Samwell

    Newcomer

    Joined:
    Dec 23, 2011
    Messages:
    149
    Likes Received:
    183
    That sounds like too small an increase for 7nm, and the timeframe is very early for 7nm, since they wrote early samples in late Q1 and availability in H2. "Post-Volta" could just mean the gaming-oriented architecture of Volta. Maybe they now call something like going from 128 SPs to 64 SPs per SM a different architecture, or there are now more changes between the compute and gaming architectures. There might be a lot of marketing in it. Xavier has 30 TOPS in 30W with a dedicated deep learning accelerator, or 10 TOPS at 30W with tensor cores. It might be that they just build a bigger DL accelerator into the next GPU. Without more details it's hard to guess.
     
  2. LiXiangyang

    Newcomer

    Joined:
    Mar 4, 2013
    Messages:
    87
    Likes Received:
    48
    Not necessarily an improvement. A self-driving DL computer in a car only needs to apply a pre-trained DL model to make predictions (inference) rather than train the network itself, so the 130 TOPS could very well be very low-precision stuff like int8 or even lower, just like GP102 can do nearly 50T DL ops while GP100 can only do 22T DL ops.

    Maybe Nvidia will install some int8 tensor cores on their Volta GeForce product line.
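
To illustrate why int8 figures get quoted for inference: the weights and inputs of a pre-trained layer can be mapped to 8-bit integers plus a scale factor, with the multiply-accumulate done in integer arithmetic. Below is a minimal NumPy sketch of symmetric per-tensor quantization. The function names and example values are made up for illustration, and it does not claim to show how GP102 or any other Nvidia part implements int8 internally.

```python
import numpy as np

def quantize_int8(x):
    """Symmetric per-tensor quantization to int8.

    Assumes x contains at least one non-zero value (otherwise scale is 0).
    Returns the int8 array and the float scale needed to map it back.
    """
    scale = float(np.max(np.abs(x))) / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def int8_dot(a, b):
    """Dot product computed on int8 values with an int32 accumulator."""
    qa, sa = quantize_int8(a)
    qb, sb = quantize_int8(b)
    acc = np.dot(qa.astype(np.int32), qb.astype(np.int32))
    # Rescale the integer result back to the original value range.
    return float(acc) * sa * sb

# Hypothetical example values, not taken from any real network.
weights = np.array([0.50, -0.25, 0.125, 1.00], dtype=np.float32)
inputs = np.array([1.0, 2.0, 4.0, 0.5], dtype=np.float32)

exact = float(np.dot(weights, inputs))
approx = int8_dot(weights, inputs)
print("fp32 result:", exact)
print("int8 result:", approx)
```

The int8 result is close to the FP32 one but not identical, which is generally acceptable for inference and much less so for training.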
     
  3. CSI PC

    Veteran

    Joined:
    Sep 2, 2015
    Messages:
    2,050
    Likes Received:
    844
    Looks like Baidu and Nvidia have been making some big improvements with FP16 training using gradient scaling for accuracy and memory resource improvements; this is used with the Volta Tensor cores and libraries as the techniques are too slow otherwise due to the steps/cycles involved.

    https://devblogs.nvidia.com/parallelforall/mixed-precision-training-deep-neural-networks/
    Baidu/Nvidia paper on the topic: https://arxiv.org/pdf/1710.03740.pdf

    They have shown this working well now for a couple of Baidu applications, which seems like a pretty important milestone.
    Cheers
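
As a rough illustration of the gradient (loss) scaling idea from the linked post: very small gradient values underflow to zero when stored in FP16, so the loss is multiplied by a constant before backpropagation, and the resulting gradients are divided by the same constant in FP32 before the weight update. The sketch below uses NumPy only. The gradient value 1e-8 and the scale factor 1024 are arbitrary example numbers, not figures from the paper, and the function names are hypothetical.

```python
import numpy as np

def store_fp16(value):
    """Store a value in half precision, as a plain FP16 backward pass would."""
    return np.float16(value)

def store_fp16_with_scaling(value, loss_scale):
    """Scale before the FP16 store, then unscale in FP32."""
    scaled = np.float16(value * loss_scale)
    return np.float32(scaled) / np.float32(loss_scale)

# Assumed example numbers: a tiny gradient and a power-of-two scale factor.
true_grad = 1e-8
loss_scale = 1024.0

unscaled = store_fp16(true_grad)            # below the FP16 range, flushes to zero
recovered = store_fp16_with_scaling(true_grad, loss_scale)

print("FP16 without scaling:", float(unscaled))
print("FP16 with scaling, unscaled in FP32:", float(recovered))
```

Without scaling the gradient is lost entirely; with scaling it survives the FP16 round trip with a small rounding error.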
     
    silent_guy, xpea and pharma like this.
  4. Grall

    Grall Invisible Member
    Legend

    Joined:
    Apr 14, 2002
    Messages:
    10,801
    Likes Received:
    2,176
    Location:
    La-la land
    Skynet is one step closer... Yayy! :p
     
    Lightman and el etro like this.
  5. xpea

    Regular

    Joined:
    Jun 4, 2013
    Messages:
    551
    Likes Received:
    783
    Location:
    EU-China
    #665 xpea, Oct 15, 2017
    Last edited: Oct 16, 2017
    CSI PC, nnunn, DavidGraham and 4 others like this.
  6. pharma

    Veteran

    Joined:
    Mar 29, 2004
    Messages:
    4,891
    Likes Received:
    4,539
  7. Infinisearch

    Veteran

    Joined:
    Jul 22, 2004
    Messages:
    779
    Likes Received:
    146
    Location:
    USA
    Am I missing anything regarding the claimed performance enhancements with Volta?
    1. Twice the perf/W
    2. Better L1 cache performance and size (some graph comparing LDS vs. the new L1)
    3. Reduced latency for dependent back-to-back ALU instructions.

    Anything else you can think of?
     
  8. Ext3h

    Regular

    Joined:
    Sep 4, 2015
    Messages:
    428
    Likes Received:
    497
    Given the price estimates for the Volta ASIC (around $2000-$3000 at full discount, or around that magnitude I think), and given how over-engineered that thing is, does anyone actually believe that we are going to see Volta-based GeForce cards? Especially given the complete lack of any such announcements?

    If not, what else? Possibly a Pascal shrink instead? GP200 series?

    It's not like Nvidia would need much to decisively take the performance crown again, but there should be sufficient headroom with a more recent node to improve perf/W by quite a bit.

    If we are actually getting different architectures for the GV100-based Tesla cards and the (possibly) GP200-based GeForce cards, it would also become unlikely that Nvidia would release a GV100-based Titan card.



    Besides, I would be very careful with all the numbers Nvidia publishes regarding neural network performance of the Volta cards, especially if the word "TensorRT" happens to be hidden somewhere in the footnotes. That essentially means it wasn't the same network but a minimized one (layers combined, near-zero weights eliminated, reduced precision in all parts where possible), whereas the CPU "reference" had to execute the full network.

    Apply the same basic minimization methods to the network executed on the CPU, and I severely doubt whether the GV100 could still claim more than a 5-10x speedup at most.
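
One of the minimization steps mentioned above, eliminating near-zero weights, can be sketched as follows. This is a plain NumPy illustration with made-up weights and an arbitrary threshold. It is not TensorRT's API and makes no claim about what TensorRT does internally; the point is only that the same trick is hardware-independent and could be applied to a CPU reference as well.

```python
import numpy as np

def prune_small_weights(w, threshold):
    """Return a copy of w with entries of magnitude below threshold set to 0."""
    pruned = w.copy()
    pruned[np.abs(pruned) < threshold] = 0.0
    return pruned

# Hypothetical 2x3 layer and input vector, chosen only for illustration.
weights = np.array([[0.80, 0.001, -0.50],
                    [-0.002, 0.30, 0.0005]], dtype=np.float32)
x = np.array([1.0, 2.0, 3.0], dtype=np.float32)

pruned = prune_small_weights(weights, threshold=0.01)

full_out = weights @ x      # output of the full layer
pruned_out = pruned @ x     # output after dropping near-zero weights
zeroed = int(np.count_nonzero(pruned == 0.0))

print("weights set to zero:", zeroed, "of", weights.size)
print("full output:  ", full_out)
print("pruned output:", pruned_out)
```

Half of the weights drop out here while the outputs barely move, so a sparse-aware implementation has less work to do on any hardware.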
     
  9. Infinisearch

    Veteran

    Joined:
    Jul 22, 2004
    Messages:
    779
    Likes Received:
    146
    Location:
    USA
    Isn't that for V100? As in it has tensor units. They have worked on the shader performance as well as tensor units so why would they waste all that R&D money by not releasing gaming cards based on the architecture? And didn't they say they had big things in store for the graphics side of things? (don't remember the exact quote)
     
  10. pharma

    Veteran

    Joined:
    Mar 29, 2004
    Messages:
    4,891
    Likes Received:
    4,539
    I think people have started focusing on performance numbers outside Nvidia's influence, and so far researchers' results seem to align with Nvidia's Volta statements. Comparisons at this stage seem to be primarily against P100, or whatever they used previously.
     
  11. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,382
    4. Increased physical clock speed of the HBM memory
    5. Increased efficiency of the HBM memory controller compared to P100
    6. Separate integer execution units as opposed to shared with FP32
     
    nnunn likes this.
  12. Samwell

    Newcomer

    Joined:
    Dec 23, 2011
    Messages:
    149
    Likes Received:
    183
    Of course there won't be a GV100-based Titan. This is a pure HPC chip like GP100. A Titan would only be possible with a GV102. But going by all the info published with Drive PX Pegasus, I agree that there won't be Volta-based GeForce cards. The GeForce cards will get the post-Volta architecture used in Pegasus.
     
  13. Infinisearch

    Veteran

    Joined:
    Jul 22, 2004
    Messages:
    779
    Likes Received:
    146
    Location:
    USA
    Thanks for the input. What are the chances that any of the Volta graphics cards will have HBM?
     
  14. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,382
    With GDDR6 on the horizon? Quite low, I think.

    At some point, I expect there to be a Volta HBM based Quadro version, just like for GP100. Does that count?
     
  15. seahawk

    Regular

    Joined:
    May 18, 2004
    Messages:
    511
    Likes Received:
    141
    Do you see many similarities between a GP104 and a GP100? It won't be much different for Volta.
     
  16. Infinisearch

    Veteran

    Joined:
    Jul 22, 2004
    Messages:
    779
    Likes Received:
    146
    Location:
    USA
    Volta has tensor cores per SM, so the layout of things in the SM might have to be significantly different between GV100 and GV102/104/106/107.
     
  17. Samwell

    Newcomer

    Joined:
    Dec 23, 2011
    Messages:
    149
    Likes Received:
    183
    The gaming architecture will also have tensor cores. For compatibility and for devs there will be at least a small number of them, as is the case with DP on consumer cards at the moment. The chip on Pegasus has a high number of TCs, so there will even be GPUs with a lot of tensor cores, but maybe with lower precision than the V100 tensor cores.
     
  18. Infinisearch

    Veteran

    Joined:
    Jul 22, 2004
    Messages:
    779
    Likes Received:
    146
    Location:
    USA
    Double precision is fully programmable, while tensor cores are fixed-function units aimed at AI. There is likely zero chance they'll be in consumer cards, or even Quadros for that matter.
     
  19. Infinisearch

    Veteran

    Joined:
    Jul 22, 2004
    Messages:
    779
    Likes Received:
    146
    Location:
    USA
    Yeah, it counts, but I was more focused on consumer cards when I asked. I was hoping to see HBM2 on the new Titan and Ti models at least.
     
  20. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,244
    Likes Received:
    4,465
    Location:
    Finland
    I think he was referring to just that: GP104 and GP100 are already a world apart, and GV104 and GV100 will be at least just as far apart, if not more.
     
    ImSpartacus and nnunn like this.

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.