Nvidia Volta Speculation Thread

Discussion in 'Architecture and Products' started by DSC, Mar 19, 2013.

Tags:
  1. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    Or it could be a typo.
    Even at 2.0GHz. TitanXP is only 15 TFLOPS vs the 20 TFLOPS of the Titan V. A 33% more performance in favor of the Titan V.
    Some claim the tester was having difficulties with GPU under-utilization and throttling. The performance is there as evident by Gears 4 and Superposition numbers, it's just not manifested well in 3D Mark for some reason. We'll have to wait for a proper review and analysis from a credible site.
     
  2. Ryan Smith

    Regular

    Joined:
    Mar 26, 2010
    Messages:
    629
    Likes Received:
    1,131
    Location:
    PCIe x16_1
    Possible, but likely not. They are consistent for FP64, FP32, and Tensor.
     
  3. CSI PC

    Veteran

    Joined:
    Sep 2, 2015
    Messages:
    2,050
    Likes Received:
    844
    Yeah, which is why I also used your TitanxP values as well.
     
  4. CSI PC

    Veteran

    Joined:
    Sep 2, 2015
    Messages:
    2,050
    Likes Received:
    844
    You can only take both from spec, then calculate clock and TFLOPs from that.
    The Titan V has 21.5% more CUDA cores in the regard you are interested in, both boost clock possibly to same level although you need to note the Titan V is a monster die and will have much greater total power consumption/thermals.
    So that is why like I mention some results will not be that impressive and as some have mentioned they fit that scaling, but other test may utilise the newer arch and will have a much greater benefit when around similar clocks.

    You got a link showing a 20 TFLOPs FP32 figure with the PCIE version of the V100 either Titan or HPC card - I am not yet confident I would fully accept some of what is seen on Reddit and reported by some tools such as EVGA Precision without further validation.
    I still cannot see how it can sustain 2GHz with such a die and that cooler, we really need to see the accurate power consumption figures as well for such a GPU die.
    They rate Titan V the same as most other top Geforce cards at 250W, but the top NVLink V100 is 300W - yeah some of that will be the NVLink but also due to its higher clocks.
    Thanks

    Edit:
    I mean looking at the WhyCry results just now sort of fit with what I am saying from both the CUDA core increase and possibly the detriment in certain cases for the SM doubling, while other tests may be a fair bit more due to benefiting from other aspects of the newer arch.
     
    #884 CSI PC, Dec 9, 2017
    Last edited: Dec 9, 2017
    DavidGraham likes this.
  5. Dayman1225

    Newcomer

    Joined:
    Sep 9, 2017
    Messages:
    77
    Likes Received:
    169
  6. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Priceless. 3k for the card, but free benchmarks only plz. :D I can understand though.

    Three instead of four HMB2 stacks and one memory controller plus associated ROP cluster plus 1.5 MiB L2 tile is gone as well.
     
    #886 CarstenS, Dec 10, 2017
    Last edited: Dec 10, 2017
    Grall likes this.
  7. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    His Deus Ex numbers seem to indicate that Titan V is 25% faster than 1080Ti @Ultra 1440p, as the 1080Ti usually scores around 69fps DX11, the Titan V scores 88fps.
     
    Lightman likes this.
  8. Bondrewd

    Veteran

    Joined:
    Sep 16, 2017
    Messages:
    1,682
    Likes Received:
    846
    So, somewhat inline with increase in ALU count?
     
  9. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    Depending on clocks I guess. We still don't know the max stable boost clocks of the card. And whether it has a more aggressive throttling behavior than Pascal or not.
     
  10. CSI PC

    Veteran

    Joined:
    Sep 2, 2015
    Messages:
    2,050
    Likes Received:
    844
    True but one stack and relevant L2 will not be more than what 10W tops if that when considering they are also clocked lower than 2Gbps spec?
    Look to the Tesla P100 that had both 12GB and 16GB models.
    In the scheme of things core clocks would be more notable, where the NVLink version is a fair bit higher albeit adding to the power demand some as well.
    The boost clock of the Titan V is same as the 300W NVLink2 V100 Mezzanine model, as a side note the Mezzanine model can also be said to have better cooling due to its implementation but one needs to see the performance envelope-power demand to see how important this could be; more so if the card is pushed but it may also impact leakage-static power/efficiency some.
     
    #890 CSI PC, Dec 10, 2017
    Last edited: Dec 10, 2017
  11. pharma

    Veteran

    Joined:
    Mar 29, 2004
    Messages:
    4,887
    Likes Received:
    4,534
  12. Pinstripe

    Newcomer

    Joined:
    Feb 24, 2013
    Messages:
    153
    Likes Received:
    133
    Question: Can the Titan V also play games? :smile:
     
  13. Grall

    Grall Invisible Member
    Legend

    Joined:
    Apr 14, 2002
    Messages:
    10,801
    Likes Received:
    2,176
    Location:
    La-la land
  14. MfA

    MfA
    Legend

    Joined:
    Feb 6, 2002
    Messages:
    7,610
    Likes Received:
    825
    Can tensor cores run at the same time as normal SMs at peak performance? I find it strange the tensor cores aren't promoted for visualization, someone should be able to find a graphics use for the 4x4 matrix flops.
     
  15. nnunn

    Newcomer

    Joined:
    Nov 27, 2014
    Messages:
    40
    Likes Received:
    31
    Then again, when running pure fp32, there's lots of dark silicon to help with cooling (?)
     
  16. CSI PC

    Veteran

    Joined:
    Sep 2, 2015
    Messages:
    2,050
    Likes Received:
    844
    You would have hotspots still, and it is still a 250W/300W die well before 2GHz, context is this is not cooled like the Mezzanine and also using the reference blower, and like I mentioned thermal related efficiencies from power leakage/static power.
    Yeah not quite the same thermal issue as with transistor density but still the power consumption is going to be large due to the core/SM structure increase and a consideration, also for the VRM/power stage.

    The TitanX (reduced core count Pascal version) hit 249W at around 1750MHz to 1775Mhz, I cannot find an accurate one for Titan Xp that has the full core count, which obviously is less than the Titan V but the only context we have is the Mezzanine V100 with its 300W TBP and higher specification core clock than the V100 250W PCIE card.
    But to reiterate this needs to be taken into context with my full posts, not being critical here as I think it is a great GPU for a top Titan model.
     
    #896 CSI PC, Dec 10, 2017
    Last edited: Dec 10, 2017
  17. Megadrive1988

    Veteran

    Joined:
    May 30, 2002
    Messages:
    4,723
    Likes Received:
    242
    Do we yet know how many ROPs the Tesla V100 and Titan V have?

    I'm assuming more than 96.
     
  18. Arun

    Arun Unknown.
    Legend

    Joined:
    Aug 28, 2002
    Messages:
    5,023
    Likes Received:
    302
    Location:
    UK
    I'm thinking of buy a Titan V to use for deep learning, if so I'll probably spend a few days/weeks to write custom microbenchmarks for it. Sadly the current shipping date for the UK website is Dec 30, so I haven't pulled the trigger on it yet :(
     
    pharma likes this.
  19. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    #900 DavidGraham, Dec 11, 2017
    Last edited: Dec 11, 2017
    pharma likes this.
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...