Nvidia Turing Speculation thread [2018]

Discussion in 'Architecture and Products' started by Voxilla, Apr 22, 2018.

Tags:
Thread Status:
Not open for further replies.
  1. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    3,984
    Likes Received:
    34
    The 8% is a bit tongue-in-cheek on my part. Should be at least 20%. Still though, that's a bit underwhelming compared to the last few generational performance deltas. Also with a 754mm^2 die I would expect more CUDA cores, but I understand that this generation is all about shifting the focus towards ray tracing.
     
  2. McHuj

    Veteran Regular Subscriber

    Joined:
    Jul 1, 2005
    Messages:
    1,454
    Likes Received:
    568
    Location:
    Texas
    I just hope the ray tracing performance is enough to be noticeable in games and not just by pixel peepers.
     
    egoless and BRiT like this.
  3. lanek

    Veteran

    Joined:
    Mar 7, 2012
    Messages:
    2,469
    Likes Received:
    315
    Location:
    Switzerland
    Ray tracing....... The day we see a complete game render in raytracing... wake me up... I deal with raytracing since 15 years ... At the moment 3D modeling softwares are moving from raytracing for real time productivity in real time work to other type of engine ( Evee ) ( this way we can models with the same quality we will then render the scene with raytracing ,.... ( 3Ds, Blender etc )... games move to real time raytracing ... yes ...

    Real time viewport .....
     
    #323 lanek, Aug 18, 2018
    Last edited: Aug 18, 2018
    BRiT likes this.
  4. SpaceBeer

    Newcomer

    Joined:
    Apr 15, 2017
    Messages:
    38
    Likes Received:
    14
    Location:
    The Balkans
    And GTX 1070 had ~55% higher clocks, so it's theoretical performance (FP32, pixel and texture rate) are bit higher than 980 Ti's.
     
  5. Clukos

    Clukos Bloodborne 2 when?
    Veteran Newcomer

    Joined:
    Jun 25, 2014
    Messages:
    4,474
    Likes Received:
    3,822
    That TU104 die looks large. The Ti one must be enormous.
     
  6. ImSpartacus

    Regular Newcomer

    Joined:
    Jun 30, 2015
    Messages:
    252
    Likes Received:
    199
    Clukos likes this.
  7. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    2,837
    Likes Received:
    2,670
    I will only say this once, you are extremely on the low side on this.
    Not that high in the end. They were almost the same, but with significantly lower bandwidth on the 1070.
     
    ShaidarHaran likes this.
  8. Rootax

    Veteran Newcomer

    Joined:
    Jan 2, 2006
    Messages:
    1,203
    Likes Received:
    597
    Location:
    France
    Maybe a stupid question, but can the RT cores assist the more traditional "cuda cores" for non-RT compute stuff ? Same thing with tensor cores ?
     
  9. pharma

    Veteran Regular

    Joined:
    Mar 29, 2004
    Messages:
    3,003
    Likes Received:
    1,687
    Not sure this is the case. All articles I've read seem to indicate combined coherent memory access via NVLink is not limited to just compute workloads. Hopefully we should know more on Monday.
    https://www.eetimes.com/author.asp?section_id=36&doc_id=1333598&_mc=RSS_EET_EDT
     
    ShaidarHaran likes this.
  10. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    3,984
    Likes Received:
    34
    Good. I hope so. It's atypical for a new generation to only be marginally faster than the last one. I can see the stated clocks could be way off from reality, as is the case with every Pascal card on the market, but it seems like NV quotes clockspeed based on a set of parameters that I have never encountered in the real world, even before installing watercooling on every card, as I eventually do. When I owned a Titan X (Pascal) it boosted to ~1800MHz with a waterblock installed but no overclocking applied. Same goes for my 1070 (about 1900MHz "stock" boost under water). The thing is, installing waterblocks on Pascal cards had no effect on maximum (pre-overclocked) boost clock in my experience, only sustainability.

    I've owned both 1070 and 980 Ti. I went from 980 Tri-SLI down to single card 980 Ti (because multi-GPU support has gotten worse and worse over the last few years), up to Titan X Pascal (significant upgrade), down to 1070 (made money on the deal and wasn't gaming as much). The 980 Ti and 1070 are indeed, roughly comparable in most workloads. However, my particular 980 Ti was a beast of an overclocker, hitting 1550MHz. My 1070 is only an average overclocker, reaching just over 2GHz. More recent releases tend to run slightly better on the 1070, with older titles being a bit faster on 980 Ti. Even ignoring my watercooling and overclocking I've run across others with similar experience with these cards. It is impressive what performance can be achieved by the 1070 with such low bandwidth, for sure. Pascal is very efficient in its bandwidth utilization.
     
    DavidGraham likes this.
  11. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    3,984
    Likes Received:
    34
    I'll keep an eye on that, thanks. Might actually be worthwhile to pick up 2 cards again if their VRAM pools can be combined. Would be even better if the driver handles this automatically and game devs don't have to jump through hoops to make it work.
     
  12. Communism

    Newcomer

    Joined:
    Feb 1, 2014
    Messages:
    7
    Likes Received:
    1
    NVLink 2 is 25 gBps per direction per connection. Should be fine for SLI, but don't expect miracles.
     
  13. Ike Turner

    Veteran Regular

    Joined:
    Jul 30, 2005
    Messages:
    1,884
    Likes Received:
    1,759
    It will..


    People shouldn't downplay the RT Cores IMO. But they also shouldn't expect anything more that shoddy Gameworks stuff on the gaming side. RTRT in games for useful features that don't require pixel peeping is still years away. But for everything non-gaming Nvidia just brought one hell of a product & I can't wait to grab one for myself for work either a 2080Ti or Quadro RTX 5000 depending on price and performance for what I need.
     
    Rootax likes this.
  14. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    3,984
    Likes Received:
    34
    My desire for more VRAM is not contingent on a specific performance requirement. I simply need more VRAM in a single pool with a certain baseline of performance available. 16GB ought to get the job done.
     
  15. CSI PC

    Veteran Newcomer

    Joined:
    Sep 2, 2015
    Messages:
    2,050
    Likes Received:
    844
    It also needs to be seen just how much impact the evolved SM-TPC-associated graphics related pipeline and processes-stages has on improving performance (also has a revised cache) and additional gains from the RT cores for other functionality.
    However using RT cores beyond their traditional function probably will take time to expose as it will require specific library/functions that do not exist yet; you could access them directly in a similar way to Tensor Cores (that also had limited libary function support to begin with but could use c++ directly) but that is usually more specialist related development-optimisation.
     
  16. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,379
    pharma likes this.
  17. Communism

    Newcomer

    Joined:
    Feb 1, 2014
    Messages:
    7
    Likes Received:
    1
    They are adding both directions for the 2 NVLink 2.0 ports together.

    Also note that my original post already says gBps (gigabytes per second) as denoted by the capitalized B (if that was what was confusing you about my post).
     
  18. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,379
    50 or 100 GB/s: both of those are nothing to sneeze at. ;-) Especially compared to what was available in the past. It won’t allow one to treat memory on the other side as identical to local memory, but it’s 3x better than anything could reasonably have in the past.

    My brain saw the lower case ‘g’ and used that to override the upper case ‘B’!
     
  19. Communism

    Newcomer

    Joined:
    Feb 1, 2014
    Messages:
    7
    Likes Received:
    1
    While great, it's still stuck in the canyon where you cannot simultaneously work on the same "frame" with both GPUs without latency/lag inducing queuing of multiple "frames" of input.

    Probably need ~50% of local memory bandwidth as link bandwidth between dies before that can be a reality (at massive power cost at the very least, not to mention the die area cost).

    Anything less than that and you are still left with AFR 2 with 2+ frames of input latency.
     
  20. pharma

    Veteran Regular

    Joined:
    Mar 29, 2004
    Messages:
    3,003
    Likes Received:
    1,687
    Assuming cards go pre-order on Monday, how long before we see the first reviews? I would imagine one of the things tested will be memory with the cards in NVLInk mode.
     
Loading...
Thread Status:
Not open for further replies.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...