Will GPUs with 4GB VRAM age poorly?

Discussion in 'Architecture and Products' started by DavidGraham, Jul 10, 2016.

  1. sebbbi

    Veteran

    Joined:
    Nov 14, 2007
    Messages:
    2,924
    Likes Received:
    5,296
    Location:
    Helsinki, Finland
    That NV quote seems to be discussing a different technique, called spatial checkerboarding. Rainbow Six Siege is using temporal checkerboarding. Spatial checkerboarding is a single frame technique (no reprojection). It results in native edge sharpness, but shading (inside surfaces) is 2x lower resolution -> textures and lighting both look low res.

    Temporal checkerboarding reprojects last frame content and combines that with current frame content. As long as the reprojection succeeds, this results in very close to native results (and 100% perfect result when camera is still). Temporal checkerboarding doesn't require native resolution depth buffer (or id buffer) like spatial checkerboarding, because the temporal version can combine color and depth data from two frames (alternating checkerboard pattern = 100% pixels).
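A minimal sketch of the "alternating checkerboard pattern = 100% pixels" point above, for the still-camera case only (so no reprojection is needed). The function names and the tiny 4x4 image are made up for illustration, not taken from any engine:

```python
def shaded_this_frame(x, y, parity):
    """True if pixel (x, y) belongs to the checkerboard half shaded this frame."""
    return (x + y) % 2 == parity

def render_half(native, parity):
    """Simulate a checkerboard frame: shade only one parity, leave the rest empty."""
    return [[px if shaded_this_frame(x, y, parity) else None
             for x, px in enumerate(row)]
            for y, row in enumerate(native)]

def combine(prev_half, curr_half):
    """Take current-frame pixels where shaded, otherwise reuse last frame's pixel."""
    return [[c if c is not None else p for p, c in zip(prev_row, curr_row)]
            for prev_row, curr_row in zip(prev_half, curr_half)]

# Hypothetical "native" image standing in for a fully shaded frame.
native = [[y * 4 + x for x in range(4)] for y in range(4)]

prev_half = render_half(native, parity=0)  # frame N-1 shades one half
curr_half = render_half(native, parity=1)  # frame N shades the other half
full = combine(prev_half, curr_half)
```

With a moving camera the `prev_half` lookup would have to go through motion vectors instead of reading the same pixel position, which is where reprojection can fail.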

    I don't see any reason why SSAO should be noticeably lower quality with temporal checkerboarding. But I see lots of potential (difficult to solve) implementation reasons that might cause this issue.
     
    dogen, Lightman, OlegSH and 1 other person like this.
  2. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    797
    Likes Received:
    1,624
    Nope, it's about temporal checkerboarding.
    I just find the term "upscaling" confusing when applied to 2xMSAA temporal checkerboard rendering, even though almost every press article out there describes it exactly as "upscaling".

    With checkerboard rendering, the MSAA samples are mapped onto output-resolution pixel positions so that each 2x MSAA sample sits at the centroid of an output-resolution pixel. Half of the pixels are obviously missing because the MSAA is 2x, hence the checkerboard pattern across the screen. But the press for whatever reason decided to call this "upsampling", even though there is no upsampling involved in the traditional sense.
     
    #182 OlegSH, Dec 1, 2016
    Last edited: Dec 1, 2016
    DavidGraham and pharma like this.
  3. sebbbi

    Yes, temporal checkerboard does no upsampling. Spatial checkerboard does. Scanline interlaced rendering isn't upsampling either (odd/even scanlines interleaved). However, both of these temporal techniques also fall back to upsampling when the reprojection fails. In this case checkerboard interpolates from the "+" neighborhood (both X and Y neighbors = 4 neighbors), and scanline interleaving interpolates from +-Y neighbors (= 2 neighbors). This is one of the reasons why checkerboard is better than scanline interleaving.
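A rough sketch of the two fallback interpolations described above, assuming a plain average of the available neighbors (real implementations likely use smarter weighting). The function names and the 3x3 test image are hypothetical:

```python
def average(values):
    return sum(values) / len(values)

def fill_checkerboard(img, x, y):
    """Interpolate a missing pixel from its "+" neighbors (left, right, up, down)."""
    h, w = len(img), len(img[0])
    neighbors = [img[ny][nx]
                 for nx, ny in ((x - 1, y), (x + 1, y), (x, y - 1), (x, y + 1))
                 if 0 <= nx < w and 0 <= ny < h]
    return average(neighbors)

def fill_scanline(img, x, y):
    """Interpolate a missing pixel from its +-Y neighbors only (rows above and below)."""
    h = len(img)
    neighbors = [img[ny][x] for ny in (y - 1, y + 1) if 0 <= ny < h]
    return average(neighbors)

# Hypothetical image with a one-pixel-tall bright horizontal line in the middle row.
# The true value at (1, 1) is 10; pretend it is missing and must be interpolated.
img = [[0, 0, 0],
       [10, 10, 10],
       [0, 0, 0]]

checker_value = fill_checkerboard(img, 1, 1)   # uses 4 neighbors
scanline_value = fill_scanline(img, 1, 1)      # uses 2 neighbors
```

In this one case the "+" neighborhood still sees the line through the left and right neighbors, while the +-Y version misses it entirely. It is a single example, not a general proof.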
     
    Kej likes this.
  4. OlegSH

    I used to think of it as interpolation; traditional upsampling is much closer to extrapolation, which is less accurate by definition.
     
  5. sebbbi

    Technically they are both interpolating, since the endpoints are known and the value lies between them. Checkerboard interpolation has "+" neighbors, while bilinear (center sample) has "X" neighbors. The main difference is that temporal checkerboard only needs to interpolate when reprojection fails (usually less than 10% of pixels). The other 90%+ of pixels are reconstructed (not interpolated). However, reconstruction usually also does some (usually higher order) interpolation, as motion vectors don't always point to exact pixel centers.
     
    #185 sebbbi, Dec 1, 2016
    Last edited: Dec 1, 2016
    Kej likes this.
  6. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    iMacmatician, Kyyla, Razor1 and 3 others like this.
  7. swaaye

    swaaye Entirely Suboptimal
    Legend

    Joined:
    Mar 15, 2003
    Messages:
    9,044
    Likes Received:
    1,116
    Location:
    WI, USA
    The 3.5 GB stricken GTX 970 looks pretty good though.
     
  8. seahawk

    Regular

    Joined:
    May 18, 2004
    Messages:
    511
    Likes Received:
    141
    If you look at the min. FPS the 1050ti is nearly on the Fury level. I think AMD must have some problems of their own. (Also look at 980 4GB and even the 970)
     
  9. DavidGraham

    HardOCP reports stuttering and hitches on the FuryX in Fallout 4 using the High Res texture pack. Apparently 4GB is not enough in this title. At least 6GB is recommended.

    http://www.hardocp.com/article/2017/02/12/fallout_4_high_resolution_texture_pack_review/13

    Note that the average fps of the card was not affected, and VRAM usage topped out at 3.7GB. However, the FuryX was swapping a lot of textures between the SSD and VRAM, which manifested as stutters and hitches that never happened on the 6GB 1060 or the 8GB 480. This mirrors a behavior common in Rise of the Tomb Raider as well (especially in wide areas and during extended gameplay sessions).

    [​IMG]

    http://www.hardocp.com/article/2017/02/12/fallout_4_high_resolution_texture_pack_review/3
     
  10. Malo

    Malo Yak Mechanicum
    Legend Subscriber

    Joined:
    Feb 9, 2002
    Messages:
    8,929
    Likes Received:
    5,529
    Location:
    Pennsylvania
    The difference between the 4GB and 8GB 480 at only 1080p is massive. A clear demonstration of why 4GB is not enough.

    My question is why this title is using so much VRAM at 1080p, though? There's no MSAA, and the textures in Biohazard are very low resolution.
     
  11. DavidGraham

    Even 6GB is not enough with this game: the 8GB 480 is much faster than the 6GB 1060 and almost as fast as the 6GB 980Ti. The game scales well with memory size! The 3GB 1060 is completely crushed here.
    The game is a collection of large rooms/halls connected through closed doors that you frequently go back and forth through. These interiors are full of details and objects, which requires keeping all of it close by in VRAM for quick access and to avoid pop-in and streaming issues. It also has to do with an option called Shadow Cache, which caches your shadows in VRAM to avoid pop-in and to reuse them to accelerate rendering. So the more shadows you have cached, the more performance you get, at least theoretically, which increases VRAM load even more.
     
    Pixel, Alexko, pharma and 1 other person like this.
  12. Blazkowicz

    Legend

    Joined:
    Dec 24, 2004
    Messages:
    5,607
    Likes Received:
    256
    Thanks for describing the game.
    An issue is that to see the full picture, we need the benchmark results with Shadow Cache disabled. Tanking your framerate by blitting stuff over PCIe constantly so as to "accelerate" your shadows defeats the point.

    I had seen a link that gives such results, on vs off, same 1080p resolution tested.
    http://www.tomshardware.fr/articles/test-resident-evil-re7-benchmark,2-2618-3.html
    The boards are R9 390, GTX 1060 6GB, RX480 8GB, GTX 970, RX470 4GB, GTX 1060 3GB.

    Leaving this off seems to make 4GB cards adequate and 3GB cards bearable.

    On > 4GB cards, enabling shadow cache leaves average framerates about the same, but minimum framerate is nicely up (btw, different benchmarking, different results...)
    There is the same behavior where AMD 4GB cards are a bit weak on minimum framerate, even with the cache off.

    That said, min framerate is a weak measurement in itself: a single outlier frame will make it look bad. A decade or two ago, minimum framerate was too inconsistent or meaningless to be considered.
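To illustrate the outlier problem with made-up numbers: the sketch below computes average fps, minimum fps, and a 1st-percentile fps (nearest rank) from a list of frame times. The percentile metric is a commonly used alternative and an addition here, not something any of the linked reviews is confirmed to use; the frame times are invented.

```python
def fps_metrics(frame_times_ms):
    """Return average, minimum and 1st-percentile fps for a list of frame times."""
    per_frame_fps = sorted(1000.0 / t for t in frame_times_ms)
    count = len(per_frame_fps)
    # Average fps over the whole run = frames rendered / total time.
    average_fps = 1000.0 * count / sum(frame_times_ms)
    # Nearest-rank 1st percentile: index of ceil(1% of the samples), zero-based.
    rank = max(0, (count + 99) // 100 - 1)
    return {
        "average_fps": average_fps,
        "min_fps": per_frame_fps[0],
        "p1_fps": per_frame_fps[rank],
    }

# Hypothetical run: 999 frames at 10 ms (100 fps) plus one 100 ms hitch (10 fps).
frame_times = [10.0] * 999 + [100.0]
metrics = fps_metrics(frame_times)
```

Here one hitch drags `min_fps` down to 10 while the average and the percentile barely move. The same average also hides a run with frequent hitches, so neither number alone tells the whole story.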
     
    #192 Blazkowicz, Feb 13, 2017
    Last edited: Feb 13, 2017
  13. lanek

    Veteran

    Joined:
    Mar 7, 2012
    Messages:
    2,469
    Likes Received:
    315
    Location:
    Switzerland
    It is even worse if you enable shadow cache....

    after lots of weird results with graphics card that have less then 4 GB of VRAM we found out that the problematic settings is Shadow cache. If you have an up-to 4 GB graphics card, please turn it off and you'll be playing properly. The charts are now updated to reflect that.

    (result shadow cache off )

    [​IMG]
    [​IMG]
    At 4K, the 4GB cards show their limit (you effectively need 5GB+).

    But Resident Evil has a lot of inconsistency when benchmarked (performance-wise).
     
    #193 lanek, Feb 13, 2017
    Last edited: Feb 13, 2017
    CSI PC and BRiT like this.
  14. DavidGraham

    Can't say I trust Guru3D that much; their testing has frequently been sloppy recently, with many updates to their reviews due to test system errors. Anyhow, the inconsistency stemmed from them testing 4GB cards with Shadow Cache on, which caused their performance to tank. They should have tested 4GB cards with the Cache off, and 8GB cards with the Cache on.

    On my 1070, Shadow Cache boosts my fps by about 8-10 frames just standing still and looking at the same scenery.
     
    pharma likes this.
  15. lanek

    Yes, I know about Guru3D.

    Same on the 480 and 390X 8GB... But on GPUs that have only 4GB it will completely cripple the performance.

    shadow cache off.
    http://www.pcgameshardware.de/Resid.../Specials/Benchmark-PC-Anforderungen-1219005/

    The problem with this game is that reviewers should show both results (with cache on and off) instead of choosing one method over the other.
     
    #195 lanek, Feb 13, 2017
    Last edited: Feb 13, 2017
    BRiT, pharma and DavidGraham like this.
  16. Putas

    Regular

    Joined:
    Nov 7, 2004
    Messages:
    737
    Likes Received:
    354
    Only in the case of bad benchmarking practices, which can invalidate other metrics as well. I found the results of 20-year-old cards very consistent.
     
  17. Ike Turner

    Veteran

    Joined:
    Jul 30, 2005
    Messages:
    2,110
    Likes Received:
    2,304
    Relevant:

    Fallout 4 Very High Texture Pack: http://gamegpu.com/rpg/роллевые/fallout-4-high-resolution-texture-pack-test-gpu

    HOCP reports stuttering that doesn't affect framerate (??) on Fiji GPUs, but I would like a non-biased third-party report on this matter (I wouldn't trust anything Kyle says about AMD/Radeon products). Can anyone else with a Fury X or Nano please share their experience with Fallout 4 + the High texture pack?

    [​IMG]

    [​IMG]

    [​IMG]

    For Honor Beta: http://gamegpu.com/action-/-fps-/-tps/for-honor-beta-test-gpu

    [​IMG]

    [​IMG]

    [​IMG]

    Halo Wars 2: http://gamegpu.com/rts-/-стратегии/halo-wars-2-test-gpu

    [​IMG]

    ...Fury X... ¯\_(ツ)_/¯
     
    #197 Ike Turner, Feb 20, 2017
    Last edited: Feb 20, 2017
  18. Anarchist4000

    Veteran

    Joined:
    May 8, 2004
    Messages:
    1,439
    Likes Received:
    359
    Sounds like a driver bug or something. He said the Fury was using nearly 1GB (~4.8GB vs 5.8GB) less VRAM (dedicated + dynamic) than a 480 and the SSD was going crazy. So something important was getting evicted from even system memory. Otherwise I'd expect the SSD usage to be the same between cards and similarly allocated VRAM. That being the case, I doubt he was wrong about occasional stuttering.
     
    BRiT likes this.
  19. Rootax

    Veteran

    Joined:
    Jan 2, 2006
    Messages:
    2,400
    Likes Received:
    1,845
    Location:
    France
    So, I played a few hours of sniper elite 4 on my pc (5820k@4.2ghz, 16gb of ram, and a fury X), @1440p, everything ultra and post processing FSAA at high. My video ram is full (It's always between 3900mb and 4016mb), but no stuttering at all, which is kind of nice, very smooth. I use DX12. So I don't know if AMD made a good driver with the 17.2.1, or if the game engine is pretty well done in swapping in and out stuff from vram without causing "pauses". But it's nice to see.
     
  20. Alessio1989

    Regular

    Joined:
    Jun 6, 2015
    Messages:
    614
    Likes Received:
    321
    I don't have the latest Biohazard... does anyone have some information about its streaming techniques, or at least whether the game uses tiled resources/sparse textures/virtual textures/whatever_you_like_call_it?

    As for Fallout 4... I would not take it into consideration... Except for a bug-race..
     
  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.