Next Generation Hardware Speculation with a Technical Spin [post GDC 2020] [XBSX, PS5]

Discussion in 'Console Technology' started by Proelite, Mar 16, 2020.

  1. psorcerer

    Regular

    Joined:
    Aug 9, 2004
    Messages:
    732
    Likes Received:
    134
    That's all cool and stuff.
    But do you remember that for oblique angles the mip 0 is used on a pretty small portion of the screen?
    All the farther pixels will use higher mips.
    That's why if you could load just a part of the texture, a tiny part of that 16K, life would be so much easier... wait, that's what SFS does. Isn't it?
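    As a rough illustration of why distant pixels land on higher mips, here is a minimal sketch of mip selection. It assumes the simplified rule "mip = floor(log2(texels covered per pixel))"; real hardware works from screen-space derivatives and blends between levels, and the footprint values are hypothetical.

```python
import math

MIP_COUNT = 15  # a 16384-wide texture has mips 16384, 8192, ..., 1


def mip_level(texels_per_pixel: float, mip_count: int = MIP_COUNT) -> int:
    """Pick the mip whose texel density is roughly one texel per pixel.

    texels_per_pixel is how many mip-0 texels one screen pixel spans along
    its longest axis (a simplification of the hardware LOD calculation).
    """
    if texels_per_pixel <= 1.0:
        return 0  # magnified or 1:1, so the full-resolution mip is needed
    level = int(math.floor(math.log2(texels_per_pixel)))
    return min(level, mip_count - 1)


# Hypothetical footprints for pixels that are progressively farther away
# on a surface viewed at an oblique angle.
footprints = [0.5, 1.0, 2.0, 4.0, 9.0, 40.0]
levels = [mip_level(f) for f in footprints]

for f, lvl in zip(footprints, levels):
    print(f"{f:5.1f} texels/pixel -> mip {lvl}")
```

    Only the pixels in the first two rows would touch mip 0 here; everything farther away samples smaller mips, which is the case partial loading of a texture is meant to exploit.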
     
  2. goonergaz

    Veteran

    Joined:
    Jun 3, 2005
    Messages:
    3,511
    Likes Received:
    996
    It seems that Sony are making life harder for devs which goes against the whole Cerny approach of making life easier for devs?
     
  3. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,346
    Likes Received:
    3,864
    Location:
    Well within 3d
    It does sound like the PS5 is pushing things, although we don't know what the relative power budgets are for the consoles. An early decision for a certain die size and power budget could have left the PS5 in this position.

    I'm assuming that should apply more to the GPU than CPU, since the CPU clocks look like they're more in the linear part of the Zen clock curve.
    On the other hand, I have seen some attempts at graphing the 5700 XT's clock and power curve, and some of the overclocks that nominally exceed 2 GHz get a slope like that at the end. I'm not sure how scientific those attempts were, but if representative they also show that RDNA2's clock curve isn't significantly offset at the upper extreme.
     
    tinokun, PSman1700 and disco_ like this.
  4. iroboto

    iroboto Daft Funk
    Legend Regular Subscriber

    Joined:
    Mar 6, 2014
    Messages:
    9,847
    Likes Received:
    9,170
    Location:
    Self Imposed Work Exile: The North
    Sure, and what happens when you're not looking at oblique textures? Have you done the math on what it takes to render 16K textures? Or are you just assuming it's capable because I/O isn't the limiter anymore?

    Can someone else provide some thoughts here? I don't want to be dismissive, but I have found that texture performance for a game is mainly a factor of I/O, memory capacity and available memory bandwidth, and the latter two will be severely lacking relative to the boost we got in I/O.

    Some metrics on movie scene quality here:
    source: http://theillusionden.blogspot.com/2016/03/
    Film characters and models can have 8k+ texture maps or even hundreds of 4-8k maps per model

    “Almost every asset rendered by Weta for Avatar was painted to some extent in MARI. A typical character was around 150 to 170 patches, with 30 or more channels (specular, diffuse, sub-surface, etc) and 500k polygons at Sub-division level one. The full texture set ran to several tens of gigabytes, all of which could be loaded in MARI at the same time. The biggest asset I saw being painted was the shuttle, which came in at 30Gb per channel for the fine displacement detail (500, 4K textures). Assets of over 20M polys can be painted.” - Jack Greasley

    You think PS5 is capable of this in real time because of their SSD?
     
    #1464 iroboto, Mar 30, 2020
    Last edited: Mar 30, 2020
  5. PSman1700

    Veteran Newcomer

    Joined:
    Mar 22, 2019
    Messages:
    2,322
    Likes Received:
    720
    It was closest to a 7850, which was quite a bit more powerful than a 5850, yes. A GPU that ranges between 9 and 10 TF might seem low end compared to what's going to be available in the PC space, but for a console I wouldn't call it bad by any means. In raw TF, it's a bit over 5 times more powerful than the base PS4, and a bit over two times compared to the Pro.
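
    The ratios above can be checked with the usual peak-throughput formula. This is only a sketch: the CU counts and clocks are the publicly quoted figures rather than numbers from this thread, and the PS5 entry uses its maximum (variable) clock.

```python
# Peak FP32 throughput for a GCN/RDNA-style GPU:
#   CUs x 64 lanes per CU x 2 FLOPs per lane per clock (one FMA) x clock.

def peak_tflops(compute_units: int, clock_ghz: float) -> float:
    lanes_per_cu = 64
    flops_per_lane_per_clock = 2  # a fused multiply-add counts as two ops
    gflops = compute_units * lanes_per_cu * flops_per_lane_per_clock * clock_ghz
    return gflops / 1000.0


# Assumed specs (publicly quoted CU counts and clocks in GHz).
consoles = {
    "PS4": peak_tflops(18, 0.800),
    "PS4 Pro": peak_tflops(36, 0.911),
    "PS5 (max clock)": peak_tflops(36, 2.23),
    "XSX": peak_tflops(52, 1.825),
}

ps5 = consoles["PS5 (max clock)"]
ratio_vs_ps4 = ps5 / consoles["PS4"]
ratio_vs_pro = ps5 / consoles["PS4 Pro"]

for name, tf in consoles.items():
    print(f"{name}: {tf:.2f} TF")
print(f"PS5 vs PS4: {ratio_vs_ps4:.2f}x, PS5 vs Pro: {ratio_vs_pro:.2f}x")
```

    This is a paper figure only; it says nothing about sustained clocks or how well either design keeps its CUs fed.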

    He can't compare the XSX to the PS5 because we haven't seen anything SoC/design/case related to the PS5 yet. MS went with a rather PC-style design: double PCB, a wide/slower GPU, fixed clocks. What kind of power usage have MS/Sony aimed for? They could have aimed for completely different goals from the beginning there.
    I can imagine that extreme clocks aren't that efficient either (if the 2.3 GHz+ range can be considered extreme for RDNA2; maybe it's normal, but then why didn't MS clock theirs higher?).

    The 2 GHz/3 GHz with boost most likely has nothing to do with a bad or poor design, but rather with a different design. I guess that both MS and Sony had their design plans already laid out 4 to 5 years ago, but both had different plans in mind.

    Where do diminishing returns appear though? I know that when overclocking my CPUs/GPUs, after a certain overclock/boost, practical results start to diminish; they mostly become useful for benchmarks then. For example, an i7 920 clocks easily to 4 GHz or beyond, at least the D0s do, but after 3.6 GHz you have about zero advantage going higher in real-world gaming. I had the same experience with GPUs in general, although there you don't work with 1+ GHz overclocks.
     
  6. psorcerer

    Regular

    Joined:
    Aug 9, 2004
    Messages:
    732
    Likes Received:
    134
    You will get a lower mip. Like mip 2 will be 4K.

    Cache. You still use it.
    Unless you wanna do RT. Then your cache is busted by random access all over the place.
    That's why RT wasn't ever used in realtime, not because nobody could do the puny intersection fixed path.

    That's because they can. Not because they need it.
    Good luck finding any artist with a good understanding of hardware in the movie industry. :)
     
  7. Love_In_Rio

    Veteran

    Joined:
    Apr 21, 2004
    Messages:
    1,577
    Likes Received:
    193
    Yes, you are right, I meant a Radeon 7850, which IIRC had 16 CUs instead of the 18 of the PS4.
     
    PSman1700 likes this.
  8. dobwal

    Legend Veteran

    Joined:
    Oct 26, 2005
    Messages:
    5,314
    Likes Received:
    1,341
    Which textures will be targeted? While the VRAM needed for 16K textures may be kept small, the amount of storage cannot be. There is over an order of magnitude difference in texel count between a 4K texture and a 16K texture.
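    To put numbers on that, here is a small sketch of the texel and byte counts involved. It assumes square textures and two common storage costs (4 bytes per texel for uncompressed RGBA8, 1 byte per texel for a BC7-class block format), and it ignores block padding on the smallest mips.

```python
def mip_dimensions(size: int) -> list:
    """Edge length of every mip level of a square texture, from mip 0 down to 1x1."""
    dims = []
    while size >= 1:
        dims.append(size)
        size //= 2
    return dims


def texels(size: int) -> int:
    return size * size


dims_16k = mip_dimensions(16384)
ratio_16k_to_4k = texels(16384) // texels(4096)
chain_texels_16k = sum(texels(d) for d in dims_16k)

# Assumed storage costs per texel.
RGBA8_BYTES = 4
BC7_BYTES = 1

mib = 1024 * 1024
print(f"mip 2 of a 16K texture is {dims_16k[2]} texels wide")
print(f"16K has {ratio_16k_to_4k}x the texels of 4K")
print(f"16K mip 0, RGBA8: {texels(16384) * RGBA8_BYTES // mib} MiB")
print(f"16K mip 0, BC7:   {texels(16384) * BC7_BYTES // mib} MiB")
print(f"16K full chain, BC7: {chain_texels_16k * BC7_BYTES / mib:.1f} MiB")
```

    Even block-compressed, a single 16K map with its mip chain is on the order of a few hundred MiB on disk before any further compression, which is the storage cost the question is pointing at.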
     
  9. Metal_Spirit

    Regular Newcomer

    Joined:
    Jan 3, 2007
    Messages:
    545
    Likes Received:
    335
    The Tempest Engine has great similarities to AMD Audio Next.
    [​IMG]
    Question is: On RDNA 2 and Tempest Engine, will it still be a reserved CU on the GPU, or an extra one?
     
    disco_ likes this.
  10. chris1515

    Veteran Regular

    Joined:
    Jul 24, 2005
    Messages:
    4,505
    Likes Received:
    3,332
    Location:
    Barcelona Spain
    No, why would you want them to reserve a CU from the GPU? It is also different from a regular CU: it is a hybrid CU/SPU, and they did this to provide a dedicated unit for audio and 3D audio. You can only dream of seeing graphics developers give up some of the GPU's CUs for audio.

    This is what a third-party developer said, and the important part is that it is discrete: nothing to share with the graphics guys.





    And here is what a first-party guy from Naughty Dog said as a joke.

    [​IMG]

    [​IMG]
     
    disco_ likes this.
  11. function

    function None functional
    Legend Veteran

    Joined:
    Mar 27, 2003
    Messages:
    5,229
    Likes Received:
    2,490
    Location:
    Wrong thread
    I've been intrigued by those comments from MS's Game Stack dude who talked about real time ML uprezzing of game textures loaded from disk.

    I wonder if you could keep some memory free to upres appropriate textures for the occasions when they get really, really close.

    In fact, I wonder if hints for the ML upscaler could become part of a texture compression scheme .... :?:
     
    Silenti, disco_ and BRiT like this.
  12. PSman1700

    Veteran Newcomer

    Joined:
    Mar 22, 2019
    Messages:
    2,322
    Likes Received:
    720
    Those were shared the day of (or the day after??) the spec release two weeks ago, so why are they appearing again and again? By that I mean the tweets :p
    There are a boatload of MS devs 'joking' too, but let's spare those maybe. A Sony dev is going to say PS5 is better and vice versa. I think we get it now ;)
     
    egoless likes this.
  13. iroboto

    iroboto Daft Funk
    Legend Regular Subscriber

    Joined:
    Mar 6, 2014
    Messages:
    9,847
    Likes Received:
    9,170
    Location:
    Self Imposed Work Exile: The North
    Sure, all this is possible, but at the cost of having to perform ML as soon as the textures arrive and store the final outputs before they are called for rendering. I guess if there are spare cycles, perhaps via async compute or some other way to fill in the dips, this might be ideal. If there is a trained model that is successful at this up-rez solution, I'd like to see its outputs in any title, just to see how well it performed. I know that with enough compute time ML can create wonderful up-resolutions, but a real-time application tends to be much more difficult. I'm not sure how much power is available for this type of thing.

    There could be some use cases for games where this will make sense, such as games that are naturally slower, where it is easier to determine what to load next. But for twitch-based games, this doesn't seem like an ideal use case.
     
    Silenti, function and BRiT like this.
  14. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    8,860
    Likes Received:
    2,791
    Location:
    Finland
    Tempest Engine is separate. It's based on an unknown AMD Compute Unit (so it could be GCN or RDNA, we don't know) which they stripped of its caches. Apparently there is only one unit too, or at least I think Cerny never used the plural for it.
     
    PSman1700 likes this.
  15. dobwal

    Legend Veteran

    Joined:
    Oct 26, 2005
    Messages:
    5,314
    Likes Received:
    1,341
    I have a question. With int4 and int8 support, does that mean that the XSX offers tensor-core-like performance with those precision modes?

    I ask because the RDNA white paper makes reference to Navi CU variants.

    AMD has a patent for a parallel matrix multiply pipeline using dot product units.

    http://www.freepatentsonline.com/y2019/0171448.html
     
    #1475 dobwal, Mar 30, 2020
    Last edited: Mar 31, 2020
  16. iroboto

    iroboto Daft Funk
    Legend Regular Subscriber

    Joined:
    Mar 6, 2014
    Messages:
    9,847
    Likes Received:
    9,170
    Location:
    Self Imposed Work Exile: The North
    Not quite the same. It's hard to explain. A Google Tensor Processing Unit, for example, is different from Nvidia's tensor cores.

    Nvidia does a single matrix multiply-accumulate in a single clock cycle. It does this with mixed precision, so 4, 8, 16 bits etc., and outputs, I believe, a 32-bit value. I could be wrong. It is fast though. (I wish I had one to play with.)
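
    For readers unfamiliar with the operation, here is a minimal sketch of a mixed-precision matrix multiply-accumulate, D = A x B + C. It assumes 4x4 tiles with int8 inputs and a 32-bit accumulator purely for illustration; it models the arithmetic only, not how any particular hardware schedules it.

```python
def check_int8(value: int) -> int:
    """Inputs are low precision: reject anything outside the int8 range."""
    if not -128 <= value <= 127:
        raise ValueError(f"{value} does not fit in int8")
    return value


def wrap_int32(value: int) -> int:
    """Emulate a 32-bit two's-complement accumulator."""
    value &= 0xFFFFFFFF
    return value - (1 << 32) if value >= (1 << 31) else value


def mma(a, b, c):
    """Return D = A x B + C with int8 inputs A, B and int32 C, D."""
    n = len(a)
    d = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            acc = c[i][j]
            for k in range(n):
                product = check_int8(a[i][k]) * check_int8(b[k][j])
                acc = wrap_int32(acc + product)
            d[i][j] = acc
    return d


a = [[127] * 4 for _ in range(4)]
b = [[127] * 4 for _ in range(4)]
c = [[0] * 4 for _ in range(4)]
d = mma(a, b, c)
print(d[0][0])
```

    A single output element here already exceeds both the int8 and int16 ranges, which is why the accumulator is wider than the inputs. The software loop performs 64 multiply-adds for one tile; the point of a dedicated unit is to issue that whole tile as one operation.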

    I'm not sure what Google's does; they made an ASIC for theirs. It's very fast.

    As for AMD, unless they can do more operations in a single cycle, it will likely not be as fast as an Nvidia tensor core.
     
    #1476 iroboto, Mar 31, 2020
    Last edited: Mar 31, 2020
    PSman1700 and pharma like this.
  17. dobwal

    Legend Veteran

    Joined:
    Oct 26, 2005
    Messages:
    5,314
    Likes Received:
    1,341
    I added more details that I initially came across that warranted my question.
     
  18. iroboto

    iroboto Daft Funk
    Legend Regular Subscriber

    Joined:
    Mar 6, 2014
    Messages:
    9,847
    Likes Received:
    9,170
    Location:
    Self Imposed Work Exile: The North
    Thanks;
    Reading that (if it completes in a single clock cycle, yea sounds pretty close at a high level)

    should be very similar to a tensor core. This is nice. Thanks for the heads up. I hope that the community embraces this (now that the hardware is available).

    CUDA has dominated this industry, making Nvidia the main card of choice as a result. I hope that we see these libraries expand to OpenCL.

    Many of us also have Macs ;) and other platforms, so I would consider buying an AMD GPU if it supported more ML libraries without the pain (or I guess I could get around to coding my own algorithms).
     
    #1478 iroboto, Mar 31, 2020
    Last edited: Mar 31, 2020
  19. pharma

    Veteran Regular

    Joined:
    Mar 29, 2004
    Messages:
    3,416
    Likes Received:
    2,072
    https://www.techspot.com/community/...next-navi-configurations.256148/#post-1767329
     
    Silenti, PSman1700 and iroboto like this.
  20. dobwal

    Legend Veteran

    Joined:
    Oct 26, 2005
    Messages:
    5,314
    Likes Received:
    1,341

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.