PS5 Pro *spawn

What would they do with that hardware besides AI things? I'm sure that in time, and/or with future PlayStation hardware, they will add things like ray reconstruction, frame generation and other features. But right now they are simply not ready.

Also, we have games like Alan Wake 2 that upscale to 4K from the same base resolution as the PS5 while adding the quality mode's settings. If they just used the additional CUs to upscale, they wouldn't have the time for the higher settings.

The PS5 has something like 50 TOPS (I can't find an exact number), and the Pro has 300. They added something, whatever it is.
17.55 TF is the 5 Pro
×2 for dual-issue = 35 TF
×2 going down to FP16 = 70 TFLOPS
×2 going down to INT8 = 140 TOPS
×2 to allow for ML sparsity = 280~300 TOPS

It really comes down to that base number, which is between 17 TF and 18 TF.
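
For anyone wanting to sanity-check that, here's the same back-of-the-envelope math as a quick Python sketch; the base figure and each ×2 factor are assumptions carried over from the list above, not confirmed hardware specs:

```python
# Back-of-the-envelope TOPS estimate for the PS5 Pro GPU.
# The base figure and every x2 factor are assumptions from the
# post above, not confirmed hardware details.

base_fp32_tflops = 17.55                  # rumoured base FP32 throughput

dual_issue  = base_fp32_tflops * 2        # ~35 TF with dual-issue
fp16        = dual_issue * 2              # ~70 TFLOPS at FP16
int8        = fp16 * 2                    # ~140 TOPS at INT8
sparse_int8 = int8 * 2                    # ~280 TOPS with structured sparsity

print(f"FP32:        {base_fp32_tflops:.2f} TF")
print(f"dual-issue:  {dual_issue:.1f} TF")
print(f"FP16:        {fp16:.1f} TFLOPS")
print(f"INT8:        {int8:.1f} TOPS")
print(f"sparse INT8: {sparse_int8:.1f} TOPS")   # lands near the ~300 figure
```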
 
A bit too many responses for the time I have today 😅

We should define "dedicated hardware" in this discussion, because even tensor cores are part of the compute units.

Also, Kepler has talked about how the PS5 Pro takes some aspects of RDNA 4, like the matrix acceleration and sparsity. By my definition, that is dedicated hardware.
 
That's exactly what tensor cores do with DLSS, so I'm not sure why it'd be so strange here.
The Tensor cores are there predominantly for non-gaming tasks such as machine learning. They are fitted as standard, for which you're paying a premium, so they may as well be used for upscaling even if it's only for a fraction of a frame. What's the argument for their inclusion on a console?
 
The Tensor cores are there predominantly for non-gaming tasks such as machine learning. They are fitted as standard, for which you're paying a premium, so they may as well be used for upscaling even if it's only for a fraction of a frame. What's the argument for their inclusion on a console?
You aren't paying a premium for tensor cores being 'fitted'. Of course they have a cost in die size, but they're really not that big, and they wouldn't need to be that big for a more specialized purpose like in a console, purely for reconstruction. Given what it adds in performance for its overhead, it's an easily justifiable inclusion. How much they're active per frame really isn't relevant.
 
You aren't paying a premium for tensor cores being 'fitted'. Of course they have a cost in die size, but they're really not that big, and they wouldn't need to be that big for a more specialized purpose like in a console, purely for reconstruction.
ML hardware is just ML hardware; you can't specialise it for upscaling. A dedicated HW upscaler could maybe be smaller, but that's clearly not what we've got, as it's never been described as such.
Given what it adds in performance for its overhead, it's an easily justifiable inclusion. How much they're active per frame really isn't relevant.
If you want them to finish in 2 ms, you'll need a certain amount of hardware, which then does nothing when not upscaling. If you want the optimal HW choice, you want just enough ML hardware to process the frame in 15 ms and run a frame behind, which isn't what we've got (rough numbers sketched after this post).

The Tensor cores behind DLSS are there because nVidia wanted ML hardware in their GPUs for AI stuff, nothing to do with gaming. They then found a use for it for gamers. How much time do those Tensor cores spend on DLSS? Maybe a couple of ms. The rest of the time, it's dead silicon. It could of course be used by devs, but only really in nVidia-exclusive content. Hence we get a situation where the Tensor cores are used briefly to upscale and then do nothing for the rest of the frame. Dead silicon is not really ideal for consoles, which need more efficient HW. Moving the upscaling to either a dedicated upscaler or across the existing compute achieves this.
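
To put rough numbers on that sizing argument, a quick sketch; the per-frame ML workload here is an illustrative assumption, not a measured figure:

```python
# Rough sketch of the sizing trade-off: hardware big enough to finish
# the upscale in 2 ms vs hardware sized to stretch the same work over
# ~15 ms (running a frame behind). The per-frame workload is an
# illustrative assumption, not a measured figure.

upscale_work_tera_ops = 0.1   # assumed ML ops per upscaled frame (tera-ops)

def required_tops(budget_ms: float) -> float:
    """TOPS of ML hardware needed to finish the pass within budget_ms."""
    return upscale_work_tera_ops / (budget_ms / 1000.0)

fast = required_tops(2.0)     # finish in 2 ms, sit idle the rest of the frame
slow = required_tops(15.0)    # use nearly the whole frame, run a frame behind

print(f"2 ms budget:   ~{fast:.0f} TOPS")     # ~50 TOPS
print(f"15 ms budget:  ~{slow:.1f} TOPS")     # ~6.7 TOPS
print(f"Silicon ratio: ~{fast / slow:.1f}x")  # ~7.5x more hardware, mostly idle
```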
 
How much time do those Tensor cores spend on DLSS? Maybe a couple of ms. The rest of the time, it's dead silicon.
They are not dead silicon in practice, though. They're used concurrently with the shader cores all the time, and they do a lot more than upscaling now: frame generation, denoising in several ray-traced and path-traced titles, and HDR conversion post-processing in most titles. They are almost as active as the shader cores now.
 
You aren't paying a premium for tensor cores being 'fitted'. Of course they have a cost in die size, but they're really not that big, and they wouldn't need to be that big for a more specialized purpose like in a console, purely for reconstruction. Given what it adds in performance for its overhead, it's an easily justifiable inclusion. How much they're active per frame really isn't relevant.
Tensor silicon is inherently very different from SIMD units. Yes, they are housed in the SM, but how they access cache is dramatically different, and what can be accomplished in a single cycle on tensor cores would take 20+ cycles on a CU. Tensor cores sit idle waiting to be filled with data, spend only a cycle doing all the calculations they need to do, then have to write out and wait for the next batch of data to come in.
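
As a toy illustration of that gap (tile shape and lane count are assumed for the example, not figures for any actual GPU):

```python
# Toy comparison of one tensor-core matrix tile vs general SIMD FMAs.
# The tile shape and lane count are assumptions for the example,
# not figures for any specific GPU.

M = N = K = 16                       # one matrix tile, e.g. 16x16x16
macs_per_tile = M * N * K            # 4096 multiply-accumulates

simd_lanes = 64                      # assumed FMA lanes per CU per cycle
cu_cycles = macs_per_tile // simd_lanes

print(f"MACs in one tile:  {macs_per_tile}")
print("Tensor core:       ~1 instruction, pipelined over a few cycles")
print(f"CU at {simd_lanes} lanes:    ~{cu_cycles} cycles of pure FMA work")
```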
 
They are not dead silicon in practice, though. They're used concurrently with the shader cores all the time, and they do a lot more than upscaling now: frame generation, denoising in several ray-traced and path-traced titles, and HDR conversion post-processing in most titles. They are almost as active as the shader cores now.

And thank god for RTX HDR, so I don't have to suffer one shit implementation after another.
 
ML hardware is just ML hardware; you can't specialise it for upscaling. A dedicated HW upscaler could maybe be smaller, but that's clearly not what we've got, as it's never been described as such.

If you want them to finish in 2 ms, you'll need a certain amount of hardware, which then does nothing when not upscaling. If you want the optimal HW choice, you want just enough ML hardware to process the frame in 15 ms and run a frame behind, which isn't what we've got.

The Tensor cores behind DLSS are there because nVidia wanted ML hardware in their GPUs for AI stuff, nothing to do with gaming. They then found a use for it for gamers. How much time do those Tensor cores spend on DLSS? Maybe a couple of ms. The rest of the time, it's dead silicon. It could of course be used by devs, but only really in nVidia-exclusive content. Hence we get a situation where the Tensor cores are used briefly to upscale and then do nothing for the rest of the frame. Dead silicon is not really ideal for consoles, which need more efficient HW. Moving the upscaling to either a dedicated upscaler or across the existing compute achieves this.
You could absolutely specialize hardware to focus on accelerating a specific type of instruction above all. 'ML hardware' is not some strict, fixed thing at all. Different ways to skin a cat.

I really just do not understand your preoccupation with this idea that you need the ML hardware to be active the whole time for it to be justifiable. Using a small bit of extra die space for what essentially gives you a large performance boost is easily justified. Just shifting it to compute instead cuts into the main rendering budget, and because it'll be slower, you're *really* eating into that budget. Making up for it would basically require a fair chunk larger GPU, which will be much more costly in terms of die space.

And tensor cores were absolutely included in the GAMING line of GPUs in order for games to make use of them. By your reasoning, GeForce GPUs should strip the tensor cores out and just do DLSS via general compute, because they aren't being utilized enough. But we know that's absurd. They are well worth their inclusion for DLSS alone.
 