Baseless Next Generation Rumors with no Technical Merits [post E3 2019, pre GDC 2020] [XBSX, PS5]

Wasn't it Jason Schreier who said Sony was planning a 2019 release at some point, but delayed a year to guarantee a stronger launch lineup?
If so, I'd say he's a very substantial source.

Jason said in early 2018 that 2020 was the launch plan. It was klee and matt who said 2019 was the plan until it was canned in early 2017.


Fair enough.
Does it cease to be GPU hardware if the RT is on a separate chip?

No clue. If he said "in the APU" there'd be more wiggle room. Someone did mention that even PowerVR's RT would be integrated into the GPU, as that's how it's designed.
 
Console makers are not really interested in getting involved with custom hardware anymore... let alone tackling something like RT hardware (which would then have to work with the APU).
It's like this patent doesn't exist. ;) Clearly Sony were open to custom hardware in 2014 (related photon mapping patent) and 2015.

Just noticed the timeline has "2020-01-08 Application status is Active". Anyone know what "application status active" means, and why we'd have a 2020 date for that when the patent was granted in 2017? As far as I knew, patents are granted and that's it. No idea what 'active' is, and the internet throws up nothing. The implied meaning is a date from which the patent is active in some form...
 
AquariusZi on Renoir (the 4000 series just announced). This is from 2019/06/14:

https://www.ptt.cc/bbs/PC_Shopping/M.1560496597.A.CC8.html

1. 7nm Renoir has just arrived in Shinto for evaluation.

2. It should be revised to Socket FP6, by the way.

3. From the layout, 4C looks most likely (probable, but not certain). There is proportionally more area for the GPU than in Picasso.

4. About 150mm².

From today's AnandTech:

https://www.anandtech.com/show/1532...oming-q1?utm_source=twitter&utm_medium=social

The new 8 core / 8 CU chip on 7nm is also very small. AMD wouldn’t let us measure it directly, or take pictures before the keynote, but by putting the chip next to a Zen 2 desktop chiplet, I was able to make out that the new APU is pretty much double the size of a single Zen 2 chiplet, and double 74mm2 is 148mm2 or 150mm2 for a round number.

Now obviously, he predicted a bigger GPU and 4 cores, and what we got is a smaller GPU (11 -> 8 CUs) and more cores, but this tells you (along with his Navi 10 and Vega leaks) that his info is on point.

Oberon, constant revisions, "one size smaller than Arden" (50mm²)... I would give him the benefit of the doubt. It fits perfectly well with what the good intern from AMD has leaked to us.
 
Just noticed the timeline has "2020-01-08 Application status is Active". Anyone know what "application status active" means?

Found this. "With an active patent, no one except the patent owner may manufacture, import, export, sell or otherwise profit from the sale of an item so protected. This effectively gives the patent owner a monopoly on the production of their invention"
https://www.bigcommerce.com/ecommerce-answers/what-patent/
 
Just noticed the timeline has "2020-01-08 Application status is Active". Anyone know what "application status active" means?
Maybe Active means it's not yet expired?

I've seen patents filed (status: pending), then published, and it can take years before they're granted; there can be years between filing and publication, and more years again before grant.

Edit: the USPTO requires patents to be maintained with renewal fees, and if the owner doesn't pay, the patent expires. So status Active must simply mean they continue to renew it.
 
Just noticed the timeline has "2020-01-08 Application status is Active". Anyone know what "application status active" means?
Active means granted/valid until the given expiration date.
 
My take: these consoles are just getting whatever AMD's answer to RTX is. The only difference being one console is using DXR for the API and the other essentially using whatever Vulkan's implementation ends up being.
Hearing something like that for the second time, I need to ask: are there any plans from Sony to support Vulkan? I'd guess not, or did I miss that?

I'm pretty sure AMD will just adopt NV's VK extensions. There are already places where some flags would run out of available bits otherwise, IIRC. It's very likely Khronos will just add duplicates of the defines with the "NV" suffix removed, like it often happened with NV/AMD -> ARB and finally core in OpenGL. Minor changes expected, mostly around new features in new hardware.
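
FWIW, on the app side that's just a string query today. A minimal sketch (device/instance setup omitted; it assumes the NV extension string, which is the part that would simply change if Khronos aliases it):

```cpp
#include <vulkan/vulkan.h>

#include <cstring>
#include <vector>

// Returns true if the physical device exposes NV's ray tracing extension.
// If Khronos later promotes it, the same query works with the suffix-free
// name; only the string changes.
bool HasNvRayTracing(VkPhysicalDevice device) {
    uint32_t count = 0;
    vkEnumerateDeviceExtensionProperties(device, nullptr, &count, nullptr);
    std::vector<VkExtensionProperties> exts(count);
    vkEnumerateDeviceExtensionProperties(device, nullptr, &count, exts.data());
    for (const VkExtensionProperties& e : exts) {
        if (std::strcmp(e.extensionName, "VK_NV_ray_tracing") == 0) {
            return true;
        }
    }
    return false;
}
```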

Sony can and will do whatever suits their hardware without compromise.
 
I have strong doubts someone knows the engineering choices of both chips.
At least it is supposed to be the case that the information is kept separate. The tiny doubt at this point stems from the comparative leaks in 2013 that hinted that AMD's compartments weren't airtight.
I had thought things would be more refined this time around, so I suppose comparing the alleged leaks we have now with whatever comes out will be the test for that.

But that's exactly what Sony did with their custom ID buffer in order to implement their own method of CBR and improved TAA. They came up with their own hardware solution to a problem tailored to their specific needs: how to display relatively sharp 4K with only 2x 1080p worth of pixels.

And I actually expect that Sony will use some RDNA2 features ported onto their custom RDNA GPU (the same way they used Vega features in their GCN GPU). But I won't be surprised if their solution is totally custom, because they have already done that in the past.
The ID buffer itself operates like a tweaked depth unit from the existing RBEs. Its output is parallel with the Z-buffer and it updates a given pixel in step with the Z-buffer.
I think it's an example of how externally different features can be created by modifying or repurposing similar starting elements.
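
To make "updates a given pixel in step with the Z-buffer" concrete, here is a toy software model of the idea (the names and the structure are mine for illustration; the real thing is fixed-function in the RBEs):

```cpp
#include <cstdint>
#include <vector>

// Toy model: the ID write passes or fails on the exact same depth test
// that updates the Z-buffer, so the surviving per-pixel triangle ID
// always matches the surviving depth sample.
struct RenderTargets {
    std::vector<float>    depth;  // Z-buffer
    std::vector<uint32_t> id;     // ID buffer, updated in lockstep
};

void DepthTestAndWrite(RenderTargets& rt, size_t pixel, float z, uint32_t triangleId) {
    if (z < rt.depth[pixel]) {        // standard closer-wins depth test
        rt.depth[pixel] = z;          // Z-buffer update...
        rt.id[pixel]    = triangleId; // ...and the parallel ID write
    }
}
```

With that ID surviving alongside depth, a CBR/TAA resolve can tell whether a history sample belongs to the same surface as the current pixel.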
AMD's patent is a broad set of claims that doesn't commit to a single implementation, and it's still not guaranteed that the one we know about will be reflective of what is used.
If it's used as a starting point, there are sub-elements that clients could modify or replace, like the algorithms used by the node traversal hardware, or the implementation/presence of intersection hardware. Since the hardware is in a sub-block of a compute unit, there may be implementation details about what kind of CU those blocks are linked to, and where they might be placed/reserved.

AMD's patent doesn't define what level of exposure the RT hardware has to developers, and varying what is available for coding to the metal versus custom microcoded tasks or API calls can significantly change what the solution looks like to the programmer.
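
As a rough illustration of how much latitude those sub-elements leave, consider a generic BVH traversal skeleton where the box test/ordering and the leaf intersection are the swappable pieces (the node layout and stack traversal here are textbook-generic, not AMD's):

```cpp
#include <cstdint>
#include <functional>

struct Ray  { float origin[3], dir[3], tMax; };
struct Node {                  // generic 2-wide BVH node, not AMD's layout
    float    bounds[2][6];     // per-child AABB (min xyz, max xyz)
    uint32_t child[2];         // child indices; high bit marks a leaf
    bool isLeaf(int i) const { return (child[i] & 0x80000000u) != 0; }
};

// The two hooks a customized design could swap out: how a child box is
// tested (and ordered), and what "intersect a leaf" means: triangles,
// procedural geometry, or no fixed-function intersector at all, with the
// hit handed back to a shader instead.
using BoxTest  = std::function<bool(const Node&, int, const Ray&)>;
using LeafTest = std::function<void(uint32_t leafIndex, Ray&)>;

void Traverse(const Node* nodes, Ray ray, const BoxTest& hitBox, const LeafTest& hitLeaf) {
    uint32_t stack[64];  // fixed-depth stack, fine for a sketch
    int top = 0;
    stack[top++] = 0;    // start at the root
    while (top > 0) {
        const Node& n = nodes[stack[--top]];
        for (int i = 0; i < 2; ++i) {
            if (!hitBox(n, i, ray)) continue;                 // box rejected
            if (n.isLeaf(i)) hitLeaf(n.child[i] & 0x7FFFFFFFu, ray);
            else             stack[top++] = n.child[i];       // descend
        }
    }
}
```

Everything outside the two hooks could also be microcode or an API call rather than developer-visible, which is exactly the exposure question above.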

The PS5 using a separate chip for RT would be much nicer, as long as said RT chip was able to offload the majority of the ray tracing overhead.
However, in games where RT isn't needed, the XSX would have a sizeable performance advantage. Plus, we'd also need to know whether this same RT chip would process ray-traced audio, or whether the GPU still needs to devote part of those 9.2TF to audio.
One significant source of overhead is the compute resources devoted to building the acceleration structures, which Nvidia's driver manages using the SM hardware. If part of that was offloadable, then a separate chip might have some decent general compute capability itself, perhaps for a subset of asynchronous compute. That might put more pressure on bandwidth if it's reliant on the GPU's pool, and could complicate the geometry phase or transition to pixel shading. A separate memory pool would be more complex in a manner Sony wanted to avoid earlier.

Traversal and intersection testing would be another set of tasks for a separate chip, and those might be more modest in terms of bandwidth and could function on a separate chip on a more modest inter-chip link.
The latency question for a separate solution would rear its head. I'm not as concerned about link latency, but rather that GPUs are more weakly synchronized and can take significant amounts of time before work done in one portion of the chip is visible to the other--and there's usually a cost in throughput or bandwidth to make things more readily visible.
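
A crude CPU-side analogue of that visibility problem, using release/acquire semantics (the real mechanism would be cache flushes and fences across the interconnect, not std::atomic, and all the names here are hypothetical):

```cpp
#include <atomic>
#include <cstdint>
#include <vector>

// Results written by the hypothetical RT chip, read by the GPU side.
std::vector<uint32_t> rayHits(1024);
std::atomic<uint64_t> hitsReadyFence{0};  // monotonically increasing fence

// Producer (RT chip): write a batch of results, then publish it.
// The release is the analogue of flushing caches over the link; it isn't
// free, and batching work to amortize it is exactly what adds latency.
void PublishHits(uint64_t fenceValue) {
    // ... fill some range of rayHits ...
    hitsReadyFence.store(fenceValue, std::memory_order_release);
}

// Consumer (GPU dispatch): nothing in the batch may be read until the
// fence proves it is visible; this wait is the stall being worried about.
void WaitForHits(uint64_t fenceValue) {
    while (hitsReadyFence.load(std::memory_order_acquire) < fenceValue) {
        // spin (a real GPU would hopefully find other work to run)
    }
}
```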

I think there could be a more complex set of trade-offs in terms of flexibility, ease of use, latency, and performance impact that the console vendors would have to make decisions on, even with hardware that has strong similarities at the sub-unit level.

Besides avoiding lower yields from having a larger 400mm² chip?
Even more so if you consider the PS5 was initially thought to launch this year, using 2019's 7nm yields.
If that had happened, there's a higher chance Sony would have been overly pessimistic about the yield picture in 2020, since TSMC has been touting highly improved yields now, versus fears in earlier years that the improvement curve would be longer.
Even without that, Sony's position as market leader could in a way make the PS5 a victim of the PS4's success. Projected sales for Sony versus Microsoft put more weight on per-die cost for the volume leader than for the minority player. If there's an expectation of selling 100 million or more PS5 chips, even a modest extra die cost adds up to a big number that Sony may have decided wasn't worth the upside.
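
To put a hypothetical number on "adds up" (the $5 delta is made up purely for illustration):

```cpp
#include <cstdio>

// Hypothetical figures only: even a small per-die cost delta scales
// brutally at PlayStation volumes.
int main() {
    const double extraCostPerDie = 5.0;    // assumed $5 extra per chip
    const double projectedUnits  = 100e6;  // "100 million or more"
    std::printf("Lifetime cost of the bigger die: $%.0f million\n",
                extraCostPerDie * projectedUnits / 1e6);  // -> $500 million
    return 0;
}
```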

If only AMD had released several solutions that use high-bandwidth / low-latency communication between chiplets and I/O chips using Infinity Fabric over a substrate, during the past couple of years...?
That may depend on what workloads would straddle the link. Within the SOC we could be talking about hundreds of GB/s, and within the GPU domain several TB/s.



An array of Tensilica cores with additional custom instructions specific to RT would be very flexible, both for RT tasks and for audio. Cadence sells this IP as a semi-custom business and allows significant customization.
It does cost extra money to license it, so I could see the console makers exploring options to replace it. AMD likely wouldn't mind cutting Cadence out and pocketing the difference.
Some of AMD's patents on modified CUs with different programming models and hardware layouts do point to a possible desire to create a set of standard parts that could serve in this capacity.
Some of the more advanced chiplet 3D integration or MCM integration patents similarly hint at a desire to have AMD sub-blocks that can be inserted for client needs in custom solutions.
 
Functionally, it could do something as different as processing the scene independently and generating lighting detail only, using simplified geometry and no surface shader evaluation. This could then be combined in GPU shaders for RT lighting + shadows, but not RT reflections.
Reflections and hard shadows would be possible if you have a robust mapping between detailed and simplified geometry, for example via UVs. Accurate normals from the detailed geometry could be used, so only position would introduce some error. (Though all this is much harder than it sounds; I doubt devs would be happy.)

Shading evaluation could happen on the GPU, so the process could be: while generating the G-buffer, also send packets of rays to the RT unit; get packets of ray hits back after some time (maybe nicely sorted by some ID, like material); update/shade the frame buffer using those results; eventually recurse. That requires suspending tasks while waiting on tracing results?
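
As a sketch, that flow might look something like the following; the queues and the material-sorted return are assumptions straight from the paragraph above, not any real API:

```cpp
#include <cstdint>
#include <queue>
#include <vector>

struct Ray    { float origin[3], dir[3]; uint32_t pixel; };
struct RayHit { uint32_t pixel, materialId; float t; };

// GPU -> RT unit, and RT unit -> GPU. The material-sorted return is the
// assumption from the paragraph above, not a real interface.
std::queue<std::vector<Ray>>    toRtUnit;
std::queue<std::vector<RayHit>> fromRtUnit;

void ShadePixel(uint32_t pixel, uint32_t materialId) {
    // update the frame buffer at 'pixel' using 'materialId' (elided)
}

void RenderFrame(int maxBounces) {
    // 1. While the G-buffer is generated, ray packets are pushed onto
    //    toRtUnit as tiles finish (ray generation elided here).
    // 2. Hit packets come back "after some time", grouped by material,
    //    so shading each packet stays coherent.
    for (int bounce = 0; bounce <= maxBounces; ++bounce) {
        while (!fromRtUnit.empty()) {
            std::vector<RayHit> hits = std::move(fromRtUnit.front());
            fromRtUnit.pop();
            for (const RayHit& h : hits) {
                ShadePixel(h.pixel, h.materialId);
            }
            // Secondary rays for the next bounce would be pushed onto
            // toRtUnit here. While waiting for them, the GPU needs other
            // work to hide the round trip: the "pending tasks" question.
        }
    }
}
```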

Maybe the RT unit is also programmable, so it could launch a shadow ray automatically. But without access to textures and materials, no form of importance sampling would be possible. And programmable shaders + texture units... this would almost end up as a second GPU.

What could this look like in practice? Is tight coupling to the GPU totally necessary?
Maybe Sony started on this before the big progress in denoising? What were their expectations back then?
 