D3D10 Deferred Shading: AA possible?

Jawed - right. I guess the correct term would be compression of bandwidth and not storage :). That said, with virtual memory, one should be able to allocate MSAA buffers for some ridiculous AA levels with the end result that many pages in the full-sized buffer never get written, and hence don't consume VRAM (just slots in the page file).
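As a back-of-envelope illustration of why that matters (the resolution, format, and edge fraction below are all assumptions picked for the example, not measurements):

```
#include <cstdio>

// Back-of-envelope numbers for the virtual-memory argument above.
// All figures are illustrative assumptions, not measurements.
int main() {
    const long long width = 1600, height = 1200;
    const long long samples = 16, bytesPerSample = 4;   // 16xAA, RGBA8
    const long long pageSize = 4096;                    // 4 KiB pages

    long long fullBytes = width * height * samples * bytesPerSample;
    long long fullPages = fullBytes / pageSize;

    // Suppose only ~5% of pixels are edge pixels that ever get all 16
    // samples written; pages holding only interior pixels' extra
    // samples could stay untouched.
    double edgeFraction = 0.05;
    double touchedPages = fullPages * edgeFraction;

    std::printf("full buffer: %lld MiB (%lld pages)\n",
                fullBytes >> 20, fullPages);
    std::printf("pages plausibly touched: ~%.0f\n", touchedPages);
    return 0;
}
```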
 
A problem here is that an edge in one G-buffer is not necessarily an edge in another G-buffer. E.g. two triangles abut (sharing a single pixel) with identical Z and identical normals, but different colours.

So the detection of edges needs to be executed for each G-buffer independently. Doesn't it?
I'm not convinced that's the case. In terms of framebuffer compression, an edge is likely an edge, even when the samples on both sides have identical values. I guess it's much easier to implement compression based on whether the coverage mask for a tile is all ones, instead of actually checking whether all samples for each pixel in the tile are identical.
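To make the distinction concrete, here's a minimal CPU-side sketch of the two tests, assuming a 16-pixel tile with 4 samples per pixel; all names and types are invented for illustration:

```
#include <array>
#include <cstdint>

// Illustrative only: one pixel of a 4xMSAA tile. "Sample" bundles the
// per-sample values the hardware would have to compare.
struct Sample { uint32_t colour; uint32_t z; };
constexpr int kSamples = 4;
using Pixel = std::array<Sample, kSamples>;
using Tile  = std::array<Pixel, 16>;   // e.g. a 4x4-pixel tile

// Cheap heuristic: treat the tile as compressible iff every pixel's
// coverage mask was all ones (no triangle edge touched the tile).
bool compressibleByCoverage(const std::array<uint8_t, 16>& coverageMasks) {
    for (uint8_t mask : coverageMasks)
        if (mask != 0x0F) return false;   // 4 samples -> all-ones is 0b1111
    return true;
}

// Exact test: compare every sample of every pixel. This catches the
// "edge that isn't really an edge" case (identical values on both
// sides), but costs a full read and compare of all samples.
bool compressibleBySamples(const Tile& tile) {
    for (const Pixel& p : tile)
        for (int s = 1; s < kSamples; ++s)
            if (p[s].colour != p[0].colour || p[s].z != p[0].z)
                return false;
    return true;
}
```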

As to SS'd edges versus MSAA'd, isn't this always going to be a question of ordered-grid versus sparse-sampling? Is it possible to rasterise around edges (at the super-sampling res) with a non-ordered-grid pattern? Wouldn't that simply shift the artefacts of edge-AA further towards the interior?
For sparse-grid edge supersampling, you could have a rasterizer that turns an edge quad (one where the coverage mask is not all ones) into multiple quads, each having only one mask bit set per pixel. Then you also need forced centroid sampling and derivative correction for these quads; a sketch of the splitting step follows below.

However, as OpenGL guy said, you'd still get filtering artifacts. In fact even more so, since not just edge pixels but entire edge quads are affected. I don't think partly supersampling a surface is a good idea, as you will see the differences in many cases.
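A rough sketch of that quad-splitting step, purely illustrative (the Quad type and 4-sample setup are assumptions, not any particular hardware's data path):

```
#include <cstdint>
#include <vector>

// Hypothetical edge quad: a 2x2-pixel block with a 4-bit coverage mask
// per pixel (one bit per sparse sample position).
struct Quad { uint8_t coverage[4]; };
constexpr int kSamples = 4;

// Split an edge quad into up to kSamples quads, each carrying at most
// one coverage bit per pixel, so each can be shaded at a single sample
// position (with forced centroid sampling and derivative correction).
std::vector<Quad> splitEdgeQuad(const Quad& q) {
    std::vector<Quad> out;
    for (int s = 0; s < kSamples; ++s) {
        Quad sub{};
        bool any = false;
        for (int p = 0; p < 4; ++p) {
            uint8_t bit = q.coverage[p] & (1u << s);
            sub.coverage[p] = bit;
            any = any || bit;
        }
        if (any) out.push_back(sub);   // skip sample slots nobody covers
    }
    return out;
}
```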
 
I'm not convinced that's the case. In terms of framebuffer compression, an edge is likely an edge, even when the samples on both sides have identical values. I guess it's much easier to implement compression based on whether the coverage mask for a tile is all ones, instead of actually checking whether all samples for each pixel in the tile are identical.
Perhaps there's been a mixup: I was talking about the final phase lighting shader that tries to detect edges in the final 16xMSAA G-buffers, in order to avoid "supersampling" execution across all 16 samples.

An example: with normals in one G-buffer and material IDs in another, you could end up with different sets of edges, depending upon which G-buffer you use to identify them. An edge pixel in the normal G-buffer could easily end up as a continuous surface in the material-ID G-buffer.

It seems to me the only solution is to OR the edge detection across all G-buffers.

(But, hey, I don't program graphics!)
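Still, to pin down what I mean by ORing the edge test, a rough sketch (CPU-side C++ standing in for the lighting shader; the types and attribute names are invented):

```
#include <array>
#include <cstdint>

constexpr int kSamples = 16;   // the 16xMSAA case discussed above

// Hypothetical per-sample G-buffer data for one pixel.
struct GBufferSample { uint32_t normalPacked; uint32_t materialId; float z; };
using PixelSamples = std::array<GBufferSample, kSamples>;

// A pixel is an "edge" for the lighting pass if ANY attribute differs
// across its samples; testing a single G-buffer would miss edges that
// only show up in the others.
bool isEdgePixel(const PixelSamples& s) {
    for (int i = 1; i < kSamples; ++i) {
        bool normalEdge   = s[i].normalPacked != s[0].normalPacked;
        bool materialEdge = s[i].materialId   != s[0].materialId;
        bool depthEdge    = s[i].z            != s[0].z;
        if (normalEdge || materialEdge || depthEdge)   // the OR in question
            return true;
    }
    return false;   // interior: light sample 0 once and replicate
}
```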

---

As to MSAA compression, this is how I think it works (with a major caveat to come). If you have 4xAA and a pixel consists of:
  • 2 samples: A, B are both colour=red, Z=54
  • 2 samples: C, D are both colour=blue, Z=77
and then along comes a coverage mask saying that samples A, B are blue, Z=77. Before applying this new coverage mask, which implies an edge, the ROP has to read in the pixel's entire previous samples. It should know to do this because the pixel is already flagged as an edge. When the ROP processes the new samples against the existing samples, all four samples end up as blue, Z=77.

Because this checking is performed in the ROP's buffer cache, the final result written back to the buffer (in memory) can be fully compressed, based not on the incoming coverage mask, but on the resulting coverage mask.
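As a toy model of that read-modify-write (not any real ROP, just the bookkeeping in the example above, with the Z test omitted for brevity):

```
#include <array>
#include <cstdint>

constexpr int kSamples = 4;   // the 4xAA example above

struct Sample { uint32_t colour; float z; };
using PixelSamples = std::array<Sample, kSamples>;

// Merge an incoming fragment (coverage mask + one colour/Z) into the
// stored samples, then decide compressibility from the *result*, not
// the incoming mask.
bool mergeFragment(PixelSamples& pixel, uint8_t coverage, Sample incoming) {
    for (int s = 0; s < kSamples; ++s)
        if (coverage & (1u << s))
            pixel[s] = incoming;             // overwrite covered samples

    // Compressible iff all resulting samples now agree.
    for (int s = 1; s < kSamples; ++s)
        if (pixel[s].colour != pixel[0].colour || pixel[s].z != pixel[0].z)
            return false;
    return true;
}
```

Feeding it the example above (A, B red/54 and C, D blue/77, then a mask covering A and B carrying blue/77) leaves all four samples blue/77, so the write-back can be fully compressed.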

---

Having said all that, I've got to wonder if it's worthwhile for the ROPs to even try to write compressed pixels to the buffer, if the pixel was previously an edge. As frame rendering progresses (normal rendering, not deferred), the amount of fragmentation of (interior versus edge) pixels surely means that once you've taken into account buffer tiling and DRAM burst length, there's no value in doing a compressed write to a few pixels (because if there was an edge nearby, there prolly still is)...

In other words, is AA compression a win only when the framebuffer is fresh and the first triangles hit each pixel? When overdraw is currently 0?

---

So, Xmas, I would tend to agree with you, the hassle of converting an edge pixel into an interior pixel is prolly not worth it.

But I think G-buffers, being independent of each other in terms of their raw 16xMSAA samples, need to be edge-tested as a unified whole within the final lighting shader. If you're even going to bother with edge-testing, that is...

Jawed
 
Perhaps there's been a mixup: I was talking about the final phase lighting shader that tries to detect edges in the final 16xMSAA G-buffers, in order to avoid "supersampling" execution across all 16 samples.
Ah, ok. I thought you were talking about the hardware providing this information as a flag to the shader, based on compression data.

As to MSAA compression, this is how I think it works (with a major caveat to come). If you have 4xAA and a pixel consists of:
  • 2 samples: A, B are both colour=red, Z=54
  • 2 samples: C, D are both colour=blue, Z=77
and then along comes a coverage mask saying that samples A, B are blue, Z=77. Before applying this new coverage mask, which implies an edge, the ROP has to read in the pixel's entire previous samples. It should know to do this because the pixel is already flagged as an edge. When the ROP processes the new samples against the existing samples, all four samples end up as blue, Z=77.

Because this checking is performed in the ROP's buffer cache, the final result written back to the buffer (in memory) can be fully compressed, based not on the incoming coverage mask, but on the resulting coverage mask.
There is no resulting coverage mask, only resulting color and Z values. Sure, the result can be compressed, but that doesn't mean it is, because that would actually require comparing all the samples per pixel in a tile before compressing and writing it.

Having said all that, I've got to wonder if it's worthwhile for the ROPs to even try to write compressed pixels to the buffer, if the pixel was previously an edge. As frame rendering progresses (normal rendering, not deferred), the amount of fragmentation of (interior versus edge) pixels surely means that once you've taken into account buffer tiling and DRAM burst length, there's no value in doing a compressed write to a few pixels (because if there was an edge nearby, there prolly still is)...

In other words, is AA compression a win only when the framebuffer is fresh and the first triangles hit each pixel? When overdraw is currently 0?
I'm not sure what you mean here. Compression is only applied to tiles that are completely inside one triangle. So it depends mainly on the size of the triangles. For the later triangles there is a chance that they will be partly covered by previous ones, so they will appear "smaller". Plus, with front-to-back ordering you render distant objects last. But if you render a large, close triangle last, most of it will be compressed.
 