Hmm, 6XS and 8XS look like 6X and 8X (i.e., the MS is OG...atleast that is my theory). What would cause that? I wonder if it is a driver bug?
Anyways, I have a question: Why can't the GF FX do 4xS using the post filter? I.e., it was said the 4xS shots represented game play without computer side post processing.
I have some theories on how it could work using postfiltering...could someone tell me if my reasoning is invalid?
....
Here is my guess at an explanation of GF 4 modes (please correct if in error...which seems likely because I just made it up
) and two theories on GF FX operation.
If there is a conceptual error, please point it out and I'll go back and fix it throughout if it wrecks my theory. Please ignore the RAM squander
in theory 2 for the purposes of theorizing a 256 MB card sometime in the future.
I assume,
for the purposes of analyzing performance drawbacks, the postfilter is capable of all of the operations I specify below:
(I hope I'm not spouting badly conceived math everywhere...
)
Per Hyp-X correction, D = blend render rate, D' = blend read rate, R = refresh rate.
It struck me that changing bus utilization of one due to constraints of the bus in regards to the other might be a factor...for example, output blend bandwidth usage could be 1/N blend read bandwidth usage...and the difference between the two might factor in analysis.
Also, I should list vsync and triple buffering as possible factors in latency effects.
BLENDED NxS:
bandwidth =
(N/2)x SS write * 2x MS write (back buffer)
+ D' * (N/2)x SS read * 2x MS read (back buffer)
+ D * 1x write (front buffer) +
"latency effect" (from tying up write bandwidth) +
R * 1x read (front buffer)
RAM =
1x front buffer RAM used
RAMDAC NxS (theory 1, if efficient access isn't a problem):
bandwidth =
(N/2)x SS write * 2x MS write (back buffer)
+ D' * (N/2)x SS read * 2x MS read (back buffer)
+ D * 2x MS write (front buffer) +
"latency effect" (from tying up write bandwidth) +
R * 2x MS read (front buffer)
RAM =
2x front buffer RAM used
RAMDAC NxS (theory 2, if efficient access for blending is an issue above):
bandwidth =
(N/2)x SS write * 2x MS write (back buffer)
+ R * (N/2)x SS read * 2x MS read (front buffer)
RAM =
(N)x front buffer RAM used
Also,
CCdiff = the difference between theoretical maximum MS color compression and actual compression, ignoring overhead (for a screen with no edges, a theoretical "0"). The "latency effect" from this difference is ignored.
Compared to GF 4 implementation (positive is a
possible penalty,
in red if always so, and a negative is a benefit):
For NxS (theory 1):
+ D * (CCdiff)x write + R * (CCdiff)x read + 1x front buffer RAM used (might cause
further performance hit on 128 MB card at higher resolution)
For NxS (theory 2):
+ (R - D') * (N/2)x SS read * 2x MS read - D * 1x write - "latency effect" - D' * 1x read + (N-1)x front buffer RAM used (might this be conceivable for 4xS at some resolutions, even with 128MB? Looks pretty ugly for anything above that even on a 256 MB card I think...)
Assuming SS has no color compression benefit, and the postfilter is capable of this blend:
(N/2)x SS read/write * 2x MS read/write = (N/2) * (CCdiff + 1x read/write)
(R - D') * (N/2)x SS read * 2x MS read - D * 1x write - R * 1x read =
R * (N/2)x SS read * 2x MS read - D' * N/2 SS read * 2 MS read - D * 1x write - R * 1x read =
R * (N/2) * (CCdiff + 1x read) - D' * (N/2) * (CCdiff + 1x read) - D * 1x write - R * 1x read =
(R - D') * (N/2) * CCdiff + R(N/2 - 1) * 1x read - D' * (N/2) * 1x read - D * 1x write
+ (R - D') * (N/2) * CCdiff + R * (N/2 - 1) * 1x read - D' * (N/2) 1x read - D * 1x write - "latency effect" + (N-1)x front buffer RAM used
With CCdiff presumed to be near 0, bandwidth savings is primarily determined by the relationship of R to D and D'.
For NxS (theory 2) to offer bandwidth savings using post filter, R * (N/2 -1) would have to be less than D' * (N/2) + D.
Assuming D and D' equivalent, render rate (D) would have to be > refresh rate (R) * (N - 2) / (N + 2), or : D > R/3 for 4xS, D > R/2 for 6xS, D > R * 0.6 for 8xS. Tthis does not look to have any redeeming values.
And/or my math could be in error.
Hmm...this time I'm pretty positive I've messed up the math but I'll check it at a later time.
---
If my reasoning isn't fatally flawed, EDIT1:
is the above the reason for no postfilter usage for anything besides 2x and Quincunx?
EDIT2: I've gone and corrected based on Hyp-X's correction (or atleast my understanding of it) from another thread in
brown, and added
italic edited comments for clarification. Note that I haven't performed a sanity check on the math yet, and it might show.
PS: Yes, I know the colors are overboard, I'm seeking distraction.