Jaws said:
It wasn't thrown around. It was derived. Also, If you read the thread, I've already derived it to be a factor of 8 out. Thanks to Arjan. It would be 8 Gb/sec compressed and 64 GB/sec uncompressed peak. Do people even read threads before commenting these days?
I did see your post, and it had flaws that showed you still didn't understand AF. The 8 GB/s number doesn't only hold for 2xAF, it holds for all AF. And arjan's comment was regarding your assumption of 16 texels, not universally for 2xAF (see below).
Frankly no one bothered to answer Titanios question apart from me. I knew what I was posting thank you. And I posted it clearly so you could see the working. The fact that I didn't know all available info (4 cycle AF2) at the time is neither here nor there. From the subsequent discussion, I learnt something. Which is what a discussion board is about.
So Mintmaster, please read the thread before making a pointless comment.
It was not pointless. Your post was very much authoritative in nature, without even an "IMO" or "I think" or "Sound right?", when you did not know how things worked.
"4 cycle AF2", as you call it, is nothing new. I said several times that Xenos is only capable of 16 bilinearly filtered samples per clock. A trilinear sample is two bilinear samples, and AF has N bilinear samples. This has been the case for
every video card from ATI and NVidia since 2000.
That's six years, buddy. I have absolutely no problem with you not knowing that, but A) I explained it in a post with my comment about 16 samples for a very steep surface, and B) if you didn't know this fact or understand my comment then you're not qualified to make confident calculations like you did.
Even that bit of info from arjan was interpreted incorrectly by you. Most of the time 2xAF would only need 2 bilinear samples (see Dave's comment about trilinear), in which case it's 2 cycle. Even if you're near the mipmap transition, you only need 16 point samples (and thus 4 cycles) if piecing together trilinear samples, which is unlikely because it's doing redundant work. The lower resolution mipmap used in trilinear filtering doesn't need as many samples to cover the same area. In the end it should be viewed as the number of bilinear samples, and that's it, because you don't know anything else about the hardware.
I always assumed 16xAF means 16 bilinear samples total, as this makes sense from a hardware point of view. Either way, as I explained before, it doesn't matter how many samples there are per pixel, because Xenos can only do 16 of them per clock. I said this before your calculation post.
Now, I agree that I wasn't too clear about the texel sharing, but you picked this part up fine. (4 texels for bilinear * 0.5 for sharing) * 16 * 500MHz * 4-bits = 8GB/s. No need to complicate this with your weird starting numbers or "4 cycle AF2" or whatnot. The only thing that can change with AF level is the sharing factor, and when over 2xAF it gets near 0.5 in the worst case. For a simple bilinear filtering, it will be 0.25 worst case.
If you don't like my posts, I don't care, because others find them useful. Just don't go around misinforming people, especially without acknowledging your limited knowledge on the subject.