Comparing Cayman to Cypress - it's simply hard to tell how big Cayman is?
You can easily fit a larger chip on cypress' package)
I dont think you've understood what was said in that post. Obviously ATI/NV treats full scene in the post-processing, thats where alpha-masks helps. I see no reason why masks wouldnt work in post-processing with MLAA, they work just fine with other post-processing filters. Only issue is extra work involved.No, it's silly. There's obviously no way for ATI/NV to know what parts of the final image came from 3D rendering (and should have MLAA) vs not. The right solution is for the developers to just implement MLAA (or one of the numerous better alternatives like... MSAA or a hybrid) and apply it to the right places. None of this control panel BS.
Ohayo gozaimasu:
So the rumors about 2GB of RAM are fake?
24 Cores ?
So either 1536 ALU VLIW 4 or 1920 ALU VLIW 5 .
24 Cores ?
So either 1536 ALU VLIW 4 or 1920 ALU VLIW 5 .
indeedOr maybe its Cayman Pro with only 24 of 30 SIMDs active.
Clock looks too high for me for Cayman Pro. Even if that's the Pro it seems unlikely so many simds would be deactivated, that is XT would have fewer than 30 simds. Also, if that's 24 simds what's the organization? 2x12? 3x8? 4x6?Or maybe its Cayman Pro with only 24 of 30 SIMDs active.
A "Pro" with a higher core clock than a "XT"? Would be a first.indeed
(860*1920)/(890*1536)= %20,7
If 1920 is viable for VLIW-4, then that would be 30 SIMDs of 64.Clock looks too high for me for Cayman Pro. Even if that's the Pro it seems unlikely so many simds would be deactivated, that is XT would have fewer than 30 simds. Also, if that's 24 simds what's the organization? 2x12? 3x8? 4x6?
Ohayo gozaimasu:
cut
Didn't understand you there , could you please reformulate the question ?How much bandwidth would 6970 generate if it's 1536 @ 890Mhz with 1GB memory @ 4800 Mhz?
If VLIW 5 lanes had a utilization rate of 60~80% , then VLIW 4 lanes would up that to 80~85% (maybe more) , an improvement of 20% at least .If it's really 64 lanes per SIMD, then we get 1536 lanes. The increase in SIMD count and the increased average utilisation of VLIW-4, in comparison with VLIW-5, would compensate for 1536 being lower than Cypress's 1600 lanes.
Yes, but I was thinking lately maybe efficiency drops if you have "too many" simds per shader engine. And if you don't like the 3 shader engines, what about 4 instead? Though I agree only 6 per dispatch processor would be quite low - I want the chip to have 28 simds in a 4x7 arrangement, with the pro being 4x6 instead . (Barts also has only 7 simds in a group, though they are of course VLIW-5.)Which would appear to be organised as 3 shader engines, each with 10 SIMDs. Disabling 2 SIMDs in each would lead to 24 SIMDs.
One of my qualms here is that 3 shader engines doesn't mesh with 8 quads of ROPs. If screen space is tiled amongst shader engines, then it doesn't divide equally amongst 3 shader engines.
This makes sense. Even if you assume you could get the same performance out of a VLIW-4 simd compared to a VLIW-5 (which is a bit of a stretch) 24 is only 20% more simds however, so performance improvements beyond that have to come from elsewhere. Also note there's an obvious difference between utilization of alu slots and alu instructions issued per clock - since transcendentals now require 3 slots even serial dependent transcendentals have 75% utilization - but obviously they aren't any faster than the 20% utilization of the same sequence in Evergreen.Alternatively 1920 ALU lanes was someone's mistaken interpretation of 24 SIMDs, each with 80 ALU lanes - way back in the mists of time. If it's really 64 lanes per SIMD, then we get 1536 lanes. The increase in SIMD count and the increased average utilisation of VLIW-4, in comparison with VLIW-5, would compensate for 1536 being lower than Cypress's 1600 lanes.
Isn't that just LDS size (would be same as Evergreen)?L1 is 32KB then?