Two AMD 40nm chips have been rumoured. Apart from RV740, one could be RV870 or "RV720". In the first half of this year, though, I think we can discount RV870. And then there's the scarcity of "RV720" rumour.The 3 leadoff 40nm chips are - RV740, GT216 and GT218.
IGPs being what they are (glacial), it is definitely possible they are the other 40nm chip not to be seen for most of the year.Not sure what else comes after, amd also needs something to replace their igp, also i swear i read last week somewhere jsen huang let it slip ion would dx11 before the end of the year(will spend rest of today looking for this link ).
Is there a precedent for GP->GT causing a chip to grow substantially?The rumor i heard was ~290mm2, that was last year. Wasn't sure if the extra size was due to the change from 55GP to GT or they had added extra units.
I see VR-Zone has an "update from CJ" attached, which says there'll be an OC version of RV790 bringing 25-30% performance gains. Assuming that's based on clocks and extra clusters, then I suppose 12 clusters would easily fit in 290mm2. 750MHz 12 clusters is 120% faster, at 850MHz that's 36% faster.Sorry is not 40nm, the vr-zone link said 1.3V above TSMC's 1.2V max for 40nm...exceeded on the first chip, doesnt seem very probable.
$300 would be the higher-clocked (XT, or OC as VR-Zone is calling it) GPU, but if it's only 10-15% faster than the Pro GPU then pricing does seem screwy, relying solely upon the difference in memory between them.Is interesting planning to try and put out a US$300 part, that seems way high without a lower priced volume part. Maybe they did something to the MC/RBE's so it only rally works well with GDDR5, samsung expects GDDR5 to be 20% of the market this year.
Texturing shouldn't be bandwidth constrained generally.alu count seems reasonable, but memory bandwidth seems to have very little growth compared to rv770, when it was already texturing limited in certain games.
D3D11 arguably needs a major rejig in on-die memory architecture because pixel shading is allowed to read and write render targets.This makes me wonder, could amd decide to implement some edram (ie xenos like) stuff on future stuff (may be not rv790/740 but on rv8xx perhaps or may be even later), that would relieve a lot of bandwidth requirements. It could be on die if necessary and since display resolutions don't grow as fast as moore' law, the cost could be manageable/negligible in not too distant future.
I'm wondering if all colour blend operations will be performed by adding instructions into the pixel shader to read then blend.
Carsten,
I know that you understand German. So you can read that the informations are refered to AMD and another, not publish source.
The 12 SIMD seems like your own speculation based on "more SIMD" and "+30% overall performance"?Yes, it looks like 12 SIMD units.
Source: AMD RV790 and RV740 not before April
Yeah, agreed 12 clusters implies 55nm. But I still think 12 clusters is pure guesswork.The 12 SIMD seems like your own speculation based on "more SIMD" and "+30% overall performance"?
This would IMHO be a too small chip in 40nm for 256bit gddr5. Sound more likely for the 55GT variant (although I don't believe in a redesign and a more expensive chip for +20% performance, especially not from AMD).
I suspect HD48xx GPUs are a bit short on texturing but with a bit more texturing they'd simply run into a wall with Z fillrate. I wish someone would make a concerted effort to test this stuff.The 16 SIMD seems more plausible regarding size, however, this would probably be quite RBE limited unless something is done on those. Maybe the 16 SIMD and nothing else would be close to overall +30% actual game performance on the same clock?
RV740 RBEs are a big deal - I can't see how they can make use of ~60GB/s unless there's twice as many as in RV730 or the Z configuration is doubled.We still don't know a whole lot about the RV740 RBEs - if they could use those in the '90? Do we even have a reliable size estimate on the '40?
Blimey, there's a tricky question, dunno.What does fixed function blending cost relative to other ROP bits like AA and compression etc?
Remember AA resolve works fine in R6xx's pixel shaders and custom resolve functionality is something that deferred rendering engines can do.Also would pixel shader blends imply that AA happens there too? And if it does how would AA sample compression work in the shaders? Seems like a lot of stuff would slow down and/or bandwidth efficiency would be lost.
(The thought has occurred to me that once Larrabee style GPUs take over, GPU architecture just won't be at all interesting )
(The thought has occurred to me that once Larrabee style GPUs take over, GPU architecture just won't be at all interesting )
That was a very basic estimation, which more than one person came up with back in Dec, before we even had any hard info. Still, 100mm2 isn't too far off when a linear shrink of RV770 is right at ~140mm2.I think I saw 100mm2 for RV740 rumoured, earlier in this thread? Seems way too low to me - somewhere in the region of 120-130mm2 with all the extra ALUs. I'm assuming that it'll be a 4:1 GPU with 8 clusters.
Jawed