Hmm, if the information is true I think the cards would look as follows:
rv770: 6 shader clusters with 16 of the vliw (vec5) units (in contrast to rv670 which has 4 clusters with 16 vliw units). The (8ta, 4tf) texture units are shared among all clusters like with rv670, but there are simply twice as many (or maybe they got downgraded to 4ta, 4tf).
So to make all chips fit:
rv770 = 6 clusters with 16 vec5 units, 8 quad-tu
rv740 = 4 clusters with 12 vec5 units, 6 quad-tu
rv710 = 2 clusters with 4 vec5 units, 2 quad-tu.
So maybe within a cluster you could still only get the results of one quad-tu (for simpler design), but two clusters could get texture sampling results at the same time.
The rv710 seems to have a very high tex:alu ratio (well even with the r6xx series rv615 had a twice as high tex:alu ratio compared to r600, but now it's 3 times higher, and compared to rv610 twice as high), almost overkill, and the rv740 ratio would be quite high too but all chips would have the same coupling of the simd arrays with texture units.
In contrast to "old" chips:
r600/rv670 = 4 clusters with 16 vec5 units, 4 quad-tu
rv630/rv635 = 3 clusters with 8 vec5 units, 2 quad-tu
rv615/rv620 = 2 clusters with 4 vec5 units, 1 quad-tu
It would be possible the tu's are oct-units instead, but I don't think it would make sense (it would mean all the rv7xx chips would have half the clusters as outlined above, but with twice as many vec5 units in them).
Well that's just how I'd design the chips given the number of tmus / shader units in that rumour
I don't mention ROPs because these are boring for speculation. Decoupled completely already in r6xx from both shader clusters and memory interface widths, any number seems plausible.
(btw I still don't quite understand why branch granularity is 64 pixels on r600 - I thought it should be 16, because the clusters don't have to run the same prog. Maybe I misunderstood something there, which would make the whole possible rv7xx configuration I've just written pointless...)