Pantagruel's Friend
Newcomer
I see now you're talking 4*8*(4+1) = 160, but at 2x clock, so effectively 320.
Still the same scheduling issue. RV670 already has fewer transistors than G92, and I think you'd be down to 8 texture units?
Yes, though not down to 8 units but up to 32 (I was thinking of 8 units per cluster, but obviously it could also be 8 quads). I was trying to "save" transistors to make that possible, but as mczak writes, those savings may not be there.
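To make the numbers above easy to check, here is a quick sketch of the arithmetic. The variable names are my own labels for the factors in 4*8*(4+1) and are assumptions, not anything stated by a vendor; the texture count assumes 8 units in each of 4 clusters.

```python
# Hypothetical labels for the factors in 4*8*(4+1); assumptions for illustration only.
clusters = 4
units_per_cluster = 8
lanes_per_unit = 4 + 1          # the "(4+1)" term

alus = clusters * units_per_cluster * lanes_per_unit

# Running the ALUs at twice the base clock is counted here as doubling throughput.
clock_multiplier = 2
effective_alus = alus * clock_multiplier

# Texture units: assuming 8 per cluster across the same 4 clusters.
texture_units_per_cluster = 8
texture_units = clusters * texture_units_per_cluster

print(alus, effective_alus, texture_units)
```

This only reproduces the counting; it says nothing about whether the scheduling or transistor budget would actually allow it.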
Btw. mczak, I understand this much of multiplier design, but to be honest I thought it relied much more heavily on lookup tables, and that's why I didn't assume a large increase in transistor count. It may well be, though, that it works quite differently in the GHz domain than I'd think, so some reading definitely won't hurt :smile:
Thanks for the responses regarding the ring bus; looks like I'll do some reading there as well.