DemoCoder said:Tridam said:For example, my tests showed that RSQ is done in 2 cycles by the first unit. Microsoft says that RSQ should use only 1 instruction slot.
Slot != cycle time. Microsoft says "most instructions should execute in 1 cycle". It doesn't demand it. Slots are just a mechanism for counting the max number of instructions.
Of course but I don't understand where you want to go with this... If a 1 slot instruction takes 2 cycles to execute it conflicts with the spec from my understanding. Try running more than 256 of this instruction on a 512 instructions slots limited architecture and you'll see what I mean
Maybe I'm jus wrong but I think that the max instruction caps corresponds roughly to the max number of pipeline passes. So the hardware won't always be able to execute every shader even if it fits into the max number of instruction slots. It could be the case when an instruction takes more cycles to execute than its number of instruction slots. If I'm true then there's a lot of other cases.