Can someone clarify this whole mess to a layman?
Instruction Count
Each pixel shader is allowed anywhere from 512 up to the number of slots in the MaxPixelShader30InstructionSlots member of D3DCAPS9 (not more than 32768). The number of instructions run can be much higher because of the looping support. The MaxPShaderInstructionsExecuted cap in D3DCAPS9 should be at least 2^16.
The hardware needs to support 512 instructions for PS and VS separately, or in total (256 for each)?
And yes I have those links for quite some time now and I haven't noticed any differences since then. In any case the way I understood it up to now is that each shader should have a minimum of 512 instruction slots.
Whatever the answer my original question still stands: isn't there headroom enough to base at least one more generation of hardware on dx9.0?