Testing RCP is kind of the same issue. How do you write a dependent RCP shader, with just R, that can't simply be reduced to (#RCP % 2) RCPs? Since rcp(rcp(x)) = x, every pair of chained RCPs cancels out.
I can't
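To illustrate why the dependent chain is a problem, here is a minimal Python sketch (not shader code). The names `rcp_chain` and `simplified` are made up for this example, and it uses exact fractions so that floating-point rounding doesn't blur the point. It only shows that a chain of n dependent reciprocals is algebraically the same as n % 2 reciprocals, which is the reduction a compiler might be able to apply.

```python
from fractions import Fraction

def rcp_chain(x, n):
    """Apply n dependent reciprocals to x, one after another."""
    for _ in range(n):
        x = 1 / x
    return x

def simplified(x, n):
    """What the chain reduces to: n % 2 reciprocals."""
    return 1 / x if n % 2 else x

x = Fraction(3, 7)
for n in range(6):
    # The long chain and the reduced form always agree.
    assert rcp_chain(x, n) == simplified(x, n)
```

On real hardware the reciprocal is approximate, so whether a given compiler actually performs this reduction is a separate question.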
Since RCP is a scalar instruction, I think one could use more input slots to make the shader long enough, then estimate the 'real' issue rate.
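One way the estimating step could be done, as a rough sketch: time several shaders with different RCP counts and take the slope of cycles versus RCP count, so that fixed overhead drops out. The function name `cycles_per_rcp` and the sample numbers below are hypothetical, made up purely to show the arithmetic; they are not measurements from any hardware.

```python
def cycles_per_rcp(samples):
    """Least-squares slope of measured cycles against RCP count.

    samples: list of (rcp_count, measured_cycles) pairs, at least two
    of them with different RCP counts.
    """
    n = len(samples)
    mean_count = sum(c for c, _ in samples) / n
    mean_cycles = sum(t for _, t in samples) / n
    num = sum((c - mean_count) * (t - mean_cycles) for c, t in samples)
    den = sum((c - mean_count) ** 2 for c, _ in samples)
    return num / den

# Hypothetical timings: a fixed overhead plus a constant cost per RCP.
made_up_samples = [(8, 50), (16, 82), (32, 146)]
rate = cycles_per_rcp(made_up_samples)
print(f"estimated cycles per RCP: {rate:.2f}")
```

The slope only reflects the issue rate if the shader really is RCP-bound at those lengths, which is the reason for making it long enough in the first place.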