With much higher code density and instruction decoding out of the critical path, much more sophisticated branch prediction & cache architecture & elaborate prefetching, a robust OOOE implementation and Integer ALUs at 4 and 4.8 GHz respectively, a single Xenon core would surpass it only on very,very biased workloads; but given a fair and representive set of real world problems and equally elaborate implementations on both architectures, the P4 would pull far ahead on average.
ditto. i don't see a similarly-clocked (as in 'within %100') in-order core beating an OOOe core in a general purpose scenario. for specialized cases - maybe, depening on what specialization that in-order core has.