A simple (not sophisticated) OoOE could double that scallar performance without a large watts or silicon space penalty.
Even simple OOO will require fast tracking logic and that consumes power, it'll also require a whole load more read ports on the register file, that's lots more power. In order to allow multiple issue you'll need to double the execution units, that is again more power and a load more chip space. You'll also need to increase the SPE LS read bandwidth and memory bandwidth as well. OOO is going to have no small impact on Wattage.
As for OOO doubling computing power, how? Intel quotes high figures (300%) for OOO but that's because it's using OOO to get around the limitations of the rather limited x86 instruction set, OOO effectively gives it a lot more registers than it has as standard.
PPC is quite different and IBM quotes figures of around 35%, there's less limitations to get around so it's not helping nearly as much.
The SPEs are different again, they have tons of registers and the compiler can make use of them to do exactly the sorts of things OOO does, it's impact on SPE is likely to be rather less than even 35%
So in the case of the SPE while OOO may gain some performance, it's likely to be very limited. The cost of adding it likely to be much greater than any performance gained and you'll probably find that's exactly why it's not present.
If you ask me they should drop the PPC cores. Way too many problems, especially because they are in-order - Cache misses, Load-Hit-Stores, Branch penalties etc.
You evidently haven't heard of POWER6, an in-order PPC running at 4.7GHz, it's predecessor was an aggressive OOO design but the new chip runs rings around it.
I expect some of this tech could make it's way into future PPEs & Xenons giving a healthy performance boost without a big power increase.
--
As for MS using Larrabee, I doubt it but I wouldn't fully discount it. MS could always include a Xenon for backwards compatibility and OS functions and use Larrabee for the grunt.
However Intel's problem is they won't want to sell MS a powerful version of the chip as it'll potentially hit their high end.