PZ said:Oh god, fugly ... so the software can know 18 cycles ahead of time which way a branch will go, but will not have a way to tell the hardware that?
Yeah, I am concerned that this chip will require a super compiler, OS, and scheduler to work properly without overwhelming the programmer. It seems as though they took a lot of the complexity out of the hardware in order to get speed and shifted the complexity to the OS and compiler and ultimately back onto the general purpose PPC core (is that good?)..
Compiler, certainly yes. But OS ? The OS will run on the PPE, all it has to do to make the SPUs run correct is set up the IOMMUs (or equivalent) for these, - similar to what all OS already do for PCI on hardware that support IOMMUs.
SPU context switches are going to be super expensive, but that's allright since the entire thing is only meant to run one big honking app. at a time.
Cheers
Gubbi