JF_Aidan_Pryde
Regular
It's said that you can pass the computed results of one SPE to another and daisy chain them to 'stream' process.
Why would you want to do this?
The only reason why we want to pass on work is if it can be done by the next stage better than the current stage. But each SPE is identical. Why would I want to pass on work when I can just do it myself using multiple cycles?
I mean it's not like the work load has to go from SPE0 -> SPE7 before it can be outputed. We can store it back into main memory from any SPE, at any time.
So why stream?
And also, what is *really* the difference between a vector CPU (eg. Cray) and a SIMD SPE?
Why would you want to do this?
The only reason why we want to pass on work is if it can be done by the next stage better than the current stage. But each SPE is identical. Why would I want to pass on work when I can just do it myself using multiple cycles?
I mean it's not like the work load has to go from SPE0 -> SPE7 before it can be outputed. We can store it back into main memory from any SPE, at any time.
So why stream?
And also, what is *really* the difference between a vector CPU (eg. Cray) and a SIMD SPE?