MrWibble said:
Which is odd considering you're just outputting a stream of FUD, some of which flies in the face of the actual real world experience of those on this board who are programming the thing on a daily basis.
I don't believe that I'm outputing a stream of FUD. But then again, if I'm outputting a stream of FUD many on this board are outputting a stream of propaganda.
You are confusing (I feel deliberately, thought it might just be stupidity) the *choice* of a hw designer to place certain functions in the PPE instead of an SPU with design decisions made out of necessity.
It was not choice. There was a deliberate need on the part of the architects to reduce the size of the SPU in order to fit a large number on the chip in order to meet some marketting created target flops number. The architects and implementors would ideally, in both mine and I'm fairly certain theirs, replicated the PPE as many time as required to get the targetted flops number. The issue with this was the die size of the device that would subsequently be desired.
This led to designing a secondary attached processor with a limitted functionality set. But don't confuse this with choice. There are a lot of significant compromises in the SPU design that are there in order to acheive the flop performance target.
Some of the things you mention aren't even absent from the current SPUs anyway.. memory access?? My SPUs can access memory just fine, thank you very much.
Fine, why don't you program your SPU to walk a linked list stored over 512KB of memory. Did I mention that the walk will end up pseudo-random? Have fun. Let me know how that performance is going.
You are living in a dream world if you think that your SPU can access memory. The closest approximation to accessing memory is setting up a descriptor and initiating the start of a DMA transaction, waiting for the DMA transaction to complete, and then reading some datum from the LS. This isn't memory access.
I might as well suggest that the X360 CPU is crippled and not a "real CPU" because its memory interface is on a seperate chip (the GPU) rather than being integrated.
You might, you'd be two sheets to the wind and jumping the snark.
If the XGPU was not present I'm sure XCPU could have a memory interface, and if the PPE was not present then SPUs could be given more responsibility.
If the SPU's had been given more capability then they would be significantly bigger in size and less would fit on a die. A fully featured SPU would look somewhat like the PPE.
Certainly in practice the machine can work exactly as described by Sony's slides - the SPUs can entirely self-manage without intervention. So whatever you're claiming can be disproven simply by observation - my code certainly hasn't stopped working just because you've said it isn't possible, and runtime performance is not bound by reliance on the PPE.
They cannot self-manage without intervention. They can run long segmented programs, but they cannot self-manage. They rely on the PPE to herd them along and switch out programs when demand so requires.
And I never said your code wasn't possible, its just a long segmented program with branching. Any concept that it is really a multi-tasking kernal is figment of your imagination.
Aaron Spink
speaking for myself inc.