jeff_rigby
Banned
IT IS NOT AN ADD-ON FOR PS3! The patent describes a hardware feature of a processor that needs a massive connecting bus to function as a processing extension. There's nothing about the PS3 that enables this - it's original Cell lacks the dual-channel IO interface hardware, and it has no external bus that can support the patent's suggest 60 GB/s communications systems. This rumour needs to die.
It can't be. you'll never have an external interconnect capable of running as fast as on-chip communications. Three seprrate dies will communicate with each other an order of magnitude slower than on the same die, and memory access will be less efficient if they're having to manage memory access in three places instead of just one. The only way your proposed idea would make sense is if there's a reason to have a GPU with embedded Cell, such as for a common graphics engine. But that prescribes the same GPU for all devices, which doesn't make sense as GPU requirements vary considerably. It makes far more sense to have separate 4-SPU Cells connected up with whatever GPU is suitably to that device, and then use the patent to combine these Cell across devices. If you need more than that 4 SPU chip in numbers needed for a console, it makes more sense to have them all on one chip, in exactly the same way we have multicore chips for a reason!
The patent describes the processes in which a 4 element SPU Cell could be used. Those are descriptions of SOFTWARE functions. Direct connection to the memory buss was mentioned.
Limiting factors are now pin count/heat with density and real world functions. I.E. Waiting on input and output. Having a common memory buss that is running nearly 1:1 with the CPUs changes design criteria.
We are not talking Cray speeds here so some issues are irrelevant like requiring all three Cell processors on the same die to reduce the length of buss lines. RISC chips were designed to increase yield by having fewer transistors in a CPU which both reduced % failures and allowed more CPUs per wafer. Putting 3 Cell processors in the same package would increase costs because % failure would increase with complexity. Failures due to heat would also increase.
There have been papers describing the features and faults in a heterogeneous CPU design like the Cell and I believe the conclusion is the idiot- Savant SPU needs supervision and a ratio of more than 1PPU to 4 SPUs results in performance issues. Thus having more than 4 SPUs connected to a common buss with one PPU is not a good design. Just so we understand each other, the buss inside a Cell should contain a ratio of 1PPU to 4 SPUs. This internal on die buss should not have other Cell processor connected to it. Outside the cell it's connected to a common memory buss.
There is, as you mentioned, issues with multiple Cells addressing the same memory buss (or inside a die the same cell buss) and I expect a max number might be three before collisions reduce performance. The Cell between real world I/O and the memory buss would decrease memory access and increase performance.
There will be NO three 4 element SPU cells on the same die. This has been confirmed by IBM (no 32 SPU or 16 SPU chips). Separate 4 SPU cells will be positioned far enough apart to more easily allow heat dissipation but close enough that buss length is not an issue. They will have assigned functions like GPU, IO or Central Processor. I can see a die with generalized functions and one 4 element SPU usable in multiple products.
Last edited by a moderator: