nitpicking mode: it's not a cache, it's a local sram. SPUs have a L1 cache too.Guden Oden said:why would Cell in your opinion use a cache that runs at 4x clock of the ALU fed from the cache? You never answered this.
nitpicking mode: it's not a cache, it's a local sram. SPUs have a L1 cache too.Guden Oden said:why would Cell in your opinion use a cache that runs at 4x clock of the ALU fed from the cache? You never answered this.
one said:Brimstone said:but I'm still very skeptical.
ISSCC is the most authoritative conference in the semiconductor academic society and a paper without an actually running sample is not accepted. Don't confuse it with self-proclaimed press releases.
The most common reason for paper rejection is a lack of clear circuit evidence of what is novel in the work and the extent to
which it advances the state of the art. Successful submissions contain specific new results, sufficient detail and data to be
understood, and schematics and measured results for key circuits when appropriate.
aaronspink said:one said:Brimstone said:but I'm still very skeptical.
ISSCC is the most authoritative conference in the semiconductor academic society and a paper without an actually running sample is not accepted. Don't confuse it with self-proclaimed press releases.
Just FYI, this isn't true.
Guden Oden said:Brimstone said:I ask you once more: why would Cell in your opinion use a cache that runs at 4x clock of the ALU fed from the cache? You never answered this.
Because a Stream Processor is constantly prefetching data unlike a regular CPU cache. Also every port on SRAM has the possibliity of being accessed simultaneously on a stream processor.
"IBM, SONY AND SCEI POWER-ON CELL PROCESSOR-BASED WORKSTATION"Jaws said:Jov said:Cryect said:I'm curious where did this 15 TFLOPs workstation come from?
And why would Sony be sending 15 TFLOP workstations for a console thats only 1/15th that at most? (heh I sure hope its not so we can have really nice CGI cutscenes)
The Register link Jaw's points to sounds definately more believable.
I have to say that rack looks nice and maybe the multi-threading is their idea of hyperthreading? Just throwing out a wild guess with that one.
At 90nm, was it 16 TFlops for a rack mountable machine and 2 TFlops for the Workstation?
STI press release, 16TFlops "will reach", implies 2nd gen. The Register mention "prototype" is 2TFlops, implies, they're the ones going to devs as 1st gen.
We are not talking about a cache, but a local ram. SPUs already have a prefetch cacheBrimstone said:Because a Stream Processor is constantly prefetching data unlike a regular CPU cache
who are the actors that needs to simultaneously make accesses to APU local sram?Also every port on SRAM has the possibliity of being accessed simultaneously on a stream processor.
bad braincell!! try to think before postin!!
PC-Engine said:bad braincell!! try to think before postin!!
Please be more careful next time, since you only have one CELL left.
Actually I'm only using one.Please be more careful next time, since you only have one CELL left.
"IBM, SONY AND SCEI POWER-ON CELL PROCESSOR-BASED WORKSTATION"
BM, Sony Corporation (Sony) and Sony
Computer Entertainment Inc. (SCEI) announced today that they have powered-on the first Cell* processor-based workstation.
Vince said:Pana said:The SPU (in CELL related IBM literature and patent portfolio) would be the Synergistic Processing Unit, another name for the APU we have seen in the CELL patents which is as patents say a versatile "stream processor with 4-way SIMD engine"
I've heard that an Synergistic PU is an APU 'core' with a local flow-controller and some sort of cache. Something alone the lines of what I stated here. We'll see if it turns out to be true?
10. A multiprocessor computer system comprising: one or more processors, each processor having a local store; one or more memory flow controllers (MFCs) each included in each processor, a first MFC having a load access pattern leading to a prediction of at least one potential load of data; a system memory; and a cache coupled between at least one processor and the system memory, wherein, in response to the prediction, the data is prefetched from the system memory to the cache before the first DMAC requests the data.
11. The multiprocessor computer system of claim 10, wherein at least one of the processors is a synergistic processor complex (SPC).
12. The multiprocessor computer system of claim 11, wherein the synergistic processor complex (SPC) includes a synergistic processor unit (SPU).