AutomatedMech said:CELL uses 970 core as its CPU.
Please try again, your completely wrong.
AutomatedMech said:CELL uses 970 core as its CPU.
AutomatedMech said:
IBM's PowerPC 970FX's voltage/clock scaling chart. PowerPC970FX is fabbed on same fab as CELL and CELL uses 970 core as its CPU. This chart is IBM's admission that PowerPC 970FX burns 100 watts at 2.5 Ghz.
I hit the bull's eye on CELL's clockspeed and FLOPS rating, 64 GFLOPS @ 1 Ghz. Too be honest, even I was shocked I got it years before.
Right now, CELL was designed for 1 Ghz operation and tested successfully upto 1.4 Ghz according to SCEI's released material. The final clockspeed depends on how much loss per hardware Kutaragi Ken is willing to take on the hardware.
AutomatedMech said:A 4 Ghz CPU does not exist, Intel CEO even bowed down before an audience to apologize for its canceleration.
AutomatedMech said:and CELL uses 970 core as its CPU.
AutomatedMech said:I hit the bull's eye on CELL's clockspeed and FLOPS rating, 64 GFLOPS @ 1 Ghz. Too be honest, even I was shocked I got it years before.
Your trolling is not welcome. Stay out of this thread, create your own topics and get those locked.AutomatedMech said:A 4 Ghz CPU does not exist, Intel CEO even bowed down before an audience to apologize for its canceleration.
AutomatedMech said:Troll? This is analysis of new CELL information. If predictions are coming true then I might mention it.
Am I the only one concerned about inflated performance? As noted, SCEI is claiming they can fit 16 floats and compute them in parallel in 16 byte wide register for 256GLops (with maybe some vertex compression). KutaragiFLOP™ definition is interesting but should not be used, for the sake of fair comparison. Not as bad as nvidia FLOPS though.
nAo said:What about SPE instructions latency/troughput?
SiBoy said:nAo said:What about SPE instructions latency/troughput?
Here are pipeline depth and instruction latency for each. I'll use the notation {E/O,A,B} where E/O indicates even or odd pipe, A is the unit pipeline depth, B the instruction latency.
word arithmetic, logicals, count leading zero's, selects, compares {E,2,2}
word shifts and rotates {E,3,4}
SP floating point multiply-accumulate {E,6,6}
integer multiply accumulate {E,7,7}
byte pop count, absolute sum of differences, byte avg, byte sum {E,3,4}
permute {O,3,4}
load/store from 256KB SRAM {O,6,6}
channel read/write {O,5,6}
branches {O,3,4} (mispredict is 18 cycles like I said earlier)