I've already posted that a page ago, but thanks for the effort.version said:
I am pretty sure that's what it is - as mentioned, this is quite like the approach taken on VUs in PS2, and what you said about execution units being split across pipelines pretty much confirms it.Hmmm, on a second thought, are there different upper and lower Instructionsets
Those instructions can't really be said to be flops now can they... They sound more like ints to me, unless there's a way that I missed learning of to branch to a fractional address or somesuch of course!nAo said:The second pipeline is probably devoted to load/store, dma queues, branching, etc..
If we factor in even those operations we can inflate the 256 GFlops/s figure
I left out div or other complex fp instructions (thanks Faf!)Guden Oden said:Those instructions can't really be said to be flops now can they... They sound more like ints to me, unless there's a way that I missed learning of to branch to a fractional address or somesuch of course!
AutomatedMech said:Something is not right. Each CELL APU burns only 1 watt @ 0.9 V at 2 Ghz???? 11 watts at 5 Ghz??? If IBM had such technology, it can forget about making chips for a living, license that tech to Intel and make billions/year.
DP? Who knows..version said:where is DIVIDE ?
Likely nowhere. Probably has a 1/x estimate instead. Use Newton Raphson to reach desired precision.version said:
PiNkY said:Why is local storage divided into four banks? Can each be individualy addressed during a 128bit load/store and what does "permute" offer (beyond bit/byte permutations) for its large estate requirements...?
Fafalada said:Anyway so if the banks are for that purpose memory is single ported I assume.