ISSCC 2005

Discussion in 'Console Technology' started by ChryZ, Jan 20, 2005.

  1. DeanoC

    DeanoC Trust me, I'm a renderer person!
    Veteran Subscriber

    Joined:
    Feb 6, 2003
    Messages:
    1,469
    Likes Received:
    185
    Location:
    Viking lands
    Please try again, your completely wrong.
     
  2. Xenus

    Veteran

    Joined:
    Nov 2, 2004
    Messages:
    1,316
    Likes Received:
    6
    Location:
    Ohio
    And someone really needs to learn how to admit they were wrong.
     
  3. version

    Regular

    Joined:
    Jul 27, 2004
    Messages:
    452
    Likes Received:
    5
    2 GHZ =40 watt, synerdistc cpus run 4 GHZ :)
     
  4. AutomatedMech

    Newcomer

    Joined:
    Feb 7, 2005
    Messages:
    10
    Likes Received:
    0
    A 4 Ghz CPU does not exist, Intel CEO even bowed down before an audience to apologize for its canceleration.
     
  5. Panajev2001a

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,187
    Likes Received:
    8
    Correction: a 4 GHz Pentium 4 Prescott does not exist ;).
     
  6. Guden Oden

    Guden Oden Senior Member
    Legend

    Joined:
    Dec 20, 2003
    Messages:
    6,201
    Likes Received:
    91
    No it doesn't. Cell PPC is dual-issue. 970 is considerably wider (as well as MUCH heavier in logic, even with L1 caches stripped it would take up a much bigger die area than is allotted for the PPC core. Read Hannibal's article on Cell if you're confused on this issue...

    The rest of your post is just pure nonsense (and lies).
     
  7. Titanio

    Legend

    Joined:
    Dec 1, 2004
    Messages:
    5,670
    Likes Received:
    51
  8. Ty

    Ty Roberta E. Lee
    Veteran

    Joined:
    Feb 7, 2002
    Messages:
    2,448
    Likes Received:
    52
    LOL. Who is this kook? Obviously DeadMeat needs to be banned again.
     
  9. ChryZ

    Veteran

    Joined:
    Nov 10, 2002
    Messages:
    1,179
    Likes Received:
    3
    Location:
    €UROP€
    Your trolling is not welcome. Stay out of this thread, create your own topics and get those locked.
     
  10. AutomatedMech

    Newcomer

    Joined:
    Feb 7, 2005
    Messages:
    10
    Likes Received:
    0
    Troll? This is analysis of new CELL information. If predictions are coming true then I might mention it.

    Am I the only one concerned about inflated performance? As noted, SCEI is claiming they can fit 16 floats and compute them in parallel in 16 byte wide register for 256GLops (with maybe some vertex compression). KutaragiFLOPâ„¢ definition is interesting but should not be used, for the sake of fair comparison. Not as bad as nvidia FLOPS though.
     
  11. SiBoy

    Newcomer

    Joined:
    Apr 26, 2004
    Messages:
    117
    Likes Received:
    1
    The overall CELL paper this morning was a little disappointing from a disclosure standpoint. They stuck pretty much to the written paper which I already posted some info on.

    The package is a 42.5x42.5mm flip chip BGA. There is going to be a paper at ECTC later this year which is supposed to go into more of the packaging side.

    90nm SOI process with 8 layers of metal (copper interconnect).

    They mentioned 20% of the power was due to leakage and another 20% due to clock tree power, but wouldn't give the absolute numbers (which tells you something...).

    The device taped out in January 2004, about 10 months after the high-level architecture was completed.

    The EIB (bus interconnect) contains 4 128-bit rings with a 64-bit tag. No clue on the actual configuration. The EIB runs at HALF the PPE/SPE clock rate (so 2GHz-ish). The EIB can move 96 bytes total per cycle.

    Everything connected to the EIB (I listed this earlier) can each individually move 16 bytes per cycle in to/out of the EIB, except for the FlexIO interface which can move twice this much. When they say "per cycle" I'm not sure if they're referring to the EIB half-rate cycle or the PPE/SPE full-rate cycle.

    The PPE and SPE, local 256KB SRAMs, etc. are all on one clock network (same frequency, sorry AutomatedMech). The EIB is another clock network, and the external memory interface a third.

    Incidentally, the SPU/SPE paper referred to 3 other papers submitted to the IEEE Symposium on VLSI Circuits for June '05 (describing physical design details, the fixed point unit of the SPE and the floating point unit of the SPE).

    One last detail from the SPE/SPU talk I forgot to list:

    The even pipeline contains the simple fixed point and SP float instructions, shifts/rotates, integer multiply-acc, byte operations (pop count, absolute differences, byte average, byte sum).

    The odd pipeline contains the permute, load/store, channel read/write (built-in blocking message passing interface supported by 3 instructions: channel read, channel write and read channel capacity) and branch instructions.
     
  12. V3

    V3
    Veteran

    Joined:
    Feb 7, 2002
    Messages:
    3,304
    Likes Received:
    5
    Can you give us the pipeline length of those processors, if not too much trouble ?
     
  13. nAo

    nAo Nutella Nutellae
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    4,400
    Likes Received:
    440
    Location:
    San Francisco
    What about SPE instructions latency/troughput? :?:
     
  14. version

    Regular

    Joined:
    Jul 27, 2004
    Messages:
    452
    Likes Received:
    5
    'The even pipeline contains the simple fixed point and SP float instructions, shifts/rotates, integer multiply-acc, byte operations (pop count, absolute differences, byte average, byte sum).'

    hmm, iam dissapointman , dont run paralell integer and floating
     
  15. darkblu

    Veteran

    Joined:
    Feb 7, 2002
    Messages:
    2,642
    Likes Received:
    22
    very informative post, SiBoy, still: in which pipe is the fmadd/fmacc ? ..is there such a single op at all?
     
  16. tweakoz

    Newcomer

    Joined:
    Feb 8, 2005
    Messages:
    1
    Likes Received:
    0
    Location:
    san diego, ca
    didnt it ever occur to you ?

    didnt it ever occur to you that the 16 wide (across a 128bit register) SIMD could be refering to integer SIMD ?

    thats my guess.... (since ps2 had that same support)

    mtm
     
  17. version

    Regular

    Joined:
    Jul 27, 2004
    Messages:
    452
    Likes Received:
    5
    you speak from DIVIDE please:) latency ?
     
  18. Npl

    Npl
    Veteran

    Joined:
    Dec 19, 2004
    Messages:
    1,905
    Likes Received:
    7
    there are "simple fixed point operations"?
    I could only imagine that for a very limited range you would have a 32Bit Mantissa, other than that, why use it :oops:
     
  19. SiBoy

    Newcomer

    Joined:
    Apr 26, 2004
    Messages:
    117
    Likes Received:
    1
    Here are pipeline depth and instruction latency for each. I'll use the notation {E/O,A,B} where E/O indicates even or odd pipe, A is the unit pipeline depth, B the instruction latency.

    word arithmetic, logicals, count leading zero's, selects, compares {E,2,2}

    word shifts and rotates {E,3,4}

    SP floating point multiply-accumulate {E,6,6}

    integer multiply accumulate {E,7,7}

    byte pop count, absolute sum of differences, byte avg, byte sum {E,3,4}

    permute {O,3,4}

    load/store from 256KB SRAM {O,6,6}

    channel read/write {O,5,6}

    branches {O,3,4} (mispredict is 18 cycles like I said earlier)
     
  20. version

    Regular

    Joined:
    Jul 27, 2004
    Messages:
    452
    Likes Received:
    5



    6,7 ??? what a fuck this is much, ps2 VU 4 latency
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...