The Blue Gene/L processor has a peak of 2.8 GFlop/s @ 700 MHz
using a 128bit double precision SIMD FPU, 4 Flop per cycle for
DP FP-MADD by the looks of it. If both cores of the chip are
used (1 is usually used for handling MPI) it peaks at 5.6 GFlop/s.
Obviously with SP it could hit 5.6 and 11.2 GFlop/s respectively,
whether such a mode is supported is not clear, though I expect
it is not.
Not bad for 700MHz.
Only the L2 and L3 caches are cohererent so some software
management of memory is required which generally the
second core is used for, managing the scratchpad and allocated
blocks of the 4MB L3 EDRAM.
Not sure about the process, at a guess I'd say 130nm, though
90nm is just about possible, though no that likely.
2 cores per chip.
2 chips per card.
16 cards per board.
32 boards per rack.
64 racks for the full system.
using a 128bit double precision SIMD FPU, 4 Flop per cycle for
DP FP-MADD by the looks of it. If both cores of the chip are
used (1 is usually used for handling MPI) it peaks at 5.6 GFlop/s.
Obviously with SP it could hit 5.6 and 11.2 GFlop/s respectively,
whether such a mode is supported is not clear, though I expect
it is not.
Not bad for 700MHz.
Only the L2 and L3 caches are cohererent so some software
management of memory is required which generally the
second core is used for, managing the scratchpad and allocated
blocks of the 4MB L3 EDRAM.
Not sure about the process, at a guess I'd say 130nm, though
90nm is just about possible, though no that likely.
2 cores per chip.
2 chips per card.
16 cards per board.
32 boards per rack.
64 racks for the full system.