Given that Kepler GPU architecture increased the amount of SP FLOP from 64 to 96 in a single cycle, we understand the market demand for SP. While HPC customers will probably opt to wait for GK110-based Kepler boards, there is a large amount of customers that said that they don't give a "rat's behind" for DP and that they demand a Kepler part for their computational needs.
Nvidia packs two Tesla K10s on a single PCI Express Gen 3 board to deliver 4.58 teraflops of single-precision floating point performance and 320 gigabytes per second of memory bandwidth.
They support ECC memory and parallel programming models such as the message passing interface (MPI) and are cooled by server chassis subsystems.
The Tesla chips are based on the same 28 nm Kepler core Nividia announced for consumer graphics chips in March. It sports 1,536 Nvidia proprietary Cuda rendering cores and 192 control logic cores on a GHz clock. That’s up from 512 and 32 cores in the 40 nm Fermi parts that ran at 772 MHz.
4.58 TFlop/s with two GK104 equals a clock of ~745 MHz. Or if they used the salvage versions with 1344 SPs per GPU
Bandwidth should do nothing for peak flops.You are forgetting that enabling ECC reduces bandwidth and that is probably why it is 4.58.
( Autocad etc ), this could make an excellent base with lower cost/year.
But it does.Bandwidth should do nothing for peak flops.