Follow along with the video below to see how to install our site as a web app on your home screen.
Note: This feature may not be available in some browsers.
In five years' time we could be looking at systems that use wireless buses. Suddenly the point-to-point restrictiveness of ultra-high-speed memory buses disappears.
Another thing to consider is that x86 cores have fairly lowsy floating point performance because they're focussed on correctness, IEEE, double-precision, the works.
Surely it can't be long before Intel decides it's time to put in some single-precision pipelines and steal-back some of the glory that Cell's taken?
At the same time, ATI knows it is practically impossible to compete on Intel's home turf: double-precision. That complexity may never come to GPUs, it's utter over-kill and immensely costly in terms of transistors (well over 2x I think).
In five years' time we could be looking at systems that use wireless buses. Suddenly the point-to-point restrictiveness of ultra-high-speed memory buses disappears.
A wireless bus for stacked chips was developed by utilizing inductive coupling among them. This paper discusses inductor layout optimization and transceiver circuit design. [...]
A test chip was fabricated in 0.35-[micrometre] CMOS technology. Accuracy of the models is verified. Bit-error rate is investigated for various inductor layouts and communication distance. The maximum data rate is 1.25 Gb/s/channel. Power dissipation is 43 mW in the transmitter and 2.6 mW in the receiver at 3.3 V. If chip thickness is reduced to 30-[micrometre] in 90-nm device generation, power dissipation will be 1 mW/channel or bandwidth will be 1 Tb/s/mm2.
In five years' time we could be looking at systems that use wireless buses. Suddenly the point-to-point restrictiveness of ultra-high-speed memory buses disappears.
http://www.research.ibm.com/actc/workshop2006/perrone.pptThat supercomputer prolly derives most of its DP capability from Clearspeed chips.
http://www.beyond3d.com/forum/showthread.php?t=33410
Jawed
But Cell DP doesn't exist yet... Otherwise we'd have heard about it.
If you run the DP numbers:
Clearspeed - 32,000 (2 per board) @ 25 GFLOPs : 800 TFLOPs
Cell - 16,000 @ 31 GFLOPs : 500 TFLOPs
Opteron - 16,000 @ 12 GFLOPs: 200 TFLOPs
Total of 1.5 TFLOPs - being a bit generous there...
Jawed
(feeling embarrassed counting GFLOPs)
Clearly I aint qualified on this stuff
Analysis and Design of Inductive Coupling and Transceiver Circuit for Inductive Inter-Chip Wireless Superconnect
I was actually envisaging a broader scale wireless infrastructure - one not dependent on chip stacking (where discrete channels are almost in contact), but something more like a conventional 2D layout on a circuit board. The paper describes something that is effectively point-to-point and clearly I'm thinking of something that's not.
But do ray-tracing, game physics, game graphics or AI need DP? As far as I can tell a lot of the emphasis here is on the data-parallel part (ultra high bandwidth, SIMD/vector) rather than precision, per se.
If IBM can contemplate separate SP and DP Cell processors, why can't AMD and Intel?