Let's skip the foreplay and head for the main course right away.
1e Question: Almost all chips are synchronous designs and my question would simply be why? Clockless circuits are faster, require far less power without the need for additional power management circuitry, clock gating and other power saving techniques and also producing a lower electromagnetic signature. No more problems with clock skews, clock distribution and signal skews. Basicly lowering design complexity and freeing up some space on the die while making the chip more power efficient and faster. Why wouldn't chip makers in such a competitive market jump over to Asynchronous Circuits?
2e Question: Just how much larger are MIMD stream processors compared to SIMD stream processors (roughly)? And in which types of workloads do they (MIMD SPs) excell? Graphics? Physics? Scientific Computation? Heterogeneous computation?
3e Question: I've been looking at CPU die shots lately realizing just how small the Execution Units are compared to the entire core. AMD's Bulldozer basicly got me wondering wether it would actually be possible to share more circuitry, more specifically parts of the uncore?
That's all, for now. Every bit of explanaition is much appreciated. :smile:
1e Question: Almost all chips are synchronous designs and my question would simply be why? Clockless circuits are faster, require far less power without the need for additional power management circuitry, clock gating and other power saving techniques and also producing a lower electromagnetic signature. No more problems with clock skews, clock distribution and signal skews. Basicly lowering design complexity and freeing up some space on the die while making the chip more power efficient and faster. Why wouldn't chip makers in such a competitive market jump over to Asynchronous Circuits?
2e Question: Just how much larger are MIMD stream processors compared to SIMD stream processors (roughly)? And in which types of workloads do they (MIMD SPs) excell? Graphics? Physics? Scientific Computation? Heterogeneous computation?
3e Question: I've been looking at CPU die shots lately realizing just how small the Execution Units are compared to the entire core. AMD's Bulldozer basicly got me wondering wether it would actually be possible to share more circuitry, more specifically parts of the uncore?
That's all, for now. Every bit of explanaition is much appreciated. :smile: