one other question
If ontario at 800MHz and without gpu consumes 5W, how much must be downclocked to reach sub 1W? :|
Those TDP numbers don't tell much about how they will compare to Atom. E.g. I doubt the 18W part really consumes twice what the 9W part does. We'll need idle consumption, chipset wattage and real measured numbers to say anything certain.
But the numbers are in line with what was expected, right?
Clock for clock, most likely. Given what we know about Bobcat's architecture, I can't see how it could be faster than Stars, clock for clock.
After looking at the Whetstone code I found that additions and subtractions play a bigger role than multiplications and there are also several divisions and even square root, and transcendental and trigonometric functions. The add/sub instructions shouldn't have a lower throughput like DP multiplications. And I remembered another paper, covering the division algorithm. You can read it here. The described Goldschmidt division algorithm achieves a rather low latency using the rectangular multiplier.
Comparing the Bobcat core to an Atom core at 1.66 GHz (as this one), the benchmarked Bobcat cores are about 2x the integer performance and 3x the FP performance of Atom. Thanks, informal.
Those TDP numbers don't tell much about how they will compare to Atom. E.g. I doubt the 18W part really consumes twice what the 9W part does. We'll need idle consumption, chipset wattage and real measured numbers to say anything certain.
But the numbers are in line with what was expected, right?
what do we really know about Bobcats architecture to determine IPC? we know nothing about the stuff that matters, how clever the branch predictors, prefetch, it can decode and retire two instruction per clock so depending on the scheduler/alu's there is a real chance of higher IPC vs stars. But you then have to add in the overhead of using pointers and clocking up of required cache lines/predictors/etc.
reading the published slides there is alot on power reduction
http://www.anandtech.com/Gallery/Album/753#1
Also i know people are a little caught up on clock speed because you know the 3.2ghz precossots where so good compared to a 2.X conroe.
according to AMD slides bobcat has both clock gating and power gating.
then you have the salty stuff like
http://citavia.blog.de/2010/07/01/some-updates-regarding-ontario-boinc-and-bobcat-8899248/
cheers
The TDPs paint Ontario into a higher-power niche than what Atom can reach.
The lowest TDP Ontario is an 800 MHz single core with GPU off.
Atom can manage 1.6 GHz with a GPU, at a .5W penalty over the lowest Ontario.
Ontario may be good, but a 50% clock deficit is a lot, and that's with no GPU at all.
The top-end Ontario chips have a significantly higher TDP than the nearest Atoms. While it is likely that they will outperform Atom, the TDP difference is potentially more signficant for embedded platforms.
Perhaps Ontario could have been engineered to offer a higher clock rate at 5.5W, though the argument would be that its density figures would be much closer to Atom if its frequency/voltage range was that wide.