I could be wrong though, it's been known to happen.
No Never....
I could be wrong though, it's been known to happen.
Well, non-P4 processors typically do not have a trace cache either, whose whole purpose is to essentially take the decoder stages out of the equation, from what I've read.
Of course, that depends on the hit rate (and miss penalty) of P4's trace cache compared to a more traditional L1-I cache.
except a bulldozer module (L2 inc) is about the same size as a SB core (L3 included). You can spin it any way you want, but the simple fact is that in the 17 watt class intel has a 2c/4t and in a month or two Trinity will be out with 4c/4t and from the looks of it is seems much improved over both bulldozer and stars.But the reality is that HT costs minimal die space, while doubling core count almost doubles the die space (and rises power draw dramatically). You should be comparing 2c/2t with 2c/4t (and 4c/4t with 4c/8t) because those designs are similar in power draw, transistor count and manufacturing costs.
Comparing 2c/4t directly with 4c/4t is not a fair comparison, because the 2c/4t CPU is much cheaper to produce and consumes considerably less power (half the cores, half the execution units, etc).
except a bulldozer module (L2 inc) is about the same size as a SB core (L3 included). You can spin it any way you want, but the simple fact is that in the 17 watt class intel has a 2c/4t and in a month or two Trinity will be out with 4c/4t and from the looks of it is seems much improved over both bulldozer and stars.
Look at the 1.5Ghz data! Sebbi is on to something, I believe The "cave" scene shows that even a mostly fillrate limited scene still needs two physical cores, and so does the "City" scene (same one from my first test but now with the enhanced ugrids and shadows) although four threads is best if you're not going to overclock.
I'm one of those enthusiasts that also care about noise. Having the CPU consume a ton more power and thus needing beefier cooling would mean more noise.in many cases an enthusiast would do just as well with an AMD multicore as an Intel multicore as enthusiast graphic settings.
AMDs way of sharing resources (vector units) between two threads is also a very well designed one. Vector pipelines are longer than scalar/logic/fetch pipelines, so it's harder to keep them fed from just single thread ILP alone. Getting instructions from two threads benefits vector processing even more than generic processing. Additionally vector instructions are not used as frequently as normal instructions (usual usage pattern contains heavy bursts and lots of pipeline idling between). So sharing a vector pipeline & execution units between two threads improves the vector pipeline usage even more than HT does for generic processing.You can spin it any way you want, but the simple fact is that in the 17 watt class intel has a 2c/4t and in a month or two Trinity will be out with 4c/4t and from the looks of it is seems much improved over both bulldozer and stars.
Except with multi-GPU setups, where I've seen some curious test results with Bulldozer. I don't know if Phenom II was better but no point in multi GPU on one of those anymore anyway.That's also an excellant illustration of why AMD multi-core CPUs are just about as good as Sandy Bridge CPU's for enthusiast game settings in current games that can take advantage of multiple threads.
No, pre-Prescott Netbursts had 20 stage pipeline and Prescott and up had 31 stages.
In what workloads?
what?? C2/nehalem had around 13-14.. more or less like K8/10 design.For comparison Sandy Bridge has a 18 stage pipeline.
That's also an excellant illustration of why AMD multi-core CPUs are just about as good as Sandy Bridge CPU's for enthusiast game settings in current games that can take advantage of multiple threads.
In single threaded or lightly threaded games or games which stress the CPU more than GPU then SB can sometimes have a noticeable lead, but in many cases an enthusiast would do just as well with an AMD multicore as an Intel multicore as enthusiast graphic settings.
Yes, that's another point I forgot to mention earlier.
Moving from a single core to even a measily 2.8GHz Northwood P4 with HT was by far the biggest upgrade to system responsiveness I've seen. EVER. And that includes moving from 233MHz P1 with 16M RAM @ 40MB/s to 500MHz P3 with half a gig at some 5x+ higher throughput.
Just give me a 22nm 6.4GHz Xenon CPU
Very small, cheap and fast for entertainment applications.
And probably cool too.
For gaming and entertainment it would be fast and cheap.Even at 6.4Ghz Xenon would be slow compared to a modern mid range quad core.
For gaming and entertainment it would be fast and cheap.
Probably something around 230 GFlops peak