Thanks! It looks like they got it out a day early, ZOMG!
I don't think this is their entire presentation @HotChips.
Thanks! It looks like they got it out a day early, ZOMG!
Instruction-set extensions ... does it what I think? Real Bulldozer (or 2i:1f shared lego) instructions? Bypass this or that at front or back, exclusive FP mode for a thread, FP-free thread declaration, 128bit/256bit FP-stack distinction, etc.?
Giving up on BD by THAT margin already? Or is it just a typo?If 2 module (4-core) Bulldozer CPUs go up against dual-core Sandy Bridge things could get very interesting.
Giving up on BD by THAT margin already? Or is it just a typo?
I think AMD has just pulled another socket 939 fiasco. AM3 motherboards won't accept Bulldozer.
Oh buggerAM3 motherboards won't accept Bulldozer.
I'd think Bulldozer would be hard-pressed to make up for a full node disadvantage (edit: as in that is what is likely to be its competition for most of its volume release). I don't think it is revolutionary enough to compensate for being late to the party, and that is with the hope that process issues won't constrain it. The smaller die size may be necessary just to keep yield acceptable.
The third ALU/AGU in K8 an K10 was actually mostly redundant, from the point of utilization. AMD left it in the last iteration of its architecture to keep the dispatch symmetry and avoid major and costly layout redesign.The integer side is narrower than Phenom, with a 2 INT and 2 AGU setup. I had thought it could have been something a little more than that, such as having a 4-way INT setup with two of the lanes capable of AGU and some simple operations. It does not seem to be that way, going with what has been released so far.
It's not necessarily a killer because with so many reg/mem ops out there the expected throughput would be closer to the maximum number of sustainable loads, not the number of integer units stuck doing nothing. I would expect a per-clock deficit in codes that do manage to find a way to use 3 or more integer units.
The third ALU/AGU in K8 an K10 was actually mostly redundant, from the point of utilization. AMD left it in the last iteration of its architecture to keep the dispatch symmetry and avoid major and costly layout redesign.
Valencia is a 4 module chip.As for core counts, here is what we have committed to at this point:
* “Interlagos” – 16-core server processor
* “Valencia” – 8-core server processor
* “Zambezi” – 8-core client processor
A hint that they have made sure Windows scheduler will play nicely with Modules? Or more mundane like Anand says power gating etc is Module level.Bulldozer 20 questions said:Modules do impact the way that certain CPU features are addressed
I'll take that as confirmation of my premise that INT:FP instructions is typically at least 2:1Bulldozer 20 questions said:Today most workloads are integer with a much smaller portion being floating point.