If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#976 |
|
Senior Member
|
Considering people from AMD working on the kernel patch were talking about 3% improvement I'd take that 40-70% with a truckload of salt.
|
|
|
|
|
|
#977 |
|
Senior Member
Join Date: Dec 2004
Location: Toulouse
Posts: 4,133
|
|
|
|
|
|
|
#978 |
|
Member
Join Date: Jan 2006
Location: France
Posts: 197
|
Phenom I were at least competitive against older AMD generation, no ?
__________________
- I'm french. Sorry if you don't understand what i say - |
|
|
|
|
|
#979 |
|
Senior Member
|
Mostly - yes, but there were quite few instances where Phenom slightly lagged behind the top A64 X2 models or even outright took the last place. Later on, the TLB bug patch sliced off a couple of percents on top of this.
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#980 |
|
Entirely Suboptimal
Join Date: Mar 2003
Location: WI, USA
Posts: 6,845
|
Well I thought Phenom I was terrible too.
|
|
|
|
|
|
#981 | ||
|
Dangerously Mirthful
Join Date: Feb 2002
Location: Winfield, IN USA
Posts: 15,292
|
Quote:
Quote:
__________________
Elite Bastards - Adminish “Be polite, be professional, but have a plan to kill everybody you meet.” - General James N. Mattis |
||
|
|
|
|
|
#982 |
|
Senior Member
Join Date: Feb 2004
Posts: 2,439
|
|
|
|
|
|
|
#983 | |
|
Dangerously Mirthful
Join Date: Feb 2002
Location: Winfield, IN USA
Posts: 15,292
|
Quote:
__________________
Elite Bastards - Adminish “Be polite, be professional, but have a plan to kill everybody you meet.” - General James N. Mattis |
|
|
|
|
|
|
#984 |
|
Senior Member
Join Date: Feb 2004
Posts: 2,439
|
|
|
|
|
|
|
#985 |
|
Senior Member
|
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#986 |
|
Senior Member
|
I wonder what difference this will make. Any clues?
__________________
"Well, you mentioned Disneyland, I thought of this porn site, and then bam! A blue Hulk." —The Creature My (currently dormant) blog: Teχlog |
|
|
|
|
|
#987 |
|
Member
Join Date: Jul 2010
Location: Istanbul
Posts: 727
|
at same vid, 300MHz bump
|
|
|
|
|
|
#988 |
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
Since the revision guide only mentions B2 step (not older, not newer) and all bugs there are tagged with "No Fix planned" anyway it's hard to tell but I'd guess nothing earth-shattering. Maybe some slightly optimized design here and there to increase possible frequency at the same voltage a bit?
|
|
|
|
|
|
#989 |
|
Member
Join Date: Jan 2006
Location: France
Posts: 197
|
I tough "20-30% increase" was the way to go with each BD news ?!
__________________
- I'm french. Sorry if you don't understand what i say - |
|
|
|
|
|
#990 |
|
Senior Member
Join Date: Sep 2003
Posts: 2,076
|
Performance is terribly depressing.
Why such slow L3/northbridge??? Its big but not that big & I'd expected the 32nm to allow faster cache plus expected they'd have tweaked it for better performance with the different core architecture & all the years since they launched Phenom I. I saw reference to there being something like 900million transistors 'missing' somewhere in the uncore/northbridge. Its a huge number & they don't even have an onboard PCIE controller like Intel has. Main core clocks are far below my expectation. I can't understand the poor per-clock performance. My understanding was that Bobcat cores were performing well per-clock on a similar architecture, which should have meant good things for Bulldozer. Perhaps they could just stick 8 Bobcat cores on a die
__________________
But it's DOUBLE CONFIRMED |
|
|
|
|
|
#991 |
|
PM
Join Date: Dec 2002
Posts: 1,371
|
I wonder what 32nm versions of Deneb and Thuban would look like...
|
|
|
|
|
|
#992 |
|
Member
Join Date: Jan 2010
Posts: 416
|
They could try to make 2 module desktop chip without the L3 and giant uncore. 2 modules with 2 MB L2 cache vertically aligned would make just 2*30.9 mm˛. Thats just 62 mm˛ + IO and memory controller under the L2 cache. Improve cache (probably just leaving out the slow L3 and uncore would help a lot with latency, L1 associativity).
And with 95W TDP they could bump up base clocks to 5 GHz with that tiny die size. Which in turn would increase cache bandwith too and help a lot with single threaded performance. The fact is 30.9 mm˛ module with 2MB L2 cache looks good, while the 2 bilion transistor serverdozer not. |
|
|
|
|
|
#993 | ||
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
Quote:
Quote:
Also, saying it doesn't even have onboard PCIE is a bit unfair. Even just one HT link will use about the same die area, and this thing has 4 of them, 3 of them unused in desktops. After all Westmere-EP doesn't have PCIE neither. (Of course there's no IGP neither, and that's a fair chunk of die size and transistors of SNB - but this die would have space for an IGP if you'd leave out the unneeded HT links and could use the unused areas - I'd bet Trinity will make far more efficient use of the available die area for desktop use.) |
||
|
|
|
|
|
#994 |
|
Senior Member
|
I think AMD uses 8T SRAM cells for all major memory arrays in BD -- they already do for Llano's L1 caches at least. Factoring in the parity/ECC bits, the L3 cache alone should be ~600M transistors and that's without considering the bunch of SRAM tags. There's hardly any transistors "missing" in there.
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#995 | |
|
Member
Join Date: Jan 2010
Posts: 416
|
Quote:
For AMD same performance on a smaller are would be crucial these days. They sold much bigger chips for less than intel now for several years. |
|
|
|
|
|
|
#996 | ||
|
Senior Member
Join Date: Sep 2003
Posts: 2,076
|
Quote:
Quote:
__________________
But it's DOUBLE CONFIRMED |
||
|
|
|
|
|
#997 | |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,071
|
Quote:
Depending on when the gate count is made, the totals can be different. There was a margin of error of 165M transistors for SB, which is a chip close to 1/2 the transistor count of BD. 330M could be taken off if a proportionate mixup occurred relative to what happened with SB. Given that this is a marketing number, 100M either way could have been rounded in. There goes over half of the supposed disparity. A less optimized circuit implementation may have an even larger inflation than SB. Then we have the remainder for the expanded uncore and connectivity features.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
|
#998 | ||||
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
Quote:
Quote:
Quote:
Quote:
Last edited by mczak; 18-Oct-2011 at 18:35. |
||||
|
|
|
|
|
#999 |
|
Mr. Upgrade
Join Date: Nov 2003
Location: Finland
Posts: 1,335
|
|
|
|
|
|
|
#1000 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,071
|
The general themes seem consistent with an AMD that is trying to build an architecture competitive with Intel, but with more severe constraints in resources and process technology. Potentially, the company's organization and leadership are also inferior.
The passages lionizing Dirk Meyer I could do without, especially since BD is an architecture that was very much a part of his tenure at AMD. At best, the article could congratulate Dirk on owning up to a screwup he had a huge hand in bringing about instead of humiliating himself further. There's some of ranting about how Dirk's honesty about screwing the pooch caused him to be punished by the financial community, as opposed to them rewarding him for failing to compete or something. I'm not sure about Charlie's understanding of the cache hierarchy of BD. The text becomes increasingly muddled at the end, where he starts having problems distinguishing between the front end and the Icache path and the subdivided data cache path. He does not justify why spliting the L2 cache would massively reduce latency for this design. I do not think the quality of Charlie's sources at AMD has improved, or it has, and the quality of AMD is what has gone down. A lot of the article and its sequel is supposition with little in-depth analysis, and honestly I think this thread offers better insight in total, and definitely per word expended, and I do not claim that this thread has any great epiphanies in it. I wonder how much of that article is compensating for Charlie's hinting about secret improvements that would surprise all the doubters in the leadup to the release. If they surprised anyone, they did so in the wrong direction.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
![]() |
| Tags |
| amd, blewdozer, oh well, patents |
| Thread Tools | |
| Display Modes | |
|
|