If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#1051 | |
|
Senior Member
Join Date: Feb 2002
Posts: 2,569
|
Quote:
Miss own L1 cache Miss own L2 cache Broadcast read-request through uncore. Tags in other modules are checked. Hit in foreign L1 Wait for foreign L1 store queue to drain Read data from foreign L2. Cheers
__________________
I'm pink, therefore I'm spam |
|
|
|
|
|
|
#1052 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,134
|
The 30ns time period for the in-module transfer may be due to the draining of queues and traffic to the WCC, L2 and L1.
The weird part is when it goes across the interconnect, where the latency shoots up for all AMD chips. Is it querying the memory controller, is it the SRQ? Are there outputs for dual/quad/hex AMD cores for comparison?
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#1053 | |
|
Senior Member
|
Quote:
CPU0<->CPU1: 120.9nS per ping-pong Not sure what specific model or at what speed it was. |
|
|
|
|
|
|
#1054 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,134
|
That big chunk of time spent in the uncore puzzles me.
Perhaps it is waiting on the SRQ to process through or buffers in the IMC to empty. I wonder what Intel is doing differently, since it has had an integrated north bridge since Nehalem.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#1055 |
|
Member
Join Date: May 2007
Posts: 131
|
My PII 940 @ 3000Mhz
Code:
CPU0<->CPU1: 113.6nS per ping-pong CPU0<->CPU2: 113.5nS per ping-pong CPU0<->CPU3: 113.4nS per ping-pong CPU1<->CPU2: 114.2nS per ping-pong CPU1<->CPU3: 112.8nS per ping-pong CPU2<->CPU3: 113.0nS per ping-pong |
|
|
|
|
|
#1056 | ||
|
Senior Member
|
Quote:
Quote:
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
||
|
|
|
|
|
#1057 | ||
|
Member
|
Quote:
PCI-E isn't a part of it untill Lynnfield released Quote:
__________________
Well I'm not a native English speaker so there might be misuse through my words. I just hope it won't cause too much misunderstanding. |
||
|
|
|
|
|
#1058 |
|
Senior Member
Join Date: Dec 2006
Posts: 2,290
|
I just traded my 5Ghz Phenom 2 x6 for Sandy Bridge, If I would of checked this a few days ago I could of tested the latency all varying clocks.
Guys when testing the latency do one at default and then run it again with an overclock on the HT and Northbridge.
__________________
(\__/) (='.'=) This is Bunny. Put Bunny into your sig to help him take over the world. (")_(") |
|
|
|
|
|
#1059 |
|
Senior Member
|
More results, this time from stock FX8120:
|
|
|
|
|
|
#1060 |
|
Member
|
Besides the high latency we can guess, One thing is interesting
The number of the eight cores in Bulldozer are [01][27][34][56] resptectively Where the "[]" represent the module That's so wired..
__________________
Well I'm not a native English speaker so there might be misuse through my words. I just hope it won't cause too much misunderstanding. |
|
|
|
|
|
#1061 | |
|
Senior Member
Join Date: Feb 2002
Posts: 2,569
|
Quote:
Cheers
__________________
I'm pink, therefore I'm spam |
|
|
|
|
|
|
#1062 |
|
Senior Member
|
Probably the low idle clocks and TurboCORE are dragging the timings long here. The test is loading each pair of cores once at a time until all non-repetitive permutations are exhausted.
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#1063 |
|
Senior Member
|
Shouldn't the cores pick up speed quite fast once they get any load? The test loads each core for several seconds, isn't that enough time?
|
|
|
|
|
|
#1064 |
|
Member
|
That's sounds reasonable...This software was written at a time where there's no turbo and cnq as well as EIST is just for notebook computer.. But it's still too high compared to others. My friends's X5570 with EIST on just reach about 70ns
__________________
Well I'm not a native English speaker so there might be misuse through my words. I just hope it won't cause too much misunderstanding. |
|
|
|
|
|
#1065 |
|
Senior Member
Join Date: Sep 2003
Posts: 2,076
|
PII x6 1055T @ 3.7 & 2.4NB, turbo off
CPU0<->CPU1: 94.4nS per ping-pong CPU0<->CPU2: 91.6nS per ping-pong CPU0<->CPU3: 91.6nS per ping-pong CPU0<->CPU4: 93.6nS per ping-pong CPU0<->CPU5: 93.3nS per ping-pong CPU1<->CPU2: 93.8nS per ping-pong CPU1<->CPU3: 93.3nS per ping-pong CPU1<->CPU4: 95.2nS per ping-pong CPU1<->CPU5: 96.7nS per ping-pong CPU2<->CPU3: 91.2nS per ping-pong CPU2<->CPU4: 91.8nS per ping-pong CPU2<->CPU5: 92.3nS per ping-pong CPU3<->CPU4: 92.3nS per ping-pong CPU3<->CPU5: 95.0nS per ping-pong CPU4<->CPU5: 95.4nS per ping-pong
__________________
But it's DOUBLE CONFIRMED |
|
|
|
|
|
#1066 | |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,134
|
Quote:
The northbridge has been a shadow of its former self since.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
|
#1067 | |
|
Member
|
Quote:
What's the core arbitration logic?
__________________
Well I'm not a native English speaker so there might be misuse through my words. I just hope it won't cause too much misunderstanding. |
|
|
|
|
|
|
#1068 |
|
Member
|
Maybe we can turn cnq & tubro down and see what's gonna happen.
__________________
Well I'm not a native English speaker so there might be misuse through my words. I just hope it won't cause too much misunderstanding. |
|
|
|
|
|
#1069 |
|
Senior Member
|
It's official now: AMD Revises Bulldozer Transistor Count: 1.2B, not 2B
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#1070 |
|
Senior Member
Join Date: Jul 2008
Posts: 2,160
|
How is it even possible that they initially "mistook" the number of transistors by that much?
Could this have been a reason for some layoffs in the marketing department? |
|
|
|
|
|
#1071 |
|
Senior Member
|
AFAIK, exact count of the planar elements for any IC comes from the manufacturing foundry first. But probably some miscommunication within AMD departments could carry the blame.
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#1072 |
|
Senior Member
|
I'd love to know how did AMD's transistor density grow going from 65 to 32nm if those numbers are correct
|
|
|
|
|
|
#1073 |
|
Senior Member
|
Llano's density is heavily skewed due to the presence of a highly compact structure like the IGP part, that takes a hefty chunk of the transistor budget.
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#1074 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,134
|
AMD is not maintaining a consistent count still. The number of transistors per module it disclosed earlier is 213M, so that x4 plus 400M in the L3 is enough to hit 1.2B, so something still seems off.
Going by 1.2B, the density scaling is notably inferior to Intel, probably due to that bloated uncore. The Anandtech count for SB may not be comparable to AMD's wonky count. They are using the schematic count of 995M, while physically it has 1.16B.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#1075 |
|
Member
Join Date: Nov 2006
Location: Somewhere over the ocean
Posts: 634
|
Bulldozer!
Now with 40% less transistor! |
|
|
|
![]() |
| Tags |
| amd, blewdozer, oh well, patents |
| Thread Tools | |
| Display Modes | |
|
|