To me it is "just" doubled ... I zoomed into that pic:
http://images.bit-tech.net/content_images/2012/11/amd-fx-8350-review/piledriver-b.jpg
and I see also 4 rows for Piledriver.
Yeah, I was using the big die shot of Zambezi that AnandTech has, which is rather fuzzy. SA's Vishera die shot is much more clear. Not sure what led me down the road to thinking it was 24K, but I think I'm accepting that they're 32KB each. That is some bizzare looking L1D$, if you ask me. I suppose Intel's looks even weirder, though. High associativity caches look odd...
Anyways, I guess that the page size is doubling to 8KB, and the associativity is standing pat at 4?
I did it, see the thread at S|A. The rest is boring ;-)
To me it looked like they used the high-density libraries. If you scale the official AMD demonstration pic so that the retire banks have the same size, then the width of SR's unit is identical, it is just deeper/wider, due to the 2 additional banks, probably for 2 additional threads.
Eh, I don't think I could be convinced that it's EX at this point. Well, maybe. I guess the question I have is, if this does happen to be SR, what the hell does AMD do from here? You can't really make this thing any fatter.
From the leaked information of BSN:
Branch Target Buffer and "banks" would suggest that's it's SRAM banks that are doubling, not logic (logic is different than memory, right?). That's why I was questioning your statement.
There are four more of the "dark blue-green" structures that are present on this die shot that are not present on Vishera or Zambezi. There are two extra of those structures are present on Trinity's implementation of Piledriver, though. I have no idea why. My understanding is that they're some form of SRAM banks.
Anyway, even if we compare just Zambezi/Vishera to this shot, it's a 50% increase, rather than a 100% increase. So it shouldn't be the BTB.
However...
...if we go waaaay back to
AMD's Opteron 64, we can see that the BTB and branch selectors are virtually indistinguishable from each other. It amazes me how little logic was devoted to the BPU back then, but I digress.
Given that the BTB and branch selectors look exactly like each other on AMD's Hammer core, and the "dark blue-green" banks in the SR die shot are virtually indistinguishable from each other... we must be onto something. Here's what I propose: the two leftmost banks are the branch selectors. The top four make up the BTB. Zambezi and Vishera had two of these structures, which double to four in this SR shot. From this, I think we can safely conclude that they are the BTB banks.
...and you ended up coming to the same conclusion on your own. Oh well.
Of course, now I'm confused as to why Trinity has a larger BTB than Vishera... perhaps Trinity's Piledriver cores are ever so slightly better than Vishera's, not accounting for L3? I guess the world will never know.
Interesting point. The only thing that doesn't jive with AMD's block diagram for SR is that we're not expecting a 256 bit wide FPU until Excavator. This isn't the first time AMD's said one thing and come out w/ another, recall that the specs for the 4850 were supposedly 480 shader units and they delivered 800.
I think I'm starting to get aboard the EX train now. There's some serious logical progression between what we know of SR and this core. With SR, I'm guessing the ALUs and AGUs stay at 2. L1D stands at 16KB, and this 2x256 bit FPU nonsense becomes the 2x128 bit FPU we're expecting.
In the meantime, we're getting the expected 96KB L1I$, BTB doubling, and decode doubling. I suppose that's halfway between this core and Piledriver. EX brings the extra FPU and INT hardware, and doubles the L1D. SR = front end, EX = back end.
Okay, that makes a lot of sense now. I suppose I could see that a lot of the blocks are automated, but at a high level this still looks very hand drawn. There is still a ton of wasted space, but I suppose it'd look worse if it weren't more automated.
Has anyone else noticed that AMD's clock drivers have always taken up an absurd amount of space? Those massive "walls" have got to be taking up at least 10% of the core on their own.