The watch.impress segment has some interesting breakdowns of the chip.
Notably, the number of HVT transistors drops massively from 32nm to 28nm.
The overall shift toward more nominal-Vt transistors, with a larger proportion of them being regular-length, seems to match the general premise that Steamroller's top end is somewhat lower, so more of the design can go in the nominal pool than before.
However, the drop in HVT was large enough that I wonder whether it had to do with some quirk, such as dropping SOI.
The leakage numbers show a generally more leakage-resistant process, except, again, for the fastest and leakiest transistors.
Electronics Weekly had one sentence mentioning resonant clocking, but will it be any more real this time than its non-appearance in Piledriver?
Another blurb mentions the vdroop-detecting clocking scheme.
These days, dynamic schemes like this echo Intel's Foxton technology more than ever.
Intel has a vdroop-aware clock scheme for its experimental graphics core.
AMD seems to be using it to keep things functional at regular voltages, while Intel's is for near-threshold operation.
edit:
One thing I forgot to comment on was the number of custom macros for Steamroller.
It has an order of magnitude more than AMD's Jaguar.
Part of that may come down to the requirements of Steamroller's per-core performance range, as well as the historical tie that architectural line has to the old AMD fabs.
With that level of process specificity, I wonder whether a Bulldozer-derived core would ever be found on a non-GF process (Jaguar, with far fewer custom macros, hasn't been hopping fabs either), and whether that could have been what scared Sony away from the rumored Steamroller PS4.