My bad, I misready you saying 56 for some reason instead of 64. RX Vega 64 however does the same, beats the Polaris 10 in perf/watt, as long as you use the powersaver profile instead of default, which indicates they had to push Vega further past it's optimal range compared to Polaris (which also is past it's optimal range, which would be around 1 GHz IIRC)
I believe Vega is currently held back even more than earlier GCNs at launch simply because of the various changes in the architecture. There are several completely new features, which are said to be enabled but can't be controlled by devs or reviewers - they're just there and used when the driver thinks they should be used. That said, devs apparently can tailor their code to fit those new features better even when they have no control over it, which could bring in the improvements you're looking for.
Performance/watt is IMO one of the best performance metrics, since it enables you to compare how much each chip is doing useful work from end user perspective, per watt. You can then compare those to the theoretical numbers with ease, too.