Despite the blur, it looks like some things can be teased out.
I think the CUs themselves appear to have a similar arrangement of blocks to Polaris and others (traversing from the mid-line of the array): L1/MEM, filtering?, SIMD0/1, LDS, scalar, shared front end,SIMD(2/3)
Vega might be slightly different in the L1/MEM and filtering section. Polaris has two regions that add up to roughly twice the length of a SIMD block that appear to match the L1 and filtering sections in the marketing diagram.
Vega seems to have three somewhat distinct areas, although their total length appears to be similar. There is more distinct structure across the blocks along the mid-line. Perhaps the L1 and interconnect portion have been more heavily segregated, leaving a somewhat blank-looking block to the side that might be related to the L/S or address units that might have been more intermingled in the past.
That L1 area in the wafer shot appears like it reflects light a little differently, which is perhaps a photographic artifact or an indication that it is implemented differently somehow. It would be in an area where the CU arrays and their caches would be plugging into an interconnect.
The diagrams AMD has for Vega's NCU and its earlier artistic rendering seem to do reality less justice than usual. The blue and pink marketing diagram for the RX 480 is actually decently reflective of the arrangement, and even the artistic rendition of the Polaris die shot can be somewhat construed to match reality.
The center strip of geometry hardware and command processors has some areas that appear to be bulked up in terms of SRAM, one area that abuts what I think is the ACE and command processor section might have twice as many blocks as well.
The command processor section appears to be more evenly laid out than before. Polaris and Fiji have what appear to be 8 blocks in that region, although in terms of size, position, and internal structures there is significant variation within them and between them. Some blocks get a lot of SRAM, or have very different arrangements and sizes.
Vega seems to have a few consistent themes at blurry high level. A pair of blocks, one square with SRAM along much of its perimeter and sitting on the narrow side of a rectangular section. Then two larger square blocks with somewhat similar L-shaped SRAM arrays of apparently decent size. Then two pairs of blocks that look similar to the square+rectangle mentioned first.
Overall, some very rough pixel counting makes me think Vega's central strip is tangibly wider relative to the width of an adjacent CU than the one in Polaris.
Going by some of AMD's slides about working through wire delays, this may line up with the slide they had about consulting with Zen's designers on the register file (may have some significance as to its flexibility if it specializes for GF like Zen's customizations likely did) and the layout changes for the L1 section. Those memory sections and the capping of sharing the front end to 3 NCUs max may hint at where delays were most notable in prior GPUs.
PS: comparison shots are from the flickr account of Fritzchens Fritz, source of the first known clear Polaris die shot and now die shots including a wide range of chips like Fiji, various x86 chips, classic graphics chips, and recently GP104 and GP106.
https://www.flickr.com/photos/130561288@N04/with/35468030783/