Is it that simple to basically dial down the multiplier and then under-volt the chip or are the two unrelated in that you can lower the clockspeed handily via multipliers but undervolting takes more work?
In ARM cores that were designed for handhelds it should be no effort at all. There's most probably a pre-determined power-state for 1GHz operation in TX1, by design. In this case all they have to do is disable the power-saving, rush-to-sleep and other features through the OS kernel and simply fix the CPU frequency at 1GHz.
@Nebuchadnezzar usually does an excellent job at evaluating that in Anandtech articles. I don't think they did this for the TX1, but
here's an example of how much the A57 cluster consumes with 4 cores @ 1GHz, in Exynos 5433 (also 20nm):
And if you want to see how much Nintendo/nvidia will be loosing by being greedy as fuck if they haven't gone with
16FF with A72 cores:
1.83W vs. ~1W, though since the A72 cores have a substantially higher IPC than the A57, they could probably go with 900MHz and get the same performance (or clock the CPU at 1.7GHz and get the same power consumption), so 4*A72 @ 16FF would probably consume less than half at the same performance as 4*A57, or get twice the performance at same power consumption.
With A73 the difference would be even larger, if you go by ARM's promises. The A73 is supposedly a lot smaller and consumes less power than A72.
How much dye space do the A53 cores take up?
It's not linear because the cores can use either area-optimized or performance (frequency)-optimized transistors. Except for those chinese SoCs with dual 4*A53 clusters (where the higher-clocked cluster is visibly larger than the lower-clocked one), most LITTLE A53 clusters are usually designed to use low frequencies so they are area-optimized and end up being significantly smaller than the big A57/A72 clusters. In SoCs like e.g. Snapdragon 810, Exynos 5433 or TX1, my bet would be one A53 = ~1/4 A57 in area.
Easy answer would be,
if you go by this comparison, 4x A53 would shave off little more than 2mm^2, plus whatever glue logic they need:
I suspect the ISP and video codec hardware take a whole lot more die area than the A53 cluster.