Did they show it working or was this a mockup with woodscrews?
Assuming roughly ~1 TFLOPs FP32 for each Tegra 7 (expecting twice the GPU performance of Tegra X1, following the cadence between previous iterations), that's 2 TFLOPs for both Tegras combined and 6 TFLOPs for the discrete GPUs. 3 TFLOPs per GPU.
Thinking in mobile graphics solutions because those are MXM cards, 3 TFLOPs is close to a Geforce GTX 980M, or twice that of a GTX 960M using a GM107.
I'm guessing those are two Pascal GP107 cards, if the Pascal architecture turns out more of a Maxwell 3 with FinFet for twice the transistors and execution resources, as it's been suggested.
If GM107 doubled the performance of GK107, it makes sense that GP107 makes that transition again.
On the desktop front, if they end up using say 20% higher clocks, then we are indeed looking at the compute performance of a GTX 970, though probably with significantly less fillrate resources (only 32 ROPs for a 128-bit bus).
Either way, these cards are probably far away from AMD's Polaris Mini that they showed up and running. Different performance segments at least.