Twice the power of one x translates to about 10TF of rdna1.0, with rdna2.0 being even more efficient i can't see how he ment 12TF of rdna2.0. Or atleast, i doubt it. Also 13.3TF is 2080Ti level of performance, on paper atleast. I know it's the baseless section so you can throw whatever you want, but anyway.
If you follow Shifty's logic; you'll see why tunnel visioning on just 'FLOPs' is the flaw for the argument. Like comparing AMD and Nvidia flops with people saying 2 AMD flops = 1 Nvidia flop. No mention of memory, bandwidth, the test at hand, the architecture etc. It'll never fly. The most obvious way to compare like for like is to take it as spec.
if someone told you that a car a has double the power of car b. But all car a has is upgraded parts and a massive twin turbo. It's the same engine though, but you're going to measure horsepower at the flywheel for both. Regardless of how the power is generated the measurement at the end is what is going to matter.
It is a mistake to say well then 2x Scoprio means 9 TF rdna, because you're not measuring raw performance. You're measuring how well it handles different loads. That's like saying when we add a driver and a race course; I now expect the car A to only be 1.3x as fast as car B. Which might be true if the course is loaded with a bunch of curves in which the driver can't put down the pedal. But that doesn't mean the car isn't capable of delivering 2x the power at peak horsepower.
The hardware is the car.
The course and driver are the developers.
The car can't really change, but the developers can make a course and drive it in a way that would maximize the performance of the car (thus exclusives).
You can't get more FLOPs than what is calculated. Mul + Add is 2 operations. And that's it. It doesn't matter which architecture you are running, the most the GPU can do is a Mul + Add in a single clock cycle. It's just a matter of feeding the architecture to do work (bandwidth/latency on cache, reducing idle time), reducing workloads so that it does less work (compression), making design choices so that you're working smarter (optimization), leveraging new hardware features/accelerators to perform certain tasks much faster (VRS, RT, Mesh Shading, etc)