Tridam's excellent GTX 750 Ti review is finally ready: http://www.hardware.fr/articles/916-1/nvidia-geforce-gtx-750-ti-gtx-750-maxwell-fait-ses-debuts.html
Interestingly, the fillrate test there does not mirror the extremely high bandwidth efficiency seen in 3DMark's fillrate test (as reported by AnandTech). (Though Tridam's conclusions there are wrong, as he wrongly assumed the number of ROPs was doubled. FP32 blending in particular is definitely just very slow: completely ROP-bound, not bandwidth-limited.)
Thanks, I fixed that! Of course the GM107 has 16 ROPs as well, but it benefits from the bigger rasterizer and from 4-5 SMMs able to deliver 16-20 4-byte pixels per clock. And you're correct that FP32 throughput is not limited by memory bandwidth; I just double-checked that.
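To make the per-clock arithmetic above concrete, here is a rough sketch. The 16 ROPs and the 4-5 SMM counts come from the post; the figure of 4 pixels per SMM per clock is only inferred from the "16-20 pixels" numbers, and one 4-byte pixel per ROP per clock is an assumption, not a measured value. The function name is hypothetical.

```python
# Rough per-clock pixel throughput model for GM107 (illustrative only).
ROP_COUNT = 16                 # stated in the thread
PIXELS_PER_SMM_PER_CLOCK = 4   # inferred: 16-20 pixels / 4-5 SMMs
PIXELS_PER_ROP_PER_CLOCK = 1   # assumption: one 4-byte pixel per ROP per clock


def pixels_per_clock(smm_count):
    """Return shader export rate, ROP limit, and the effective rate (their minimum)."""
    shader_export = smm_count * PIXELS_PER_SMM_PER_CLOCK
    rop_limit = ROP_COUNT * PIXELS_PER_ROP_PER_CLOCK
    return {
        "shader_export": shader_export,
        "rop_limit": rop_limit,
        "effective": min(shader_export, rop_limit),
    }


if __name__ == "__main__":
    for smm in (4, 5):  # 4 SMMs and 5 SMMs, the two configurations mentioned
        print(smm, pixels_per_clock(smm))
```

Under these assumptions the SMMs can feed all 16 ROPs in both configurations, so the ROPs rather than the shader export rate set the ceiling for simple 4-byte pixel formats.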
I'm using my own test. It shows the best case when it comes to data compression opportunities.