Saving transistors is not the aim, better utilization and even opening up entire new problem domains is. Needing to have coherent branching locks you in tight in what you can attempt on a GPU at the moment..I'm not sure that going fully independent will save that many transistors over what is present now.