If the CPU and the GPU are going to unify, I believe a good next step would be to remove the GPU command processor and let the CPU cores directly spawn the waves/warps/etc to the array of compute units. Obviously this needs shared caches between the CPU and GPU and full coherence and efficient fine grained synchronization. Intel is almost there already with Broadwell.
Is this removing the command processor, or just supplanting its role for initiating wavefront launches?
Absent more significant reworkings of the GPU, just taking any single sub-processor out and replacing it with a host CPU may lead to there needing to be one or more sub-processors between the CPU and the internal networks and custom queues that the original processor interfaced with.
Significantly more than coherent caches would need to be done, since a good chunk of the CP's interactions with the CUs and fixed function pipeline wouldn't go through a cache, and caches don't do push updates well. Of course, all those clients could be upgraded to have their own little processors that poll for updates to the queues in memory.
Intel was (long time ago) performing vertex shaders on the CPU. The CPU would be more suited to do the command processor's tasks. This would obviously allow us to do crazy stuff that is not possible with the current designs. And would at the same time sidestep all the IO/security problems.
From a conceptual point of view, the CPU would be at least as suited.
From a practical aspect, at least right now, the existing CPUs would not given the nature of implementation. They physically do not have the same linkages, so a reworked core and likely reworked GPU architecture would have to come hand-in-hand.
Involving a host CPU is also where I noted the difference in opinion between AMD and others when it came to kernel visibility of the GPU's handling of the worklist. Involving a host CPU would very likely mean involving the kernel, which at least AMD doesn't care to do for the time being. Giving the client domain access to a host CPU without involving the system would reopen the system concerns in a big way.