Doesn't the off-die bandwidth required go down as the quantity of on-die cache is increased? If we're talking about embedded platforms which have to feed just a single 1920 by 1080 sized screen then if they increased the cache on the GPU and increased the cache size available to both the GPU and CPU they could avert a decent number of bus calls and therefore save bandwidth.