Does anyone know why Cell is limited to 16MB/s when reading from GDDR? Is there some advantage in having a one-way data path? Is it because the data transfer rate from Cell is throttled to ensure that RSX memory access is not interfered with, and because reads from GDDR (16MB/s) are rarely required whereas writes to GDDR (4GB/s) are much more useful?
CPUs reading from VRAM isn't something you generally do; consider it a small miracle that you can do it at all. Try doing it on your PC sometime and see what happens.
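
To put the two figures quoted above in perspective, here is a rough back-of-the-envelope sketch. The 1280x720, 4-bytes-per-pixel frame is purely an illustrative assumption, as is treating MB and GB as decimal units. It ignores latency and setup cost, so take the numbers as ballpark only.

```python
# Rates quoted in the thread; MB and GB taken as decimal units (assumption).
READ_BYTES_PER_S = 16e6    # Cell reading from GDDR
WRITE_BYTES_PER_S = 4e9    # Cell writing to GDDR

# Hypothetical payload: one 1280x720 frame at 4 bytes per pixel.
FRAME_BYTES = 1280 * 720 * 4


def transfer_seconds(num_bytes, bytes_per_s):
    """Idealised transfer time, ignoring latency and setup overhead."""
    return num_bytes / bytes_per_s


read_s = transfer_seconds(FRAME_BYTES, READ_BYTES_PER_S)
write_s = transfer_seconds(FRAME_BYTES, WRITE_BYTES_PER_S)

print(f"read back: {read_s * 1000:.1f} ms")
print(f"write:     {write_s * 1000:.3f} ms")
print(f"ratio:     {WRITE_BYTES_PER_S / READ_BYTES_PER_S:.0f}x")
```

Under those assumptions, reading a single frame back takes roughly 230 ms while writing it takes under 1 ms, a 250x gap.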