Awesome.Since they support 48KB Shared Memory + 16KB L1 per 192 ALU SMX on Kepler, each 32 ALU SM will need to have at least 48KB Shared Memory for backwards compatibility. That's a LOT more shared memory (and associated bandwidth) than on Kepler!
I thought Kepler was a step backwards. They increased the ALU's / core but didn't increase the SM/L1 at all. All in all, it's great that GPU's are finally getting more cache.
KNL is supposed to be brutal with it's caching, see RWT.