Using an interface that keeps the chip from functioning as a peer in the system's memory space is a barrier to usability.
This can be due to latency and bandwidth constraints related to the physical expansion bus.
There's also the higher-level abstraction of the expansion bus interface and the OS and driver layers you have to drill through.
Something more desirable would be a communication method that operates below as many layers as possible and is capable of operating as autonomously as possible, which goes back to something like the cache-coherent interconnects already used by CPUs.
This is particularly true if on-die, since you can discard all the design choices that trade off performance and latency for a plastic slot that needs to cater to everything that can try (sometimes poorly) to plug into a controller. On-die, there's also no need to cater to every off-spec implementation, either.
For short-distance and low-latency work, physical integration is a stronger bet. Optical interconnects seem to be the next step for traversing distances larger than the silicon package. The bandwidth numbers can be much higher, although latencies will be longer than on-die. That might be a cost range above consumer tech for some time, as that seems to be finding its first use in larger scale server/HPC uses with storage, throughput, or system needs that physically cannot be satisfied by a consumer box.
This can be due to latency and bandwidth constraints related to the physical expansion bus.
There's also the higher-level abstraction of the expansion bus interface and the OS and driver layers you have to drill through.
Something more desirable would be a communication method that operates below as many layers as possible and is capable of operating as autonomously as possible, which goes back to something like the cache-coherent interconnects already used by CPUs.
This is particularly true if on-die, since you can discard all the design choices that trade off performance and latency for a plastic slot that needs to cater to everything that can try (sometimes poorly) to plug into a controller. On-die, there's also no need to cater to every off-spec implementation, either.
For short-distance and low-latency work, physical integration is a stronger bet. Optical interconnects seem to be the next step for traversing distances larger than the silicon package. The bandwidth numbers can be much higher, although latencies will be longer than on-die. That might be a cost range above consumer tech for some time, as that seems to be finding its first use in larger scale server/HPC uses with storage, throughput, or system needs that physically cannot be satisfied by a consumer box.