Maybe I missed it but does all data still have to go through the CPU first?
There is a latency cost that has yet to alleviated compared to the implementation on consoles. I don’t know enough to know if this matters in practice though.It goes via system RAM. But there is basically no CPU impact of that.
There is a latency cost that has yet to alleviated compared to the implementation on consoles. I don’t know enough to know if this matters in practice though.
The NVMe latency still applies to the PC as well it just has more stops before it can be decompressed.I wouldn't be so sure of that tbh. The hardware decompressor in the consoles will add it's own latency. And the latency of reading from an NVMe drive is going to be huge compared to the latency added by system memory copies which have much lower latency than NV memory of the SSD.
Certainly Intels SFS demo seems to be working pretty spectacularly so I doubt there's much reason to be concerned here.
The NVMe latency still applies to the PC as well it just has more stops before it can be decompressed.
GPU GDEFLATE:
16 MiB staging buffer: .......... 4.57729 GB/s mean cycle time: 154632000
32 MiB staging buffer: .......... 7.46996 GB/s mean cycle time: 98937830
64 MiB staging buffer: .......... 11.8437 GB/s mean cycle time: 81842042
128 MiB staging buffer: .......... 13.7098 GB/s mean cycle time: 100085301
256 MiB staging buffer: .......... 13.3529 GB/s mean cycle time: 106768209
512 MiB staging buffer: .......... 11.7419 GB/s mean cycle time: 187876636
1024 MiB staging buffer: .......... 6.61114 GB/s mean cycle time: 22161204
Maybe I'm reading your post wrong (didn't watch the video) but 100% cpu vs 90% gpu usage & less than a 0.5 sec difference in loading time sound to me like it basically makes no difference? Well, expect that a cpu costing half of what that gpu costs is barely any slower.