Frank
29-Sep-2006, 22:02
If you build a GPGPU, you can do the same thing CPUs do: add more local buffers to reduce the latency by having more threads in flight (and store more states and the data requested by them), or go for more bandwidth.
Instead of more local stores and/or cache, isn't it better to add as much SRAM to the package as possible? That might offer more gain than increasing the GP bit, when you want fast results.
Instead of more local stores and/or cache, isn't it better to add as much SRAM to the package as possible? That might offer more gain than increasing the GP bit, when you want fast results.