Paulakers2005
Newcomer
1. Nvidia claims that there is no direct mapping between video memory and main memory. Consider the following simple block diagrams:
CPU datapath: --> CPU --> L1 --> L2 --> MM (main memory) --> VirtualM (I/O)
GPU datapath: --> GPU --> SPs w/ register files, load/store --> video memory --> PCIe
Two questions:
(a) The consensus is that all previous generations of GPU load textures directly from main memory. Is that still the case for G80? We assume that for a GPU-oriented program (e.g. CUDA), the texture can be moved from I/O to video memory for faster execution later. If so, does G80 fetch the data not via the CPU but over PCIe, and then from I/O through the chipset's southbridge? We think this question is important because, whether the texture is smaller than 768 MB (8800 GTX) or larger than 768 MB (which raises the VM issue), this path could be one cause of the difference in performance.
(b) If there's no direct mapping between video memory and the main memory, what's the cost of interaction?
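To put rough numbers on (b), here is a back-of-envelope sketch in Python. It is not a measurement: it assumes a PCIe 1.x x16 link (250 MB/s per lane per direction) and the 8800 GTX's theoretical video memory bandwidth of 86.4 GB/s, and it treats both as sustained peaks, which real transfers will not reach. The helper name `transfer_seconds` is ours, for illustration only.

```python
# Rough estimate of moving a 768 MB texture over PCIe versus reading it
# from video memory. All bandwidths are theoretical peaks (assumptions),
# so real transfer times will be longer.

PCIE_LANE_BYTES_PER_S = 250e6    # assumed PCIe 1.x: 250 MB/s per lane, one direction
PCIE_LANES = 16                  # assumed x16 slot
VIDEO_MEM_BYTES_PER_S = 86.4e9   # assumed 8800 GTX peak: 384-bit bus, 1800 MT/s


def transfer_seconds(size_bytes, bytes_per_s):
    """Time to move size_bytes at a sustained rate of bytes_per_s."""
    return size_bytes / bytes_per_s


texture_bytes = 768 * 2**20                       # 768 MB, the 8800 GTX memory size
pcie_bw = PCIE_LANE_BYTES_PER_S * PCIE_LANES      # 4 GB/s per direction
pcie_s = transfer_seconds(texture_bytes, pcie_bw)
vram_s = transfer_seconds(texture_bytes, VIDEO_MEM_BYTES_PER_S)

print(f"PCIe x16 transfer:  {pcie_s * 1e3:.1f} ms")
print(f"Video memory read:  {vram_s * 1e3:.1f} ms")
print(f"Ratio (PCIe/video): {pcie_s / vram_s:.1f}x")
```

Under these assumptions the PCIe hop is roughly twenty times slower than a read from video memory, which is why we care whether the texture has to cross that link more than once.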
2. Related to Q.4 as well. Now we have AMD and Intel platforms (we haven't done much research on the chipset datapaths yet, but we will in the near future). We know that on Intel platforms (nForce4, 570i, 590i, 680i, and perhaps P35) the memory controller and PCIe both go through the northbridge, whereas on AMD platforms the memory controller is integrated into the processor, as shown in:
http://www.xbitlabs.com/articles/cpu/display/amd-quad-fx_4.html#sect0
http://www.xbitlabs.com/articles/cpu/display/athlon64-e3-mem_2.html
And the following are for Intel chipsets:
http://www.pcper.com/article.php?type=expert&aid=320
http://www.extremetech.com/article2/0,1697,2134184,00.asp
Given these different datapaths, how do AMD and Intel platforms end up yielding similar bandwidth and performance?
I would really appreciate hearing from you soon.
Best,
Paul