gamervivek
Regular
Has anyone actually measured and confirmed Fury X's bandwidth as 512GB/sec? Tech report measured Fury X's random texture sampling bandwidth as 333 GB/sec, but that's a low bound, not an estimate, of the full bandwidth. But has any other benchmark tool verified the full 512, perhaps with an OpenCL bandwidth test?
You can't expect 99% sustained performance to the theoretical figures of any kind of DRAM at the time.
What I find notable is where the compressible texture's bandwidth goes above theoretical for the later Nvidia GPUs, which seems to point to a lot of bus accesses being saved due to some combination of compression and keeping the data on-die.
http://www.guru3d.com/articles_pages/asus_geforce_gtx_980_ti_strix_review,6.htmlNvidia reworked the memory subsystem quite a bit, enabling much higher memory clock frequency speeds compared to previous generation GeForce GPUs. The result is this; memory speeds up-to 7 Gbps combined with a faster 384-bit wide bus. Combined with some clever advancements in color compression Nvidia can claim even more bandwidth as Maxwell cards now use 3rd generation delta color compression (eg. 7 Gbps *1/75%) = 9.3 Gbps effective bandwidth thanks to enhanced color compression and enhanced caching techniques.
To reduce DRAM bandwidth demands, NVIDIA GPUs make use of lossless compression techniques as data is written out to memory. The bandwidth savings from this compression are realized a second time when clients such as the Texture Unit later read the data. As illustrated in the preceding figure, the compression engine has multiple layers of compression algorithms.
...
Therefore, starting in Fermi Nvidia also implemented support for a “delta color compression” mode. In this mode, they calculate the difference between each pixel in the block and its neighbor, and then try to pack these different values together using the minimum number of bits.
...
The effectiveness of delta color compression depends on the specifics of which pixel ordering is chosen for the delta color calculation. Maxwell contains the third generation of delta color compression, which improves effectiveness by offering more choices of delta calculation to the compressor. Thanks to the improvements in caching and compression in Maxwell, the GPU is able to significantly reduce the number of bytes that have to be fetched from memory per frame. Maxwell uses roughly 25% fewer bytes per frame compared to Kepler.
The OCL BW test reached 80% on a GTX 980. (178/224).You can't expect 99% sustained performance to the theoretical figures of any kind of DRAM at the time.
2x HD7970 3GB, Normally it should not use both gpu's as i have not put a monitor on my second one, but i will need retest it. ( one 7970 should got 288GB/S )
Actually, if you have two monitors then you should connect one monitor to each card.
AMD GPUs have had this ridiculous problem for ~3 years where the memory clock goes up into "3D gaming mode" if you have two or more monitors connected to the same card.
If you have only one monitor per card, you'll probably save a lot of power.
With 364GB/s, I assume you're overclocking the memory to 7.5Gbps?Bandwidth can measured in several ways. In AIDA's GPGPU test applet (OpenCL), the mem copy gives me ~280GB/s with my 780 Ti out of 364GB/s theoretical. The old GPC Benchmark suite is still good for more detailed breakdown with block-size comparison.
7.6 to be exact. I could try higher but it's pointless with this core.With 364GB/s, I assume you're overclocking the memory to 7.5Gbps?
Who is ignoring it? The technology is interesting. But if performance and perf/$ are major concerns, it's not the obvious slam-dunk the AMD claimed it would be.
Very interesting. I'll believe it when I see it, but AMD has done dumb stuff in the past.According to this rumor, AMD actually are planning a top to bottom HBM based range...
http://wccftech.com/amd-working-entire-range-hbm-gpus-follow-fiji-fury-lineup/
what is dumb about it ?Very interesting. I'll believe it when I see it, but AMD has done dumb stuff in the past.
The top to bottom phrasing is not particularly well-written, although I can see several ways that a new product range could have HBM throughout its covered range even if that range does not include every part of AMD's overall market.what is dumb about it ?
It should make the low end boards very very small with great performance that should make dell/hp/apple and the likes very happy to offer the cards. Just look how tiny the fury x is Now imagine something like the 290x on 14/16nm with hdm . That be a killer product.