AMD is comparing V100 PCI-E (250W) vs MI60 (300W). Both cards trade blows in a number of benchmarks. Though they avoided the comparison with the 10% faster V100 NVLink (300W).
My guess is that AMD isn't using the Tensor Cores.I thought Volta could do over 1K images per second in ResNet-50.
I thought Volta could do over 1K images per second in ResNet-50.
Yes they are not.My guess is that AMD isn't using the Tensor Cores.
https://devblogs.nvidia.com/tensor-core-ai-performance-milestones/We have achieved record-setting ResNet-50 performance for a single chip and single server with these improvements.
A single V100 Tensor Core GPU achieves 1,075 images/second when training ResNet-50, a 4x performance increase compared to the previous generation Pascal GPU.
1.8ghz at 300w ? We're not that far under Vega 10, I was expecting better, even if it's not "just" a die shrink...
Seeing the predictions "north of 2GHz"... oh my, god. Vega simply has to be clocked to ~1.2GHz to be effective.
How would a 7nm Fiji do? The same I guess.
Vega MI25 has basically the same 1500MHz clock as Vega 64. So consumer Vega is the same as datacenter Vega. Makes no difference whatsoever.Predictions of north of 2GHz were always made regarding a hypothetical consumer gaming product, not with workstation/datacenter products.
Vega 64 on performance mode moves up to ~1.5GHz
Looking at current scaling, MI60 vs MI25, we get 20% more FLOPS. For basically the same TDP. If consumer Vega 20 vs Vega 10 are subjected to the same treatment, we are looking at 350w to 400w of power consumption for the hypothetical consumer Vega 20 @2.1Ghz clock.How much is 1.8GHz with 18% higher clocks? 2.124GHz
AMD denied any consumer Vega 20 to be released for gamers, this thing is strictly datacenter.And this is of course looking at a hypothetical consumer release, which may very well not happen.
https://www.overclock3d.net/news/gp..._vega_instinct_mi60_and_mi50_graphics_cards/1AMD has made it clear that their 7nm Vega graphics cards are not designed for gaming applications, instead, acting as a way for the company to enter the lucrative server/datecenter and machine learning markets.
Vega MI25 has basically the same 1500MHz clock as Vega 64. So consumer Vega is the same as datacenter Vega. Makes no difference whatsoever.
Worse yet, officially AMD mentions MI60 to have UP TO 1.8GHz clock, which means this is by no means a fixed clock. It probably goes under that.
Looking at current scaling, MI60 vs MI25, we get 20% more FLOPS. For basically the same TDP. If consumer Vega 20 vs Vega 10 are subjected to the same treatment, we are looking at 350w to 400w of power consumption for the hypothetical consumer Vega 20 @2.1Ghz clock.
AMD denied any consumer Vega 20 to be released for gamers, this thing is strictly datacenter.
https://www.overclock3d.net/news/gp..._vega_instinct_mi60_and_mi50_graphics_cards/1
I think the TDP for the whole card includes the memory and the W9100 has 2x the memory as the 290x, easily increasing the power by the 25W difference. This is also something to consider for the MI60 vs MI125. 32GB of HBM with corresponding memory interface probably uses more power than the 16GB of HBM in the MI25.Well I'm not disappointed with the clocks because Pro cards use traditionally conservative clocks, but I am with those 300W TDP.
It seems that at least in GCN a high double precision throughput causes a significant impact in power efficiency.
For example the R9 290X (1:8 FP64) has a 250W TDP with 1000MHz core clock, whereas the similar FirePo W9100 (1:2 FP64) has a 275W TDP with 930MHz core clock. Both at same memory clock speeds.
7% lower core clock for 10% higher TDP.
Vega MI25 has basically the same 1500MHz clock as Vega 64. So consumer Vega is the same as datacenter Vega. Makes no difference whatsoever.
Worse yet, officially AMD mentions MI60 to have UP TO 1.8GHz clock, which means this is by no means a fixed clock. It probably goes under that.
Looking at current scaling, MI60 vs MI25, we get 20% more FLOPS. For basically the same TDP. If consumer Vega 20 vs Vega 10 are subjected to the same treatment, we are looking at 350w to 400w of power consumption for the hypothetical consumer Vega 20 @2.1Ghz clock.
AMD denied any consumer Vega 20 to be released for gamers, this thing is strictly datacenter.
https://www.overclock3d.net/news/gp..._vega_instinct_mi60_and_mi50_graphics_cards/1
Why would there be a gaming version at all? The fp64 performance of this card is wasted there, or put in other words, it is inefficient by design with gaming workloads. Nothing AMD has said as far as I have seen has indicated that Vega20 is targeting the gaming market.I'm not sure how P6/P7 works with MI25 cards, but if the peak engine clock of MI25 is 1500Mhz, it's a bit behind the peak clock of 1630Mhz on Vega64. A little underwhelming for the new process anyway.
I think the power consumption would be even higher for Vega20@2.1Ghz, if it can even hit it in normal conditions.
The gaming version can be a 1080Ti competitor but not anything more.
To compete with the 1080Ti and 2080.Why would there be a gaming version at all
As it was for Hawaii, yet they launched 5 consumer SKUs with that chip.The fp64 performance of this card is wasted there
It is?it is inefficient by design with gaming workloads.
Nothing AMD has said as far as I have seen has indicated that Vega20 is targeting the gaming market.
True. However, this time, to my knowledge, AMD have not mentioned gaming at all, and they have said that they have Navi coming for gaming at 7nm.To compete with the 1080Ti and 2080.
As it was for Hawaii, yet they launched 5 consumer SKUs with that chip.
Volta's TensorCores dont support INT. This is a new feature for Turing. But there is still support for 4x INT8.
Volta has no Int4, only INT8. As for Int8, we're at 120 TOPs INT8 for Volta vs 60 INT8 TOPS for V20.But Volta isn't really for the Int8, Int4 workload, more for FP32 and FP16. Turing in Tesla T4 is Nvidias solution for inference (There you use INT8/INT4) with 130 INT8 TOPs and 260 INT4 TOPs at 70 W vs V20 120TOPs at 300W.
So AMDs solution for inference isn't really competitive.
AMDs strength with V20 is it's DP and FP32 performance. There they can compete pretty good with Volta and might sell some cards. Also code which needs a lot of bandwidth will run great with this 1TB/s HBM.
Edit: I think troyan is right, Int8 isn't on tensor cores in volta.
So the MI60 can be used for rendering, RT and denoising. Could AMD release gaming drivers for it?