Looks like I'm outdated, What is Vulkan ?
and by now, we should be close to Pascal announcement, no ?
It's the new version of OpenGL which is closer to the metal like DX12. Basically the DX12 version of OpenGL.
Looks like I'm outdated, What is Vulkan ?
and by now, we should be close to Pascal announcement, no ?
Looks like I'm outdated, What is Vulkan ?
Not really. While it's maintained and developed by Khronos, just like OpenGL, it was built on AMD's Mantle rather than being from scratch or based on older OpenGLIt's the new version of OpenGL which is closer to the metal like DX12. Basically the DX12 version of OpenGL.
Not really. While it's maintained and developed by Khronos, just like OpenGL, it was built on AMD's Mantle rather than being from scratch or based on older OpenGL
It is the same strategy they used for the Fermi reveal and presentation first in 2009 and for the big Kepler in 2012 -- both architectures with emphasis on HPC, while consumer features were trailing behind. And since Maxwell was clearly consumer-first architecture (with some cloud-induced applications), now it's only logical that Pascal would be propped for the next wave of HPC, that the previous generation took a rest from, mostly thanks to the extended life of the 28nm process.So far, all marketing material published by NV has revolved around the topic "neural networks" and similar scientific applications, but nothing about where to place Pascal performance wise in games.
I don't agree. At Pascal launch, I believe that FF16+ will have much better yields than 28nm at beginning. For months, Apple is already making millions of A9 on FF16+ and the process is derived from 20nm that is used for years.There is also a good chance that 16nm FF+ turned out to have worse yields and therefor higher costs than initially expected, making it quite possible that Pascal will only be rentable in the middle 4-diggit price range in the beginning.
The A9 isn't directly comparable. From what I understood, FF16+ allows to tune for two different characteristics, either transistor density and switching time or power consumption.For months, Apple is already making millions of A9 on FF16+ and the process is derived from 20nm that is used for years.
Speculation from me, no backed yet:
What's the chance that the first Pascal GPUs are going to be shipped as Quadro or Tesla, not GeForce series?
...
And I'm not just talking about HBM2 being reserved to high end models for now, I have concerns that Pascal won't be cost efficient in general for quite some time.
2:1? No way.Another problem is the DP throughput. If NV goes for Mixed Precision (Half : Single : Double => 4:2:1), i would speculate ~6000 SPs at max. IMO i would go for ~5000SPs.
Apple's volumes are ginormous. If NV were to sell like 80 million GPUs every quarter they would be shitting their pants of sheer surprise and excitement. Alas, that's not the case though.Apple is dual sourcing manufacturing for the A9. If yields would be that great they could have avoided the trouble and cost to dual source.
AnarchX (Google Translate) said:The CUDA DLL also calls equal to a few codenames:
Maybe GP100 and GP102 are both large GPUs ( ≥ 450 mm^2 or so) but one of them has fast DP and the other one has slow DP?GP100
GP102
GP104
GP106
GP107
GP10B
GV100
2:1? No way.
They will most likely go with mixed precision ALUs instead of dedicated DP and SP units this time. And for that, the rate is about 4:1 for doubled data width for multiplication. So 16:4:1, best case. I think half precision is additionally going to be implemented as VLIW4 or at least VLIW2 (to keep the architecture 32bit wide) at SP latency, double precision as multi-cycle.
Out of these, Kepler was actually the one closest to optimum. It's just that a 64bit FMAD/FMUL costs 4x as much hardware resources as the corresponding 32bit operation, you can't cheat around that. There is some additional, mostly data width independent overhead for IEEE 754 specific edge case handling.Well, Nvidia already had FP32:FP64 2:1 with Fermi. AMD had it with Hawaii. NV had 3:1 with GK100/110. NV has FP16:FP32 2:1 with GM20B (Tegra X GPU). The most interesting fact is, that Fermi didn't have dedicated FP64 units.
maybe they are prepping the code names for a titan and then a gaming version like they did with the 7x0 series.
Weird GV100 is in there, kind of a bit early for that I think.
And you didn't just called for the 295X2 as a reference for DP performance?That's a dual GPU card, and if you really want to go that way: