Nvidia Pascal Announcement

so... 2.0 - 2.2 Gbps? You know right PCI-E 3.0 has almost the same bandwidth per line?


The cards can be placed in SLI, Nvidia developed a new SLI HB-bridge (high bandwidth) that offers twice the bandwidth compared to the regular SLI bridges you guys already know.
......
The GeForce GTX 1080 with 8GB GDDR5X pricing is set at 599 USD, there will be a founders (special binned and more overclockable) edition as well for 699. The card will be available starting May 27th. The GeForce GTX 1070 with 8GB of GDDR5 memory as well will be released in a founders edition. The basis model will cost 379 USD and the founders edition is going to cost 449 USD. The GeForce GTX 1070 will reach the market a bit later at June 10th, that might be due to custom AIB partner designs and releases released and showed off right after Computex.



http://www.guru3d.com/news-story/nvidia-announces-the-geforce-gtx-1070-and-1080.html
 
What are you going on about?
He mentions Pascal and Polaris.
Polaris is mentioned to be within Mainstream, Pascal within High End...
What is so complicated?
Specific models may not be mentioned, but the context was initial release because the GP106 IS mainstream and coming out in the near future (who know 2-4 months from now).
If he felt Polaris also covered high end, he would not had made the statement about Pascal being a different target audience and that of High End....
Therefore it is not difficult to see he is talking about 1070 and 1080.
Cheers

Well seeing as how he was talking about TAM and specifically mentioned $349 and up as the differentiator... he would be quite right.
He never said you couldn't compare the performance, he was just talking about target markets. Nvidia is targeting a smaller market(7.5million) with Pascal while they are specifically targeting a much larger market with Polaris.
 
well anycase looks to me nV hit 2.5 times perf/watt Pascal over Maxwell 2..... and actually might be a bit more.
How on earth do you get those numbers?
Pascal is ~20% faster than Maxwell at bit over 70 % of the power
 
Pascal from the gtx 980 to the gtx 1080, its more like 60% in increased performance with 10% increase in power consumption, which is 2.0x perf per watt.

From the 980ti its a 30% increase in performance with a 30% drop in power usage. That goes to 2.5 x the perf. watt. All of this with less ALU's.
 
Last edited:
Pascal from the gtx 980 to the gtx 1080, its more like 60% in increased performance with 10% increase in power consumption, which is 2.0x perf per watt.

From the 980ti its a 30% increase in performance with a 30% drop in power usage. That goes to 2.5 x the perf. watt. All of this with less ALU's.
I think you should check your math, seriously. Using your numbers, versus the 980 it's more like 1.5 times perf/w. Versus the 980Ti, it would be 1.85x perf/w. These are solid numbers, but were pretty much expected (due to FinFet). We'll see if actual power consumption figures will mirror those closely (though I think they should).
 
Flops aren't the only thing that come into the performance metric ;)

PS, for more increases in performance

http://deliddedtech.com/2016/05/07/hidden-information-found-in-source-code-of-nvidias-gtx-1080-page/

they measured out the graph lengths and calculated from them.

80% increased performance for ROTR with a 10% increase in power consumption. That it higher than 2.5x perf/watt

70% increased performance for Witcher 3 which will end up around 2.5x per per watt.

Nvidias own "average gaming performance" graph shows around ~20% performance advantage over the Titan x and ~60% over the gtx980, the ROTR and Witcher 3 scores are clearly outliers.
 
Flops aren't the only thing that come into the performance metric ;)

Sure, there is also for example bandwidth which is at 320 GB/s vs 336 GB/s.
Assuming there will be 64 vs 96 ROPs, I can imagine worst case UHD scenarios with 4xmulti-sampling where performance might not even reach GTX980Ti.
 
How on earth do you get those numbers?
Pascal is ~20% faster than Maxwell at bit over 70 % of the power
How are you working out those numbers?
According to their site (yeah short on specifics such as resolution,settings but still): http://www.geforce.com/hardware/10series/geforce-gtx-1080
It shows 67% performance gain in Witcher 3, 75% performance gain in Rise Tomb Raider when compared to a 980.
VR results are sillier in a good way.

I would say that is comparing reference to reference; as we know it OCs well this is a fair comparison as it has headroom for AIBs when released in OC models that will be interesting to compare to the OC 980s.
I guess you could be approximating for apples-to-apples comparison but not sure how easy/relevant that is with changes in characteristics between frabrication dies/ideal clock speeds/libraries and allowing for transisters/cores/TDP.
IMO only real apple-to-apple comparison would be matching TDP/watts and seeing the performance difference, and separately benching its OC overhead capabilities to the reference 980.
Cheers
 
Well seeing as how he was talking about TAM and specifically mentioned $349 and up as the differentiator... he would be quite right.
He never said you couldn't compare the performance, he was just talking about target markets. Nvidia is targeting a smaller market(7.5million) with Pascal while they are specifically targeting a much larger market with Polaris.
OK we will disagree,
but that is not entirely correct in context....
He said about Pascal:
it's a high-end part
Your now putting this into a context that may be beyond what the whole article emphasised, as again mentioned throughout the article with mainstream and high end.
Mainstream has greater footprint than high end, so of course NVIDIA target is smaller market.
But again, he said high end part,,,,,,
He is differentiating between a high end part and Polaris in the first set of releases from both companies (this is probably for the next 2-4 months as he deliberately ignores the Pascal GP106 product coming out).
This is digressing the NVIDIA Pascal thread so maybe take it to the Polaris one, where it is more relevant for those that want to disagree with an article all about Polaris being good mainstream performance with great efficiency.

Cheers
 
Last edited:
you're scaring me sebbbi.
GCN can run compute and graphics on the same CU (wave granularity). This for example allows you to use idle CU cycles for compute when the graphics shader is sampler bound. And use remaining CU GPRs and LDS for compute tasks when the graphics shader is simple.

However per CU / SM granularity hardware scheduling is MUCH better than static allocation. If Maxwell is indeed only using static allocation, this explains why async compute doesn't bring much gains (and even hurts sometimes).
 
sebbbi - is it just one compute shader and one graphics shader per CU, or can each wave effectively be doing its own thing?
 
Back
Top