Nvidia Pascal Reviews [1080XP, 1080ti, 1080, 1070ti, 1070, 1060, 1050, and 1030]

silent_guy · May 18, 2016

Isn't async compute simply the fact that a GPU can run compute shaders independently and asynchronously with graphics workloads?
If so, doing it inter-shader instead of intra-shader should be sufficient to meet that definition.
Nobody said that it has to be the most efficient or fastest implementation in existence. Similarly, nobody said that enabling async compute has to be faster than not enabling it: if a particular implementation is such that it can't find inefficiencies to exploit, then so be it.

pjbliverpool · May 18, 2016

Love_In_Rio said:
1920 shaders at 1,506 Ghz base clock and 1,683 Ghz bost clock.
So it is 1,60 times faster than a 970 more or less as the benches chart shows.
Not bad at all.

Spec for spec it's very close to a Titan-X. I understand NV are saying it will be faster, although it will likely be very marginally so. Factory O/C versions though should easily cruise past the Titan-X.

Love_In_Rio · May 18, 2016

pjbliverpool said:
Spec for spec it's very close to a Titan-X. I understand NV are saying it will be faster, although it will likely be very marginally so. Factory O/C versions though should easily cruise past the Titan-X.

For the price is really really good.

Ext3h · May 18, 2016

silent_guy said:
Isn't async compute simply the fact that a GPU can run compute shaders independently and asynchronously with graphics workloads?
If so, doing it inter-shader instead of intra-shader should be sufficient to meet that definition.

Yes, cooperative scheduling is perfectly sufficient to fulfill the specification. Maxwell did that already, respectively you can do that on any hardware.

But the problem with Maxwell was that it would essentially flush the entire graphics pipeline, all SMMs and stall the command processors, in order to reconfigure the hardware for compute. That made the switch extremely expensive, as the GPU utilization suffers while the remaining draw calls complete, and the GPC isn't allowed to dispatch anything new.

The specs said nowhere that you had to gain anything from Async Compute, but that penalty should not have happened either.

Love_In_Rio · May 18, 2016

By the way, 150 watts TDP.Thats Polaris 10 XT territory.We'll see.

silent_guy · May 18, 2016

Ext3h said:
Yes, cooperative scheduling is perfectly sufficient to fulfill the specification. Maxwell did that already, respectively you can do that on any hardware.

But the problem with Maxwell was that it would essentially flush the entire graphics pipeline, all SMMs and stall the command processors, in order to reconfigure the hardware for compute. That made the switch extremely expensive, as the GPU utilization suffers while the remaining draw calls complete, and the GPC isn't allowed to dispatch anything new.

The specs said nowhere that you had to gain anything from Async Compute, but that penalty should not have happened either.

Fine. That's Maxwell. So with Pascal, they're able to avoid this flush and reassign the SMs dynamically? That's a major improvement, right? So why the complaints? It's not perfect, it doesn't have the granularity of AMD. It's not the first time that there have been features that worked better for one vendor than the other.

CarstenS · May 18, 2016

1st card with 8gbps GDDR5 btw. And full memory subsystem confirmed. No 970-reloaded.

Love_In_Rio · May 18, 2016

CarstenS said:
1st card with 8gbps GDDR5 btw. And full memory subsystem confirmed. No 970-reloaded.

The pity is the PCB is not shorter. A card like this with the size of a Fury nano would be perfect for a mini-ITX living room system.

pjbliverpool · May 18, 2016

CarstenS said:
1st card with 8gbps GDDR5 btw. And full memory subsystem confirmed. No 970-reloaded.

So does that mean it will have 64 ROPS?

CarstenS · May 18, 2016

Yes. And all cache partitions. And all memory controllers. For all intends and purposes the full memory subsystem of the 1080 sans "X" from GDDR.

DavidGraham · May 18, 2016

silent_guy said:
. Similarly, nobody said that enabling async compute has to be faster than not enabling it: if a particular implementation is such that it can't find inefficiencies to exploit, then so be it.

It is well known AMD hardware suffered from underutilization since generations ago. Async helps them achieve better utilization. That doesnt mean NVIDIA should follow suit. There are other ways through which a certain archeticture maximizes its throughput.

Malo · May 18, 2016

Love_In_Rio said:
The pity is the PCB is not shorter. A card like this with the size of a Fury nano would be perfect for a mini-ITX living room system.

Yeah I'm waiting to see how the mid-range looks for my HTPC replacement as I want to move to dedicated discrete instead of Steam streaming from my gaming PC. Provided all the checkboxes are there for 4k, HDR etc. I'm hoping Polaris will shine for perf/w.

Ext3h · May 18, 2016

silent_guy said:
Fine. That's Maxwell. So with Pascal, they're able to avoid this flush and reassign the SMs dynamically? That's a major improvement, right? So why the complaints? It's not perfect, it doesn't have the granularity of AMD. It's not the first time that there have been features that worked better for one vendor than the other.

I'm only complaining that they are apparently not putting the hardware to FULL use yet. Now that they fixed Pascal, it's about time that they move the DX12 compute queues to the GMU as well. Till now, the hardware queues in that are still reserved for CUDA only. In hindsight it makes sense why they didn't do that for Maxwell yet, it just wouldn't have worked properly at all. But with Pascal, that limitation is gone, and there are still *actual* gains to be achieved there.

Apart from that, it is impressive that they managed to fix the fundamental problem so thoroughly this time.

lanek · May 18, 2016

Razor1 said:
http://www.geforce.com/hardware/10series/geforce-gtx-1070

spec page for 1070 is up

I like what have bring Nvidia with the 1080, but honestly, when i read something like ( up to ) 3x the performance over the previous generation in the main page. ( ofc maybe is in a really specific case of a specific VR implementation ( even if we cant really check it ).. i dont know what to tell.

Personnally i wait that they really demo a VR game who run at 300% more fps... Should be easy to calculate, the game will run faster with VR that on a 1080p standard monitor ( i joke a bit about the marketing )

Razor1 · May 18, 2016

yep gotta love marketing lol.

Voxilla · May 18, 2016

pjbliverpool said:
Spec for spec it's very close to a Titan-X. I understand NV are saying it will be faster, although it will likely be very marginally so. Factory O/C versions though should easily cruise past the Titan-X.

Some compute applications, like physics simulation (ie fluid,smoke), are memory bandwidth limited.
There the frame buffer compression does not help and the 80 GB/s deficit of the 1070 will hurt.

Take for example the Wave simulation @ http://users.skynet.be/fquake/
I'm getting 573 FPS on a Titan-X @ 1080p screen resolution.
(With memory @ 8GHz 654 FPS)

Razor1 · May 18, 2016

http://videocardz.com/60113/nvidia-geforce-gtx-1080-reviews

Full review list and video review list.

Infinisearch · May 18, 2016

CarstenS said:
1st card with 8gbps GDDR5 btw. And full memory subsystem confirmed. No 970-reloaded.

Where is this from?

CarstenS · May 18, 2016

Voxilla said:
Some compute applications, like physics simulation (ie fluid,smoke), are memory bandwidth limited.
There the frame buffer compression does not help and the 80 GB/s deficit of the 1070 will hurt.

Take for example the Wave simulation @ http://users.skynet.be/fquake/
I'm getting 573 FPS on a Titan-X @ 1080p screen resolution.
(With memory @ 8GHz 654 FPS)

Nice scaling.

1080: 646
Fury X: 808

Infinisearch said:
Where is this from?

From here:
http://www.pcgameshardware.de/Nvidi...cials/Geforce-GTX-1080-GTX-1070-KFKA-1195567/
and confirmed by Nvidia.

Malo · May 18, 2016

Infinisearch said:
Where is this from?

From the Nvidia specs for the 1070
http://www.geforce.com/hardware/10series/geforce-gtx-1070

Nvidia Pascal Reviews [1080XP, 1080ti, 1080, 1070ti, 1070, 1060, 1050, and 1030]

silent_guy

pjbliverpool

B3D Scallywag

Love_In_Rio

Ext3h

Love_In_Rio

silent_guy

CarstenS

Moderator

Love_In_Rio

pjbliverpool

B3D Scallywag

CarstenS

Moderator

DavidGraham

Malo

Yak Mechanicum

Ext3h

lanek

Razor1

Voxilla

Razor1

Infinisearch

CarstenS

Moderator

Malo

Yak Mechanicum

Similar threads