If you look at PcgamesHardware the rsullts are the same like Techreport. At culled Polygoneoutput there is no difference between gp102 and gp104The Tech Report conclusion is incorrect. Nvidia had a faster culling rate prior to the tiled rasterizer.
Hi, first post.Wait - is that the benchmark, where Vega FE scores 114-ish at gamersnexus? And downclocked to Fury X leel 97-ish and Fury X is at 70-ish? Don't you think something's fishy here?
Welcome!Hi, first post.
Vega FE in gaming workloads at Fury X clocks seems to perform almost exactly as the Fury X
In specviewperf, GamersNexus showed great improvements in FPS even when downlocked to match Fury X..
lol I think it might have been. I thought I checked my reference but apparently I didn't. Here's hoping for RX Vega ordering after SIGGRAPH! I wonder if that means reviewers will be getting their cards before or during SIGGRAPH?is this where they said it wouldnt be available that week? because, if so, he was talking about computex.
I suspect a global primitive distributor at the front of the pipe didn't scale. This likely fetches indices and forms primitives. Also, Nvidia has claimed 1/2 a tri per SM for some parts so apparently it can be 1/3 or 1/2.Ever since they have distributed setup, to be exact (with the "polymorph engine", starting with fermi). (The tiled rasterizer would not help in any case for that.)
FWIW gp102 is a bit of an anomaly as it shows no scaling over gp104 with the culled polygon throughput test (which I think is what techreport must have been using). Since the theoretical culled throughput is nominally simply 1/3 tri per clock per smm, suggesting it hits another limit on gp102.
I wasn't commenting on the performance results. Only the conclusion about the tiled rasterizer being relevant.If you look at PcgamesHardware the rsullts are the same like Techreport. At culled Polygoneoutput there is no difference between gp102 and gp104
Do you remember which ones had a 2-cycle per VTF? I was only aware of one VTF every 3 cycles per SM.I suspect a global primitive distributor at the front of the pipe didn't scale. This likely fetches indices and forms primitives. Also, Nvidia has claimed 1/2 a tri per SM for some parts so apparently it can be 1/3 or 1/2.
I remember Kepler being 2 cycle and first gen Maxwell (750 Ti) being 3 cycle. It seems to me that 2nd gen Maxwell went back to 2 cycle. Without locking the clocks it's tough to know the clock rate for synthetics, thus it's tough to estimate how many operations are performed per clock.Do you remember which ones had a 2-cycle per VTF? I was only aware of one VTF every 3 cycles per SM.
I always thought Fermi is 1 tri every 4, Kepler 1 tri every 2, and Maxwell/Pascal 1 tri every 3 cycles. That said, I got that from what Damien wrote, and indeed starting with 2nd gen Maxwell the chips seem to exceed that rate. Now the theoretical rate wasn't mentioned in the gtx 980 article, but there was no hint it would be different to first gen maxwell (ok the marketing actually says maxwell 1 is polymorph engine 2.0, same as kepler, which doesn't make much sense, whereas gm2xx is polymorph engine 3.0, but I wouldn't really give those marketing terms any credibility).I remember Kepler being 2 cycle and first gen Maxwell (750 Ti) being 3 cycle. It seems to me that 2nd gen Maxwell went back to 2 cycle. Without locking the clocks it's tough to know the clock rate for synthetics, thus it's tough to estimate how many operations are performed per clock.
I've never gotten round to polishing it enough for regular use. One day this year, I'm just not sure when.
I can't, chunks of the framework it uses are licensed and the license I have doesn't let me do that. I'll start a new thread for it nearer the time.Throw it up on github? Let people polish it for you
I can't, chunks of the framework it uses are licensed and the license I have doesn't let me do that. I'll start a new thread for it nearer the time.
The card throttles to both temperature and power, @1600MHz the card reaches 375w power consumption quickly and throttles down to a lower clock to reduce power, which probably means it needs even more power than that to sustain 1600MHz for longer periods. Also increasing the clocks gives some diminishing returns.Can you give some context to the video?
The card throttles to both temperature and power, @1600MHz the card reaches 375w power consumption quickly and throttles down to a lower clock to reduce power, which probably means it needs even more power than that to sustain 1600MHz for longer periods. Also increasing the clocks gives some diminishing returns.
375w at 1600 ? Holly...