NVIDIA GF100 & Friends speculation

psolord · Mar 26, 2010

Guys I am not cherry picking anything.

Well OK maybe I am but I explained myself.

There is no point paying attention to results that have no meaning for each ones configuration. Someone that games at 2560X1600, please go ahead and don't pay attention to any of the other results.

As for this Unigine benchmark, I am pretty sure they are made at extreme tesselation setting. What chances are that we see games with this kind of tesselation for the next 12 months? Right. Next to none! Why should anyone base a future purchase on benchmarks that don't concern him?

That's why I prefer Tehcpowerup's and Computerbase's results, that average results per resolution.

Arty · Mar 26, 2010

trinibwoy said:
Yaaay, we're already cherry picking and reviews aren't even out yet. This weekend will be fun!

The cherry picking had already started when 5970 was hinted as unfair comparison

DegustatoR · Mar 26, 2010

psolord said:
What chances are that we see games with this kind of tesselation for the next 12 months? Right. Next to none!

Cranking up tesselation factor is like the easiest thing on the planet one can do, you now?

AlexV · Mar 26, 2010

DegustatoR said:
Cranking up tesselation factor is like the easiest thing on the planet one can do, you now?

It's probably on the same level of ease as putting your fingers into the mains...perhaps vaguely more difficult since you actually do need to start some sort of IDE, or at least notepad, do some editing, re-compiling etc., whereas there are power sockets everywhere. Both actions are also similarly useful.

GZ007 · Mar 26, 2010

DegustatoR said:
Cranking up tesselation factor is like the easiest thing on the planet one can do, you now?

Yes but at some point u cant find the difference even with a magnifying lense

. Sometimes its even hard to find the difference with on and off

. (except unigine heaven)

DegustatoR · Mar 26, 2010

AlexV said:
It's probably on the same level of ease as putting your fingers into the mains...

You should take a look at DX11 SDK demos really and stop posting crap.

Erinyes · Mar 26, 2010

Silus said:
Anyway, this s getting off-topic. If you want to discuss more of this, PM me or let's take it to another thread. Let's get back to GF100 speculation, which hopefully will end soon anyway

Never mind, you clearly have your mind made up, lets just leave it at that shall we

Silus said:
http://www.fudzilla.com/content/view/18243/1/

GTS 450 = 256 SPs, 64 TMUs, 32 ROPs, 256 bit memory interface
GTS 430 = 192 SPs, 48 TMUs, 24 ROPs, 192 bit memory interface

So according to this the specs are exactly what neliz was hinting at

Seems like its again divided into 4 GPC's with each GPC consisting of 64SP's, 16 TMU's,8 ROP's and one GPC disabled for the GTS 430?

psolord said:
Crap, I totally missed that! If the GTX 480 and the 5870 have 45W difference and since we know the 5870 idles at 27W then the GTX 480 idles at 72W?

Whoa, yea if those are the idle numbers then it is 72W, toasty! Its quite clearly a big step back from GT200 if its true. Of course the Radeon 4870 with its 90W idle power takes the crown here

Silus · Mar 26, 2010

Erinyes said:
Never mind, you clearly have your mind made up, lets just leave it at that shall we

Just like you

Erinyes said:
So according to this the specs are exactly what neliz was hinting at

Seems like its again divided into 4 GPC's with each GPC consisting of 64SP's, 16 TMU's,8 ROP's and one GPC disabled for the GTS 430?

How exactly do you figure that GF104 has the 4 GPCs ??

If you are factoring the speculation by neliz, 128 TMUs are actually the total

This is a new chip and nothing will be disabled, but rather removed. It's exactly half of GF100 in terms of GPCs and SMs and that makes it 2 full GPCs, 8 SMs in total, each attached to 8 TMUs (4 re-enabled) and it has 32 ROPs, because it's using a 256 bit memory interface (two 64 bit controllers were removed, each connected to 8 ROPs).

air_ii · Mar 26, 2010

DegustatoR said:
You should take a look at DX11 SDK demos really and stop posting crap.

I think what he means is that cranking up tesselation will give you squat in terms of quality, visual realism, etc. You need other stuff, like displacement mapping to make it matter... Hence, it's not the easiest thing on earth.

Gipsel · Mar 26, 2010

Silus said:
How exactly do you figure that GF104 has the 4 GPCs ??

How exactly does anyone figure that there are such things like GPCs at all? I think it's purely PR. Nvidia has distributed setup/raster engines and they can tack an arbitrary amount of SMs to an arbitrary number of raster engines (as proven by GTX470 with 14 SMs and GTX480 with 15SMs and both having 4 raster engines). There is probably just some kind of simple crossbar or FIFO queue between the raster engines and the SMs.

Or how does the loadbalancing between the GPCs with a different amount of SMs (GTX470: 2 GPCs with 4 SMS and 2 GPCs with 3 SMs?) actually work? Most likely the warps formed by one rasterizer unit can end up in any of the enabled SMs and not only in a SM belonging to the same GPC as the rasterizer. That means, there are no real GPCs at all besides on some slides.

Rys · Mar 26, 2010

DegustatoR said:
You should take a look at DX11 SDK demos really and stop posting crap.

That you think it's so easy to add, with little consideration to anything else that's happening on the GPU at the same time, or your art and asset pipe, is testament to how much you should do more than go by SDK samples, so you understand how this stuff works in the real world on real hardware and in real games.

So you stop posting crap.

Erinyes · Mar 26, 2010

Silus said:
How exactly do you figure that GF104 has the 4 GPCs ??

If you are factoring the speculation by neliz, 128 TMUs are actually the total

This is a new chip and nothing will be disabled, but rather removed. It's exactly half of GF100 in terms of GPCs and SMs and that makes it 2 full GPCs, 8 SMs in total, each attached to 8 TMUs (4 re-enabled) and it has 32 ROPs, because it's using a 256 bit memory interface (two 64 bit controllers were removed, each connected to 8 ROPs).

I was speculating... but since you clearly know exactly what the design of the chip is i'll take your word for it

Vincent · Mar 26, 2010

http://vga.it168.com/a2010/0326/865/000000865774.shtml

I am not impressed, if these numbers are all correct.

Silus · Mar 26, 2010

Gipsel said:
How exactly does anyone figure that there are such things like GPCs at all? I think it's purely PR. Nvidia has distributed setup/raster engines and they can tack an arbitrary amount of SMs to an arbitrary number of raster engines (as proven by GTX470 with 14 SMs and GTX480 with 15SMs and both having 4 raster engines). There is probably just some kind of simple crossbar or FIFO queue between the raster engines and the SMs.

Or how does the loadbalancing between the GPCs with a different amount of SMs (GTX470: 2 GPCs with 4 SMS and 2 GPCs with 3 SMs?) actually work? Most likely the warps formed by one rasterizer unit can end up in any of the enabled SMs and not only in a SM belonging to the same GPC as the rasterizer. That means, there are no real GPCs at all besides on some slides.

That's what the scheduler is for. It must know where to send the info.

Also, I'll take the names given by the engineers that designed the chip, to the units in it, over your attempts at classifying them. Sorry about that

mczak · Mar 26, 2010

Silus said:
http://www.fudzilla.com/content/view/18243/1/

GTS 450 = 256 SPs, 64 TMUs, 32 ROPs, 256 bit memory interface
GTS 430 = 192 SPs, 48 TMUs, 24 ROPs, 192 bit memory interface

Fairly interesting, though some things still don't quite add up to me:
256SPs likely implies 2 GPCs, hence only 16 pixels/clock rasterization rate (unless this was altered). Meaning 32 ROPs is a bit overkill.
Also twice the tmus - so maybe that was true afterall...
Also, the performance numbers don't quite add up - if the GTS 450 does "slightly" outperform 5830, then a 20% slower GTS440 won't in fact outperform a HD5770 but end up roughly equal since the difference between HD5770 and HD5830 is only ~15% on average. Meaning the GTS430 would more likely compete with the HD5750 instead.

FenderBender · Mar 26, 2010

Vincent said:
http://vga.it168.com/a2010/0326/865/000000865774.shtml

I am not impressed, if these numbers are all correct.

The CUDA speed seems very good though.. GTX480 Folding@Home 192% the speed of a GTX285, and that's with existing code and not using any new Fermi features.

No wattage measurements in the article, unfortunately.. those would be interesting.

Gipsel · Mar 26, 2010

Silus said:
Also, I'll take the names given by the engineers that designed the chip, to the units in it, over your attempts at classifying them.

Bad thing is that architectural slides are known to bend the reality a bit from time to time. Besides that, do you think engineers draw such shiny and stylish graphs?

Or to put it differently, what sense has the GPC designator at all if the schedulers have to possibility to assign the pixelshader work to a SM not belonging to the same GPC as the raster unit that have rasterized the triangles?

Andrew Lauritzen · Mar 26, 2010

MfA said:
That's not globally coherent though.

Ok, so you tell me what you think "globallycoherent" does when all of the accesses to a UAV are unordered by definition? Why does it matter if you "flush the writes across the chip" if nothing else is guaranteed to be running at the same time and your groups can run in arbitrary order?

Jawed said:
I get a sense that 10.3a are sufficiently un-stable, for instance, that a few review sites will reject them for upcoming comparisons with GF100.

I haven't experienced any stability issues myself, and they have provided some very sizable boosts to even complex DX11 algorithms. Impressive work I say!

thop · Mar 26, 2010

Vincent said:
http://vga.it168.com/a2010/0326/865/000000865774.shtml

All Physx games only benchmarked with it enabled

SimBy · Mar 26, 2010

Vincent said:
http://vga.it168.com/a2010/0326/865/000000865774.shtml

I am not impressed, if these numbers are all correct.

I'm sure they followed nV reviewer guidelines to the letter

NVIDIA GF100 & Friends speculation

psolord

Arty

KEPLER

DegustatoR

AlexV

Heteroscedasticitate

GZ007

DegustatoR

Erinyes

Silus

air_ii

Gipsel

Rys

Graphics @ AMD

Erinyes

Vincent

Silus

mczak

FenderBender

Gipsel

Andrew Lauritzen

Moderator

thop

Great Member

SimBy

Similar threads