Toms Hardware: GeForce 5200 is 2x2?

pcchen · Mar 7, 2003

But this definition has its problem.
For example, NV2X has four Z test units per pixel pipeline. That is, a GF3/GF4 can fetch and test 16 Z values per cycle. Does that make GF3/GF4 "16 zixels per cycle?" I don't think it fit the discussions here.

So I think it's better to use "Z/stencil writes per cycle" than "zixels."

Marc · Mar 7, 2003

You will just have to look to fillrate test result 2 second to see that 5200 is 4x1, not 2x2.

Dave H · Mar 7, 2003

Well NV25 can write 16 z-values per clock, if it's doing 4xMSAA. (Where "write" has a nonliteral definition due to z-compression.)

On the other hand, if I understand the definition of "fragment" correctly, when you're doing e.g. 4xMSAA you have 4 sub-pixels but still only one fragment. In other words, NV25 can write up to 16 z-values per clock, but they are associated with at most 4 fragments. If so, we can save my definition by adding one letter:

"the z/stencil values associated with a particular fragment"

Or is there a better way of saying it?

nutball · Mar 7, 2003

Ugh! My understanding of fragment was that if you're doing 4xMSAA you have 4 fragments and one pixel :?

demalion · Mar 7, 2003

Dave H said:
....
In any case...I know demalion will never speak to me again after this, but I formally propose "shixel" in place of his "proxel". Yes, I realize "shixel" sounds like a vaguely derogatory Yiddush term. But "proxel" sounds like a prescription medication for adult incontinence. In my opinion, of course.

<speaking to the air> "shixel" sounds like a "definitely profane English term"...if I were to rename "proxels" I'd call them something like "Ixel" (for instruction element, maybe "Iixel" to prevent confusion with a pixel typo), "Cixel", or drop the "h" in "shixel". Or maybe just add a letter in front of "pixel" (I-pixels, Z-pixels, etc).

Hmm...just realized "sixel" is a bit close to "sexel", and, my god, that term could sell a LOT of cards....

. Maybe it would be safer to keep that term away from marketroids...

Maybe Opsel or O-pixel would be safer?

Joe DeFuria · Mar 7, 2003

Marc said:
You will just have to look to fillrate test result 2 second to see that 5200 is 4x1, not 2x2.

I would agree....if you could show us the results of the fillrate tests then we cn go home.

Evildeus · Mar 7, 2003

Joe DeFuria said:
Marc said:

You will just have to look to fillrate test result 2 second to see that 5200 is 4x1, not 2x2.

Click to expand...

I would agree....if you could show us the results of the fillrate tests then we cn go home.

Well, i think Marc know what he is talking about since he is the webmaster @ http://www.hardware.fr/ (and probably has a card

)

Joe DeFuria · Mar 7, 2003

Sure....like I said...show us the results.

RussSchultz · Mar 7, 2003

demalion said:
Hmm...just realized "sixel" is a bit close to "sexel", and, my god, that term could sell a LOT of cards.... . Maybe it would be safer to keep that term away from marketroids...

Maybe Opsel or O-pixel would be safer?

I like shiznixel. It would really bring 3d technology to the urban youth.

Evildeus · Mar 7, 2003

Joe DeFuria said:
Sure....like I said...show us the results.

End of the NDA on monday, isn't it?

Nazgul · Mar 7, 2003

RussSchultz said:
I like shiznixel. It would really bring 3d technology to the urban youth.

So I expect the NV35 launch party will be headlined by Snoop Dogg?

Of course, GeForceDMX has a nice ring to it...

overclocked · Mar 7, 2003

Nv30=4*2+8z
Nv31=4*1+4z
Nv34=2*2+2z

No that canÂ´t be right or?

BoddoZerg · Mar 7, 2003

Nazgul said:
Of course, GeForceDMX has a nice ring to it...

Much better than GeForceFX at any rate.

Dave H · Mar 7, 2003

Ugh! My understanding of fragment was that if you're doing 4xMSAA you have 4 fragments and one pixel

No. I looked it up (in the OpenGL spec): four samples, one fragment. The best way to describe the difference between a fragment and a pixel is that "pixel" refers to an area of the finished frame (or to a slot in the framebuffer if you'd rather look at it that way), whereas "fragement" refers to a pixel-sized object making its way through the rasterization pipeline.

UncleSam DL iXBT · Mar 8, 2003

my investigations show that

NV31 is

4х1 in blend stages only
2х2 on all p shaders with 2 stage ALUs per pipe probably
all commands on same ALU's - so 1.1 and 1.4 and 2.0 speed equal
its NV35 approach

NV34 is true half of NV30 in terms of pixel pipes organisation

2х2 always.
1.4 and 2.0 twice slower like in NV30

Dave H · Mar 8, 2003

UncleSam DL iXBT said:
my investigations show that

NV31 is

4х1 in blend stages only
2х2 on all p shaders with 2 stage ALUs per pipe probably
all commands on same ALU's - so 1.1 and 1.4 and 2.0 speed equal
its NV35 approach

NV34 is true half of NV30 in terms of pixel pipes organisation

2х2 always.
1.4 and 2.0 twice slower like in NV30

What do you mean by "blend stages only"? Fixed-function, i.e. not shaders?

I wonder why the TMUs would be limited to 2x2 during shader programs but 4x1 otherwise. How exactly are you testing this, anyways?

Anyways, interesting stuff; and here's hoping your description of NV35 is on target.

Mulciber · Mar 8, 2003

UncleSam DL iXBT said:
my investigations show that

NV31 is

4&#1093;1 in blend stages only
2&#1093;2 on all p shaders with 2 stage ALUs per pipe probably
all commands on same ALU's - so 1.1 and 1.4 and 2.0 speed equal
its NV35 approach

NV34 is true half of NV30 in terms of pixel pipes organisation

2&#1093;2 always.
1.4 and 2.0 twice slower like in NV30

The nVidia video yesterday mentioned that the nv31 "barrowed" from the nv35, ie Intellisampe 2.0. I wonder if this is just another way of saying they finaly incorporated all commands onto the same pipe for the nv31 and nv35. My guess is that would also save them transistors of the nv30 design.

Wow, a 2x2 architecture huh? Looks like we're still going backwards. nVidia was the first to market a 4x1 design in what, 99' with the GeForce256. Here we are 3 years later going back to a 2x2 desing. Sheesh.

UncleSam DL iXBT · Mar 8, 2003

Dave H said:
What do you mean by "blend stages only"? Fixed-function, i.e. not shaders?

I wonder why the TMUs would be limited to 2x2 during shader programs but 4x1 otherwise. How exactly are you testing this, anyways?

Anyways, interesting stuff; and here's hoping your description of NV35 is on target.

yep 4х1 - it's "no pshader" cases (register combiners).

limited becouse in shader program we always deal with more than one command

so no worry about less pipes if they more powerful.

i think nv31 have 4 universal alu's at all. we can use it as 2 stage 2х2 or one stage on 4х1. for hard texturing seems they can get some optimisations from 2 stage 2 tmu scheme so they use it.

rule simply - deeper pipe - probably lower fetch latency from one command standpoint.

yep, seems that nv35 will have pretty same pipes behaviour

UncleSam DL iXBT · Mar 8, 2003

Mulciber said:
The nVidia video yesterday mentioned that the nv31 "barrowed" from the nv35, ie Intellisampe 2.0. I wonder if this is just another way of saying they finaly incorporated all commands onto the same pipe for the nv31 and nv35. My guess is that would also save them transistors of the nv30 design.

Wow, a 2x2 architecture huh? Looks like we're still going backwards. nVidia was the first to market a 4x1 design in what, 99' with the GeForce256. Here we are 3 years later going back to a 2x2 desing. Sheesh.

yep i talk about this saving a less than week ago here - if we do 2 universal FP/int stages on each pipe we probably dont save to much but dont pay to much also. and get ability to fetch textures in paralel with ps20
so they save from case where they simply add another one fp ALU to nv30 but count of copmlexity increase in this case also a bit

nv10 have 4х1 _or_ 2х2 design

same as here. no somthing new to nv in having flexible configured pipes

PS - i use RightMark 3D synthetic tools for measuring pixel and texel fillrate in different modes with different counts of textures used.

It's our own synthetic tests set that enables very focused investigation of different GPU blocks

Arun · Mar 8, 2003

I've done a little PS2 test program and sent it to someone who got a FX5600 Ultra ( not going to say the name, because I'm not 100% sure nVidia authorizes to give this type of number before the 10th... )
The program does about 20 arithmetic instructions, and 1 texture instruction.
It then draws the screen three times, without a Z buffer, and with a 128x128 texure ( screen resolution: 1024x768 )

The result he gave me was 14FPS ( using FRAPS )

So, considering a margin or error, I'd consider 700MIPS ( global instructions, that is )

And the clock rate of the FX5600 Ultra is 350

So... 700/350 = 2
2 Global Instructions/clock?

Uttar

Toms Hardware: GeForce 5200 is 2x2?

pcchen

Moderator

Marc

Dave H

nutball

demalion

Joe DeFuria

Evildeus

Joe DeFuria

RussSchultz

Professional Malcontent

Evildeus

Nazgul

overclocked

BoddoZerg

Dave H

UncleSam DL iXBT

Dave H

Mulciber

UncleSam DL iXBT

UncleSam DL iXBT

Arun

Unknown.

Similar threads