Vince said:demalion said:How the heck is mentioning Hellbinder's "4x4" comment anything remotely resembling "anti-nvidia bias"? It seems likely now that Hellbinder's description may have been based on the description Randell was replying to...was he asking anything more than that?
It's the mentality.
That one line question is an exhibit of a "mentality"? What does your response exhibit? Should we stop here and ponder the "mentalities" exhibited by the question and your response to it? I did ask you if there was other text you had in mind, so I presume this means "yes"?
Err..what? Where did that come from? The observed information has the GF FX exhibiting performance equivalent to a 4 pipe architecture in many cases. Why is he a "moron" for asking the question you quoted? Or, is there some other piece of text you forgot to quote that is the reason for your description?
Are you missing this whole ideal? Instead of a 4 pipe architecture, the architecture is instead composed of an array of processing elements, ALUs, whatever. From there, these can be formed into virtual pipelines - based upon whats being rendered.
Hmm...I think you are missing something that has been said a few times now: if it can't output the pixels to the screen, what is the point of all those "elements"? It is not like the GF FX is significantly outperforming the R300 in shader computations (this is WITH a significant clock advantage).
Thus, the whole paradigm of a 4 or 8 pipe architecture allways exhibiting the same, set-piece preformance is overwith. As Kirk stated in the quote that you all glazed over in your continued nVidia bashing.
Eh!? That reads like you started with David Kirk's comments, and are defining the world by it, with any and all observed performance to date not mattering at all. What exactly is "set-piece" performance, perhaps the meaning of this term is the cause of my confusion.
What we've actually observed is the nv30 fails to calculate pixel output significantly more quickly in comparison to the competition's design, is limited in its ability to output what it has calculated to the screen compared to the competition's design, has more design complexity than the competition, and this comparison with the competition is with the advantage of significantly higher clock speed.
Looking at your statements so far, it seems to me that you are the one who "glazed over" important issues.
Is this statement supposed to make sense? "if you're still dumb enough to think,"? Eh? The simple observation is the R300 exhibits behavior consistent with being able to output 8 pixels per clock, and the nv30 does not (most of the time)...the "why" is under investigation, and it doesn't take "anti-nvidia" bias to wonder about it in the meantime.
It takes anti-nVidia bias to dismiss the answer:
Pipes don't mean as much as they used to. In the [dual-pipeline] TNT2 days you used to be able to do two pixels in one clock if they were single textured, or one dual-textured pixel per pipe in every two clocks, it could operate in either of those two modes. We've now taken that to an extreme. Some things happen at sixteen pixels per clock. Some things happen at eight. Some things happen at four, and a lot of things happen in a bunch of clock cycles four pixels at a time. For instance, if you're doing sixteen textures, it's four pixels per clock, but it takes more than one clock. There are really 32 functional units that can do things in various multiples. We don't have the ability in NV30 to actually draw more than eight pixels per cycle. It's going to be a less meaningful question as we move forward...[GeForceFX] isn't really a texture lookup and blending pipeline with stages and maybe loop back anymore. It's a processor, and texture lookups are decoupled from this hard-wired pipe.
What does it take to dismiss the observation that many elements of this description could apply to the R300 as well, and that it seems to be more effective at getting the tasks done (with less transistors, and lower clockspeed... )?
And instead talk about nVidia's lying and failure, ohh, and throw in some 3dfx bashing too.
So...your problem is that people are criticizing nvidia for calling this design "8x1" when it, so far, can't seem to actually output 8 pixels a clock in any useful circumstance? Why shouldn't they? Nvidia could have picked some other description that isn't misleading compared to its abilities, couldn't they?
Did you ever pause to contemplate that perhaps the reason why the nV30's preformance isn't fixed like the R300's, is because it is using the approach that Kirk talked about?
I have little doubt it is using the approach Kirk is talking about, to some unspecified degree. I am also observing that so far it gets equivalent work done at 500 MHz as the R300 at 325 MHz, and using more transistors. Is it really so hard to understand why that might be criticized?
Did you ever pause to contemplate that the reason why the nv30 is being criticized is because compared to the R300, and the statements nvidia made concerning its design, it deserves it?
If you have an array of processing elements, then you can only use their flexibility to do so much as you only have so many elements to share.
Well, that seems to be a flaw in the execution of the concept as done in the nv30, doesn't it? The advantage we've perhaps observed so far, to my knowledge, is enhanced simple vertex processing power. Do you want to search for my comments on the Quadro FX to see if I acknowledge that...would that satisfy your idea of how the nv30 design should be discussed?
Thus, it won't have the consistentcy of a fixed pipeline, but a virtual pipeline has more plasticity and - as stated - can achieve 16-odd ops in some tasks.
You keep echoing the words in David Kirk's comments (in a way that looks mangled to me) without explaining how discussing the way the design behaves like a 4x? architecture (not just in synthetic tests, either) is displaying "anti-nvidia bias".
You mean in 1997, when performance mattered...as opposed to now, when...?
Are you saying the nv30's performance isn't relevant now, or do you think there is some other reason this 4x? discussion is being brought up?
What? Preformance matters, but so does architectural elegence and effeciency. I think you have absolutly no idea what I'm talking about.
I'm all for architectural elegance and efficiency, believe me, but could you relate that description to the actual nv30 for me please? You do realize that it isn't David Kirk's description of an architecture we are "attacking" when we comment on 4x? for the nv30, but the specific realization (to whatever extent) of that architecture that the is the nv30?
I do fear, mind you, that this topic is sticking and growing in presence because of two factors (1) Ignorance to the Nv30's true design
What does "true design" mean? Are there "true pixels" being rendered by the nv30 that the benchmarks and games aren't accounting for? You seem to be using catch phrases to counter "criticism" based on observed performance. I don't understand the validity. Why don't you stop calling people ignorant for basing their comments on observations you decide to ignore?
(b) the rampent anti-nVidia rhetoric thats rallying around this ideology that the Nv30 has only "4pipes" and is thus inferior in the nomenclature race with the R300s "8pipes"
What are you talking about, "nomenclature race"?! Are you reading anything anyone has said about this? This 4x? speculation (seemingly confirmed now) is based on the actual performance of the nv30, not some nomenclature contest!
nvidia said nv30 is 8x1.
nv30 underperforms expectations, and the performance is evaluated.
nv30 is theorized to actually be more accurately described as 4xSomething.
You come in and criticize people for describing it as 4xSomething.
Ack!