If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#101 | |
|
Crazy coder
|
Quote:
This technique can replace most common dynamic code paths, or at least those that we will see in the near future. It won't do while-statement in the general case. But it will do any kind of if-statement, including nesting, and only run the correct path. For performance optimization, this is the most common use of dynamic branching. So a technique that can replace that is IMHO significant. |
|
|
|
|
|
|
#102 | ||
|
Member
Join Date: Jan 2004
Location: Oberlin, OH
Posts: 258
|
Quote:
|
||
|
|
|
|
|
#103 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#104 | |
|
Senior Member
Join Date: Jul 2002
Location: UK
Posts: 1,758
|
Quote:
For example, if you're running a 200-instruction pixel shader, there's no point jumping through hoops to avoid using floating point buffers, because the pixel shader takes so long to execute that the extra bandwidth taken is irrelevant. Similarly, if you are limited by your pixel shader, then it's a lot less likely to matter if you send all the geometry through many times (if they don't share resources). In contrast, if your average polygon size is 2-3 pixels, then it doesn't really matter if you run a hundred instruction pixel shader on it. It won't cost anything. It's all about pipeline balances. |
|
|
|
|
|
|
#105 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#106 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#107 | |
|
Dangerously Mirthful
Join Date: Feb 2002
Location: Winfield, IN USA
Posts: 15,292
|
Quote:
![]() You do of course realize that this makes absolutely no sense, right?
__________________
Elite Bastards - Adminish “Be polite, be professional, but have a plan to kill everybody you meet.” - General James N. Mattis |
|
|
|
|
|
|
#108 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#109 | |
|
Crazy coder
|
Quote:
Do you have any proof of a flaw in the coding? It's open source. I challenge you to find any flaw or anything dishonest in there. I got nothing to hide, but you got something to prove. In fact, I believe that this IS the card. It makes a lot of sense. That's the amount of slowdown I would expect for a card that doesn't perform top of the pipe stencil rejection. I didn't know if they did or not, now I know they don't. Either that, or or there's a problem with their drivers that leaves it disabled. |
|
|
|
|
|
|
#110 | ||
|
Guest
Join Date: Nov 2002
Posts: 900
|
Quote:
On nVidia when you use dynamic branching without the optimizations it is faster than when the optimization is enabled. In other words the hardware is already efficient at dynamic branching. |
||
|
|
|
|
|
#111 | |
|
Crazy coder
|
Quote:
Instead of looking for evil explanations to everything, why don't you look for a logical one? Such as that I don't have any nVidia hardware to try it on. If you read my website it says that it's supposed to run on their hardware too. It's been my intent all the way that it should run on all capable hardware. Either there's a driver bug, or there's a bug somewere in my framework that causes the problem. If I had access to any nVidia hardware, I wouldn't hesitate to spend a night to debug it and find the problem AND report any driver bug I find to nVidia. |
|
|
|
|
|
|
#112 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#113 | |
|
Dangerously Mirthful
Join Date: Feb 2002
Location: Winfield, IN USA
Posts: 15,292
|
Quote:
__________________
Elite Bastards - Adminish “Be polite, be professional, but have a plan to kill everybody you meet.” - General James N. Mattis |
|
|
|
|
|
|
#114 |
|
Member
Join Date: Jan 2004
Location: Oberlin, OH
Posts: 258
|
So still nobody answered me why. :?
|
|
|
|
|
|
#115 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#116 |
|
Member
Join Date: Jul 2003
Location: Beijing
Posts: 640
|
People, stop bashing Humus, will ya?
He just showed sth interesting to us. Yeah, I know the idea has been there for years, but no one mentioned it lately, we nearly forgot there's such an optimization trick. Humus reminds us, and I think it's a good thing. No matter what his purpose is, I repect his coding skill and creativity. As long as this is a valid optimization techniqe, who cares who invented it or implemented it? For me, knowing more is always better than knowing less. |
|
|
|
|
|
#117 |
|
R.I.P. 1983-2010
Join Date: Nov 2002
Posts: 2,234
|
I am not bashing Humus, I am mostly curious as to his comments on whether or not the demo is bugged on NV40 hardware, and why NV40 users are experience a performance deficit from this method.
__________________
Nzone
SLI Forum Administrator NVIDIA User Group Members receive free software and/or hardware from NVIDIA from time to time to facilitate the evaluation of NVIDIA products. However, the opinions expressed are solely those of the members |
|
|
|
|
|
#118 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#119 | |
|
Dangerously Mirthful
Join Date: Feb 2002
Location: Winfield, IN USA
Posts: 15,292
|
Quote:
__________________
Elite Bastards - Adminish “Be polite, be professional, but have a plan to kill everybody you meet.” - General James N. Mattis |
|
|
|
|
|
|
#120 | ||
|
R.I.P. 1983-2010
Join Date: Nov 2002
Posts: 2,234
|
Quote:
__________________
Nzone
SLI Forum Administrator NVIDIA User Group Members receive free software and/or hardware from NVIDIA from time to time to facilitate the evaluation of NVIDIA products. However, the opinions expressed are solely those of the members |
||
|
|
|
|
|
#121 | |
|
Crazy coder
|
Quote:
|
|
|
|
|
|
|
#122 | |
|
Member
Join Date: Jul 2003
Location: Beijing
Posts: 640
|
Quote:
|
|
|
|
|
|
|
#123 |
|
Member
Join Date: Sep 2003
Location: Asslanta
Posts: 180
|
X800XT
*edit...whoops...didn't have AA/AF on at all...sorry* 16*12 ON 140 OFF 43 10*7 ON 315 OFF 106 Jack
__________________
Jack be nimble |
|
|
|
|
|
#124 |
|
Dangerously Mirthful
Join Date: Feb 2002
Location: Winfield, IN USA
Posts: 15,292
|
Humus, did ATi give ya a pat-on-the-head or a bonus or anything weird like that for this? (Sorry, the cursiousity bug bit me and I had to ask.)
__________________
Elite Bastards - Adminish “Be polite, be professional, but have a plan to kill everybody you meet.” - General James N. Mattis |
|
|
|
|
|
#125 |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Outside of the fact that we have the usual people hyping the ATI side of things and the usual people brandishing Nvidia's guns, like Chris Ray I am leary of the extreme difference in the impact of the optimizations on different hardware.
I don't believe that Humus intentionally made the optimization ineffective for Nvidia hardware but I thought the optimization was supposed to be generic? Humus, being the coder you are in the best position to at least make an informed guess at a reason for the discrepancy. Any ideas? Also, Nvidia is a corporation that has an opportunity to take advantage of features that are unique to their product and they are doing so. The argument that it is Nvidia's 'fault' that Crytek didn't implement these new features in SM2.0 is infantile. It is actually ATI's 'fault' for not implementing SM3.0 support in the first place. Any sensible person would acknowledge that. Yes it is possible and feasible, but what incentive does Crytek really have to forego SM3.0 and implement fall backs for SM2.0? ATI sure as hell doesn't seem to be giving them any. So Crytek's decision is $$$+3.0 or nothing+2.0 assuming equal effort for either. Yeah, tough decision. Fact is, Far Cry is a completed retail game that runs perfectly on ATI hardware. Crytek would have never even added these new features without pressure/incentive from Nvidia in the first place and nobody would have seen them on either IHV's hardware. Unless some of you conspiracy theorists think that they intentionally dummied down their SM2.0 features in anticipation of the SM3.0 only addons The same people that were saying SM3.0 is worthless are the same ones now saying that Crytek/Nvidia are evil for using it. Where does it stop? This is getting ridiculous.
__________________
What the deuce!? |
|
|
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Dynamic branching, again. | Frank | 3D Architectures & Chips | 8 | 21-Nov-2004 02:02 |
| Faking dynamic branching - technical discussion | Mintmaster | 3D Technology & Algorithms | 51 | 08-Jul-2004 15:57 |
| ATI Benchmarking Whitepaper | Dave Baumann | 3D & Semiconductor Industry | 112 | 27-Oct-2003 19:50 |
| DirectX 9.0 macros and dynamic branching? | Luminescent | 3D Architectures & Chips | 19 | 20-Aug-2002 17:36 |
| Mark Rein comments on the 'UT2K3' "leaked" demo | Aurra_Sing | 3D Architectures & Chips | 0 | 11-Jun-2002 22:25 |