PDA

View Full Version : Own an ATI X1, HD or NV 7/8 series GPU and have 5-10 minutes? Please run this.


PeterT
29-May-2007, 17:20
Hi enthusiast community, I once again turn to you for your help. If you own one of these cards (I'm especially interested in the new ATI/AMD GPUs of course), use a windows OS, and have a bit of time then please follow the instructions below.

Please download this: (611 KiB)
http://peter.metaclassofnil.com/uni/mgbench310507.zip
It extracts to a folder called "mgbench".
Then close all GPU-intensive or high-priority CPU apps you may have running and run the .exe contained in the folder. Now, this can take 3 to ? (probably around 10) minutes. After it has finished, it should have generated a file called "GPUMGbench.txt". Either send me that file by email (peter@metaclassofnil.com), or post its contents here, or send it via forum PM. However, please don't forget to mention your graphics card type in detail, and if you use non-standard clock speeds. Driver version would also be nice but is not strictly required.

Some important points:
- If you're worried about this being malware, you can see in my posting history that I've already done something like this 2 times before. I've also been a member of this forum for over 5 years.
- Better not try to use the values reported by this program to compare GPU performance. It's very specific, somewhat overhead-limited and probably buggy.
- WARNING: If you're not 100% sure that your GPU is stable, save your data or return it to standard clocks before running this. After getting some reports of crashes I found out that this tool is a very good "GPU stability test" - even configurations that make it through hours of 3Dmark loops may well BSoD running this program. I even had one card crash at stock speeds and only be able to complete the run downclocked.

Thanks for your help!

[edit]
I updated the link above to point to the new version, for more information see my post later in this thread. (http://forum.beyond3d.com/showpost.php?p=1013723&postcount=61)

AlStrong
29-May-2007, 17:27
Q: What is the program doing?

PeterT
29-May-2007, 17:34
A: It benchmarks the speed of various components of a GPGPU multigrid method implementation at a number of different problem sizes. The specific algorithm implemented in this benchmark calculates the solution to a 2D Poisson equation (with Dirichlet boundary conditions) using a multigrid algorithm with a Jacobi smoother. A bit of explanation can be found here:
http://peter.metaclassofnil.com/mastervortrag.pdf
The benchmark data in that presentation is extremely old and incorrect though, the current version is much more optimized and somewhat more correct. It is part of my Master's thesis which should be coming to a close soon.

Maybe I should have mentioned that in the first post :D

GMâ„¢
29-May-2007, 17:45
NVIDIA 7800GTX 768Mb Stock Clocks
NVIDIA Forceware v158.42 Vista x64
Intel C2D @3.33GHz

Results:
http://www.gm-designs.com/gmart/bench.rar

trinibwoy
29-May-2007, 19:21
ATI X1900XT-512 Stock
Catalyst 7.4
AthlonX2 2.5Ghz
Windows XP SP2

Results:
Crashed with "atidvag.dll stuck in infinite loop" error :???:

AlexV
29-May-2007, 20:02
HD 2900XT 512MB Stock Clocks
Catalyst 8.37.4.3
Pentium D 940 3.2Ghz
Vista X64


------------------
-- size: 3

BENCHMARK
Jacobi iteration: 20.3 micros
Residual calculation: 18.7 micros
Restriction: fw: 15.6 micros
Interpolation + add: 18.8 micros
VCycle: 359 micros


BENCHMARK
Jacobi iteration: 20.3 micros
Residual calculation: 18.8 micros
Restriction: fw: 14 micros
Interpolation + add: 18.8 micros
VCycle: 390 micros


BENCHMARK
Jacobi iteration: 18.7 micros
Residual calculation: 20.3 micros
Restriction: fw: 14.1 micros
Interpolation + add: 18.7 micros
VCycle: 360 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 19.1111 micros
Residual calculation: 19 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 19.1111 micros
VCycle: 607.778 micros


BENCHMARK
Jacobi iteration: 19.1111 micros
Residual calculation: 19.1111 micros
Restriction: fw: 13.8889 micros
Interpolation + add: 19.1111 micros
VCycle: 607.778 micros


BENCHMARK
Jacobi iteration: 19.1111 micros
Residual calculation: 19.1111 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 19 micros
VCycle: 607.778 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 19.625 micros
Residual calculation: 19.5 micros
Restriction: fw: 13.625 micros
Interpolation + add: 19.625 micros
VCycle: 820 micros


BENCHMARK
Jacobi iteration: 19.5 micros
Residual calculation: 19.625 micros
Restriction: fw: 13.625 micros
Interpolation + add: 19.5 micros
VCycle: 840 micros


BENCHMARK
Jacobi iteration: 19.625 micros
Residual calculation: 17.5 micros
Restriction: fw: 15.625 micros
Interpolation + add: 17.625 micros
VCycle: 840 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 20.1429 micros
Residual calculation: 20 micros
Restriction: fw: 13.4286 micros
Interpolation + add: 20.1429 micros
VCycle: 1048.57 micros


BENCHMARK
Jacobi iteration: 20.1429 micros
Residual calculation: 17.8571 micros
Restriction: fw: 15.5714 micros
Interpolation + add: 17.8571 micros
VCycle: 1094.29 micros


BENCHMARK
Jacobi iteration: 20.1429 micros
Residual calculation: 20 micros
Restriction: fw: 13.4286 micros
Interpolation + add: 20.1429 micros
VCycle: 1048.57 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 18.3333 micros
Residual calculation: 18.1667 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 18.1667 micros
VCycle: 1303.33 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 18.1667 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 18.1667 micros
VCycle: 1328.33 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 18.3333 micros
Restriction: fw: 15.5 micros
Interpolation + add: 18.3333 micros
VCycle: 1301.67 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 15.6 micros
Residual calculation: 21.8 micros
Restriction: fw: 12.6 micros
Interpolation + add: 21.8 micros
VCycle: 1500 micros


BENCHMARK
Jacobi iteration: 18.8 micros
Residual calculation: 18.8 micros
Restriction: fw: 15.6 micros
Interpolation + add: 18.8 micros
VCycle: 1530 micros


BENCHMARK
Jacobi iteration: 22 micros
Residual calculation: 18.6 micros
Restriction: fw: 12.6 micros
Interpolation + add: 18.8 micros
VCycle: 1562 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 19.5 micros
Residual calculation: 19.5 micros
Restriction: fw: 15.5 micros
Interpolation + add: 19.5 micros
VCycle: 1760 micros


BENCHMARK
Jacobi iteration: 19.5 micros
Residual calculation: 19.5 micros
Restriction: fw: 15.75 micros
Interpolation + add: 19.5 micros
VCycle: 1795 micros


BENCHMARK
Jacobi iteration: 15.5 micros
Residual calculation: 19.5 micros
Restriction: fw: 15.75 micros
Interpolation + add: 19.5 micros
VCycle: 1797.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 20.6667 micros
Residual calculation: 21 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 20.6667 micros
VCycle: 2033.33 micros


BENCHMARK
Jacobi iteration: 20.6667 micros
Residual calculation: 15.6667 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 20.6667 micros
VCycle: 2033.33 micros


BENCHMARK
Jacobi iteration: 20.6667 micros
Residual calculation: 21 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 21 micros
VCycle: 2030 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 15.5 micros
Residual calculation: 23.5 micros
Restriction: fw: 15.5 micros
Interpolation + add: 16 micros
VCycle: 2340 micros


BENCHMARK
Jacobi iteration: 15.5 micros
Residual calculation: 23.5 micros
Restriction: fw: 16 micros
Interpolation + add: 15.5 micros
VCycle: 2345 micros


BENCHMARK
Jacobi iteration: 16 micros
Residual calculation: 23 micros
Restriction: fw: 8 micros
Interpolation + add: 23.5 micros
VCycle: 2265 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 32 micros
Residual calculation: 15 micros
Restriction: fw: 16 micros
Interpolation + add: 15 micros
VCycle: 2660 micros


BENCHMARK
Jacobi iteration: 15 micros
Residual calculation: 16 micros
Restriction: fw: 15 micros
Interpolation + add: 32 micros
VCycle: 2500 micros


BENCHMARK
Jacobi iteration: 16 micros
Residual calculation: 15 micros
Restriction: fw: 16 micros
Interpolation + add: 16 micros
VCycle: 2500 micros

Sorry for posting it like that, but I cannot post attachements.

caffeinated
29-May-2007, 20:27
Vista 32-bit/Geforce 8600 GTS:

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

GLImageTexture: DevIL: could not load image.

PeterT
29-May-2007, 20:36
HD 2900XT 512MB Stock Clocks
Catalyst 8.37.4.3
Pentium D 940 3.2Ghz
Vista X64

[...]

Wow. Those scores are utterly amazing if true. I really can't believe them for the 2047 test at least - regardless of how powerful the card is, it can't be that fast... (I'm not accusing you, my tests must be buggy, perhaps it's vista related)

It would be helpful to get some more HD 2900 results, perhaps running in XP.

Crashed with "atidvag.dll stuck in infinite loop" errorSeems your card is not stable at stock speeds :/ - I really should market this as a stability test.

GLImageTexture: DevIL: could not load image.That's very strange. Did you extract the zip archive with folders intact?

AlexV
29-May-2007, 20:44
I`ve barely looked at the way you`ve set-up your code-if it`s very friendly towards the R600s architecture, it could have some serious oomph...by skimming one of the shaders i`ve noticed quite a number of vec4s being done(skimming as really skimming, so I may not have been paying attention). I simply ran the exe, which means it ran in windowed mode...any command line arguments I should use or anything?

Skrying
29-May-2007, 20:47
X1950 Pro
Core 2 Duo E6300 @ 2.8Ghz
8.36/Catalyst 7.4
Windows Vista (32-bit)


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 11 micros
VCycle: 226 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.1 micros
Restriction: fw: 8.2 micros
Interpolation + add: 11.1 micros
VCycle: 223 micros


BENCHMARK
Jacobi iteration: 11.1 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 10.9 micros
VCycle: 223 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 11.3333 micros
Residual calculation: 11.2222 micros
Restriction: fw: 8.11111 micros
Interpolation + add: 11.2222 micros
VCycle: 364.444 micros


BENCHMARK
Jacobi iteration: 11.2222 micros
Residual calculation: 11.2222 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11 micros
VCycle: 366.667 micros


BENCHMARK
Jacobi iteration: 11.2222 micros
Residual calculation: 11 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11.2222 micros
VCycle: 364.444 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 11.125 micros
Residual calculation: 11.125 micros
Restriction: fw: 8.125 micros
Interpolation + add: 11.25 micros
VCycle: 503.75 micros


BENCHMARK
Jacobi iteration: 10.875 micros
Residual calculation: 11.125 micros
Restriction: fw: 8.125 micros
Interpolation + add: 11.125 micros
VCycle: 500 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11.125 micros
VCycle: 503.75 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 11.4286 micros
Residual calculation: 11.2857 micros
Restriction: fw: 8.14286 micros
Interpolation + add: 11.2857 micros
VCycle: 644.286 micros


BENCHMARK
Jacobi iteration: 11.2857 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8.42857 micros
Interpolation + add: 11.1429 micros
VCycle: 645.714 micros


BENCHMARK
Jacobi iteration: 11.1429 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8.42857 micros
Interpolation + add: 11.1429 micros
VCycle: 644.286 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11.5 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.5 micros
VCycle: 806.667 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11.5 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.6667 micros
VCycle: 800 micros


BENCHMARK
Jacobi iteration: 11.8333 micros
Residual calculation: 11.6667 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.8333 micros
VCycle: 798.333 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.4 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 928 micros


BENCHMARK
Jacobi iteration: 10.8 micros
Residual calculation: 11.4 micros
Restriction: fw: 8 micros
Interpolation + add: 11.2 micros
VCycle: 926 micros


BENCHMARK
Jacobi iteration: 11.2 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11.2 micros
VCycle: 922 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 12 micros
Residual calculation: 11.75 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.75 micros
VCycle: 1082.5 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 11.25 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11.5 micros
VCycle: 1077.5 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11.25 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11.25 micros
VCycle: 1077.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 13 micros
Restriction: fw: 9.33333 micros
Interpolation + add: 11.3333 micros
VCycle: 1216.67 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.3333 micros
Restriction: fw: 8 micros
Interpolation + add: 11.6667 micros
VCycle: 1220 micros


BENCHMARK
Jacobi iteration: 11.3333 micros
Residual calculation: 11.3333 micros
Restriction: fw: 8 micros
Interpolation + add: 11.3333 micros
VCycle: 1210 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.5 micros
Restriction: fw: 7.5 micros
Interpolation + add: 11.5 micros
VCycle: 1350 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.5 micros
Restriction: fw: 8 micros
Interpolation + add: 11.5 micros
VCycle: 1355 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 12 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.5 micros
VCycle: 1380 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 16 micros
Restriction: fw: 11 micros
Interpolation + add: 12 micros
VCycle: 1580 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 12 micros
VCycle: 1490 micros


BENCHMARK
Jacobi iteration: 12 micros
Residual calculation: 11 micros
Restriction: fw: 9 micros
Interpolation + add: 10 micros
VCycle: 1550 micros

AnarchX
29-May-2007, 21:04
P4 3GHz
6800GT FW158.27
Win XP SP2

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 18.7 micros
Residual calculation: 18.8 micros
Restriction: fw: 14.1 micros
Interpolation + add: 14 micros
VCycle: 828 micros


BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.4 micros
Interpolation + add: 9.4 micros
VCycle: 797 micros


BENCHMARK
Jacobi iteration: 10.9 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.4 micros
Interpolation + add: 9.4 micros
VCycle: 797 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 12.2222 micros
Residual calculation: 12.1111 micros
Restriction: fw: 10.4444 micros
Interpolation + add: 10.4444 micros
VCycle: 1318.89 micros


BENCHMARK
Jacobi iteration: 12.1111 micros
Residual calculation: 13.8889 micros
Restriction: fw: 8.66667 micros
Interpolation + add: 10.4444 micros
VCycle: 1441.11 micros


BENCHMARK
Jacobi iteration: 12.2222 micros
Residual calculation: 12.1111 micros
Restriction: fw: 8.66667 micros
Interpolation + add: 10.4444 micros
VCycle: 1354.44 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 13.625 micros
Residual calculation: 13.625 micros
Restriction: fw: 9.75 micros
Interpolation + add: 9.875 micros
VCycle: 1875 micros


BENCHMARK
Jacobi iteration: 13.625 micros
Residual calculation: 11.75 micros
Restriction: fw: 9.75 micros
Interpolation + add: 11.75 micros
VCycle: 2011.25 micros


BENCHMARK
Jacobi iteration: 13.75 micros
Residual calculation: 13.625 micros
Restriction: fw: 9.75 micros
Interpolation + add: 9.75 micros
VCycle: 1875 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 24.5714 micros
Residual calculation: 22.2857 micros
Restriction: fw: 13.4286 micros
Interpolation + add: 15.5714 micros
VCycle: 2678.57 micros


BENCHMARK
Jacobi iteration: 24.5714 micros
Residual calculation: 22.2857 micros
Restriction: fw: 13.4286 micros
Interpolation + add: 15.5714 micros
VCycle: 2522.86 micros


BENCHMARK
Jacobi iteration: 22.2857 micros
Residual calculation: 22.4286 micros
Restriction: fw: 13.2857 micros
Interpolation + add: 15.7143 micros
VCycle: 2454.29 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 60 micros
Residual calculation: 57.1667 micros
Restriction: fw: 34 micros
Interpolation + add: 39 micros
VCycle: 3073.33 micros


BENCHMARK
Jacobi iteration: 60 micros
Residual calculation: 59.8333 micros
Restriction: fw: 33.8333 micros
Interpolation + add: 36.5 micros
VCycle: 3125 micros


BENCHMARK
Jacobi iteration: 59.8333 micros
Residual calculation: 59.8333 micros
Restriction: fw: 34 micros
Interpolation + add: 36.3333 micros
VCycle: 3125 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 253 micros
Residual calculation: 253.2 micros
Restriction: fw: 171.8 micros
Interpolation + add: 128.2 micros
VCycle: 3938 micros


BENCHMARK
Jacobi iteration: 253.2 micros
Residual calculation: 253.2 micros
Restriction: fw: 168.6 micros
Interpolation + add: 128.2 micros
VCycle: 3938 micros


BENCHMARK
Jacobi iteration: 253 micros
Residual calculation: 250 micros
Restriction: fw: 172 micros
Interpolation + add: 128 micros
VCycle: 3908 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 679.75 micros
Residual calculation: 675.75 micros
Restriction: fw: 492.25 micros
Interpolation + add: 445.25 micros
VCycle: 8437.5 micros


BENCHMARK
Jacobi iteration: 683.5 micros
Residual calculation: 675.75 micros
Restriction: fw: 488.25 micros
Interpolation + add: 445.25 micros
VCycle: 8400 micros


BENCHMARK
Jacobi iteration: 683.5 micros
Residual calculation: 672 micros
Restriction: fw: 492 micros
Interpolation + add: 445.5 micros
VCycle: 8397.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 2859.33 micros
Residual calculation: 2859.33 micros
Restriction: fw: 2265.67 micros
Interpolation + add: 2010.33 micros
VCycle: 33073.3 micros


BENCHMARK
Jacobi iteration: 2844 micros
Residual calculation: 2859.33 micros
Restriction: fw: 2270.67 micros
Interpolation + add: 1979.33 micros
VCycle: 35520 micros


BENCHMARK
Jacobi iteration: 3177.33 micros
Residual calculation: 3375 micros
Restriction: fw: 2723.67 micros
Interpolation + add: 2088.67 micros
VCycle: 36146.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 14218.5 micros
Residual calculation: 14515.5 micros
Restriction: fw: 12484.5 micros
Interpolation + add: 11429.5 micros
VCycle: 165470 micros


BENCHMARK
Jacobi iteration: 14101.5 micros
Residual calculation: 14500 micros
Restriction: fw: 12539 micros
Interpolation + add: 11476.5 micros
VCycle: 168360 micros


BENCHMARK
Jacobi iteration: 14164 micros
Residual calculation: 14445.5 micros
Restriction: fw: 12508 micros
Interpolation + add: 11468.5 micros
VCycle: 165315 micros



------------------
-- size: 2047
GLFilterStep -- Error doing makeCurrent on target.

size 2047 did not run?

AlexV
29-May-2007, 21:07
X1950 Pro
Core 2 Duo E6300 @ 2.8Ghz
8.36/Catalyst 7.4
Windows Vista (32-bit)


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 11 micros
VCycle: 226 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.1 micros
Restriction: fw: 8.2 micros
Interpolation + add: 11.1 micros
VCycle: 223 micros


BENCHMARK
Jacobi iteration: 11.1 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 10.9 micros
VCycle: 223 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 11.3333 micros
Residual calculation: 11.2222 micros
Restriction: fw: 8.11111 micros
Interpolation + add: 11.2222 micros
VCycle: 364.444 micros


BENCHMARK
Jacobi iteration: 11.2222 micros
Residual calculation: 11.2222 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11 micros
VCycle: 366.667 micros


BENCHMARK
Jacobi iteration: 11.2222 micros
Residual calculation: 11 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11.2222 micros
VCycle: 364.444 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 11.125 micros
Residual calculation: 11.125 micros
Restriction: fw: 8.125 micros
Interpolation + add: 11.25 micros
VCycle: 503.75 micros


BENCHMARK
Jacobi iteration: 10.875 micros
Residual calculation: 11.125 micros
Restriction: fw: 8.125 micros
Interpolation + add: 11.125 micros
VCycle: 500 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11.125 micros
VCycle: 503.75 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 11.4286 micros
Residual calculation: 11.2857 micros
Restriction: fw: 8.14286 micros
Interpolation + add: 11.2857 micros
VCycle: 644.286 micros


BENCHMARK
Jacobi iteration: 11.2857 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8.42857 micros
Interpolation + add: 11.1429 micros
VCycle: 645.714 micros


BENCHMARK
Jacobi iteration: 11.1429 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8.42857 micros
Interpolation + add: 11.1429 micros
VCycle: 644.286 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11.5 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.5 micros
VCycle: 806.667 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11.5 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.6667 micros
VCycle: 800 micros


BENCHMARK
Jacobi iteration: 11.8333 micros
Residual calculation: 11.6667 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.8333 micros
VCycle: 798.333 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.4 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 928 micros


BENCHMARK
Jacobi iteration: 10.8 micros
Residual calculation: 11.4 micros
Restriction: fw: 8 micros
Interpolation + add: 11.2 micros
VCycle: 926 micros


BENCHMARK
Jacobi iteration: 11.2 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11.2 micros
VCycle: 922 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 12 micros
Residual calculation: 11.75 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.75 micros
VCycle: 1082.5 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 11.25 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11.5 micros
VCycle: 1077.5 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11.25 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11.25 micros
VCycle: 1077.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 13 micros
Restriction: fw: 9.33333 micros
Interpolation + add: 11.3333 micros
VCycle: 1216.67 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.3333 micros
Restriction: fw: 8 micros
Interpolation + add: 11.6667 micros
VCycle: 1220 micros


BENCHMARK
Jacobi iteration: 11.3333 micros
Residual calculation: 11.3333 micros
Restriction: fw: 8 micros
Interpolation + add: 11.3333 micros
VCycle: 1210 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.5 micros
Restriction: fw: 7.5 micros
Interpolation + add: 11.5 micros
VCycle: 1350 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.5 micros
Restriction: fw: 8 micros
Interpolation + add: 11.5 micros
VCycle: 1355 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 12 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11.5 micros
VCycle: 1380 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 16 micros
Restriction: fw: 11 micros
Interpolation + add: 12 micros
VCycle: 1580 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 12 micros
VCycle: 1490 micros


BENCHMARK
Jacobi iteration: 12 micros
Residual calculation: 11 micros
Restriction: fw: 9 micros
Interpolation + add: 10 micros
VCycle: 1550 micros


Seeing this, I`d think there`s either a bug in your code or something the Catalysts foobar in order to achieve such performance. It should also be mentioned that, at least for me, the test was quite short, dunno if it reached 3 minutes.

Skrying
29-May-2007, 21:12
Seeing this, I`d think there`s either a bug in your code or something the Catalysts foobar in order to achieve such performance. It should also be mentioned that, at least for me, the test was quite short, dunno if it reached 3 minutes.

Oh yes, I should mention it was about 30 seconds for me on the test.

PeterT
29-May-2007, 21:14
size 2047 did not run?That's normal, may run out of memory on some cards.

Looking at Skrying's results, who also runs an ATI card on Vista, it's clear that there's something very wrong going on with that combination. There's simply no way these results can be right. Rule of thumb: if any of "Jacobi iteration", "Residual calculation", "Restriction" or "Interpolation + add" report less than 100 microseconds at size 2047 (or even 1023) the measurements are not being taken correctly.

Here are my 8800 gts results at those sizes for comparison. That's about the range those numbers "should" fall in.
------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 1320.5 micros
Residual calculation: 1265.5 micros
Restriction: fw: 1250 micros
Interpolation + add: 961 micros
VCycle: 23750 micros


BENCHMARK
Jacobi iteration: 1344 micros
Residual calculation: 1234 micros
Restriction: fw: 1250 micros
Interpolation + add: 961 micros
VCycle: 23750 micros


BENCHMARK
Jacobi iteration: 1351.5 micros
Residual calculation: 1234 micros
Restriction: fw: 1242.5 micros
Interpolation + add: 961 micros
VCycle: 23750 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 4456 micros
Residual calculation: 4734 micros
Restriction: fw: 5594 micros
Interpolation + add: 4922 micros
VCycle: 139690 micros


BENCHMARK
Jacobi iteration: 4509 micros
Residual calculation: 4703 micros
Restriction: fw: 5688 micros
Interpolation + add: 4875 micros
VCycle: 140000 micros


BENCHMARK
Jacobi iteration: 4625 micros
Residual calculation: 4750 micros
Restriction: fw: 5656 micros
Interpolation + add: 4844 micros
VCycle: 140160 micros

I really need to investigate what's going on with ATI in Vista, but that's somewhat challenging if one has neither at hand.

[edit]
Seeing this, I`d think there`s either a bug in your code or something the Catalysts foobar in order to achieve such performance.Indeed. It would be great if someone running ATI on XP could post, to see if it's a general Catalyst issue or a Catalyst/Vista issue.

Dave Baumann
29-May-2007, 21:19
With regards to clockspeeds, I'm not convinced that this would be operating at "3D speeds" on ATI Radeon boards.

PeterT
29-May-2007, 21:24
With regards to clockspeeds, I'm not convinced that this would be operating at "3D speeds" on ATI Radeon boards.
Well, currently it seems to operate at "well beyond lightspeed" on ATI cards in Vista, so that shouldn't be a problem ;)

But seriously, I hadn't considered that. What triggers "3D speeds"?

Dave Baumann
29-May-2007, 21:29
Full screen rendering.

caffeinated
29-May-2007, 21:35
I would put this in a code box if I knew how...If someone could tell me how to do this I will go back and edit it. The following are my results with my 8800 GTS. I also have results with a Geforce 8600 GTS. I will wait, though, if someone would be so kind as to explain how to fix it into the scrolling box so that it won't be so irritating to read ;).

AMD 4000+ X2 Brisbane @ 2.83
G.SKILL 4-4-4-12 @ 704mhz
Geforce 8800 GTS 640MB @ 525/830
Forceware 158.22/ Windows XP SP2
**************************************

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 39.1 micros
Residual calculation: 35.9 micros
Restriction: fw: 39.1 micros
Interpolation + add: 39.1 micros
VCycle: 1515 micros


BENCHMARK
Jacobi iteration: 37.5 micros
Residual calculation: 39.1 micros
Restriction: fw: 37.5 micros
Interpolation + add: 39 micros
VCycle: 1532 micros


BENCHMARK
Jacobi iteration: 37.5 micros
Residual calculation: 39.1 micros
Restriction: fw: 37.5 micros
Interpolation + add: 31.2 micros
VCycle: 1547 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 39.8889 micros
Residual calculation: 29.5556 micros
Restriction: fw: 38.2222 micros
Interpolation + add: 38.1111 micros
VCycle: 2483.33 micros


BENCHMARK
Jacobi iteration: 36.4444 micros
Residual calculation: 38.1111 micros
Restriction: fw: 38.2222 micros
Interpolation + add: 38.2222 micros
VCycle: 2465.56 micros


BENCHMARK
Jacobi iteration: 38.2222 micros
Residual calculation: 38.1111 micros
Restriction: fw: 36.4444 micros
Interpolation + add: 31.3333 micros
VCycle: 2500 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 33.25 micros
Residual calculation: 39 micros
Restriction: fw: 37.125 micros
Interpolation + add: 39.125 micros
VCycle: 3417.5 micros


BENCHMARK
Jacobi iteration: 37.125 micros
Residual calculation: 39.125 micros
Restriction: fw: 39 micros
Interpolation + add: 37.125 micros
VCycle: 3437.5 micros


BENCHMARK
Jacobi iteration: 29.375 micros
Residual calculation: 39 micros
Restriction: fw: 37.125 micros
Interpolation + add: 39 micros
VCycle: 3437.5 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 38 micros
Residual calculation: 37.8571 micros
Restriction: fw: 38 micros
Interpolation + add: 38 micros
VCycle: 4374.29 micros


BENCHMARK
Jacobi iteration: 31.2857 micros
Residual calculation: 40.1429 micros
Restriction: fw: 38 micros
Interpolation + add: 37.8571 micros
VCycle: 4375.71 micros


BENCHMARK
Jacobi iteration: 38 micros
Residual calculation: 38 micros
Restriction: fw: 37.8571 micros
Interpolation + add: 40.2857 micros
VCycle: 4374.29 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 39.1667 micros
Residual calculation: 39 micros
Restriction: fw: 36.5 micros
Interpolation + add: 36.5 micros
VCycle: 5363.33 micros


BENCHMARK
Jacobi iteration: 36.5 micros
Residual calculation: 39 micros
Restriction: fw: 39.1667 micros
Interpolation + add: 36.3333 micros
VCycle: 5338.33 micros


BENCHMARK
Jacobi iteration: 39.1667 micros
Residual calculation: 39 micros
Restriction: fw: 36.5 micros
Interpolation + add: 39 micros
VCycle: 5365 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 37.4 micros
Residual calculation: 37.6 micros
Restriction: fw: 40.6 micros
Interpolation + add: 37.4 micros
VCycle: 6312 micros


BENCHMARK
Jacobi iteration: 12.6 micros
Residual calculation: 31.2 micros
Restriction: fw: 37.6 micros
Interpolation + add: 37.4 micros
VCycle: 6312 micros


BENCHMARK
Jacobi iteration: 37.4 micros
Residual calculation: 37.6 micros
Restriction: fw: 37.4 micros
Interpolation + add: 40.6 micros
VCycle: 6282 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 89.75 micros
Residual calculation: 78 micros
Restriction: fw: 74.25 micros
Interpolation + add: 47 micros
VCycle: 7617.5 micros


BENCHMARK
Jacobi iteration: 85.75 micros
Residual calculation: 78.25 micros
Restriction: fw: 70.25 micros
Interpolation + add: 47 micros
VCycle: 7617.5 micros


BENCHMARK
Jacobi iteration: 86 micros
Residual calculation: 74.25 micros
Restriction: fw: 70.25 micros
Interpolation + add: 54.75 micros
VCycle: 7617.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 323 micros
Residual calculation: 307.333 micros
Restriction: fw: 312.333 micros
Interpolation + add: 192.667 micros
VCycle: 10106.7 micros


BENCHMARK
Jacobi iteration: 323 micros
Residual calculation: 302 micros
Restriction: fw: 317.667 micros
Interpolation + add: 187.667 micros
VCycle: 10156.7 micros


BENCHMARK
Jacobi iteration: 317.667 micros
Residual calculation: 317.667 micros
Restriction: fw: 312.667 micros
Interpolation + add: 187.333 micros
VCycle: 10416.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 1211 micros
Residual calculation: 1187.5 micros
Restriction: fw: 1328.5 micros
Interpolation + add: 843.5 micros
VCycle: 24765 micros


BENCHMARK
Jacobi iteration: 1273.5 micros
Residual calculation: 1187.5 micros
Restriction: fw: 1320.5 micros
Interpolation + add: 836 micros
VCycle: 24840 micros


BENCHMARK
Jacobi iteration: 1242.5 micros
Residual calculation: 1210.5 micros
Restriction: fw: 1312.5 micros
Interpolation + add: 852 micros
VCycle: 24765 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 547 micros
Residual calculation: 3844 micros
Restriction: fw: 5687 micros
Interpolation + add: 4453 micros
VCycle: 154220 micros


BENCHMARK
Jacobi iteration: 594 micros
Residual calculation: 3844 micros
Restriction: fw: 5734 micros
Interpolation + add: 4359 micros
VCycle: 155160 micros


BENCHMARK
Jacobi iteration: 578 micros
Residual calculation: 3828 micros
Restriction: fw: 5781 micros
Interpolation + add: 4344 micros
VCycle: 154690 micros


******************************

I also have a result for Vista using an 8600 GTS 720/2100 on an X2 @ ~2.4, but will wait until someone shows me how to stick it in a code box ;).

PeterT
29-May-2007, 21:38
I would put this in a code box if I knew how. Put (code) and (/code) around it, but with [ and ] instead of ( and ). Thanks for the results BTW. All NV results so far fall within expectations. But that's not very surprising given that it was developed mostly on a 8800.

I'd still like some ATI on XP results, to see if the same issue exists there.

neliz
29-May-2007, 21:39
I would put this in a code box if I knew how...

type [ code ] and [ / code ] around it ;)

caffeinated
29-May-2007, 21:47
type [ code ] and [ / code ] around it ;)

Wonderful! :grin: Thank you so much. I will edit my original, and then post the other results.

neliz
29-May-2007, 21:47
It crashed at 2047 with me too, display driver recovered but application seemed to hang.

HD2900 Cat 8.38RC7
Opteron 165
1GB DDR-3200
Vista32

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 20.1 micros
Residual calculation: 19.9 micros
Restriction: fw: 14.2 micros
Interpolation + add: 19.7 micros
VCycle: 445 micros


BENCHMARK
Jacobi iteration: 19.5 micros
Residual calculation: 19.3 micros
Restriction: fw: 13.9 micros
Interpolation + add: 19.7 micros
VCycle: 457 micros


BENCHMARK
Jacobi iteration: 20.2 micros
Residual calculation: 19.3 micros
Restriction: fw: 14 micros
Interpolation + add: 19.1 micros
VCycle: 446 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.6667 micros
Restriction: fw: 14 micros
Interpolation + add: 19.5556 micros
VCycle: 724.444 micros


BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.7778 micros
Restriction: fw: 13.8889 micros
Interpolation + add: 19.6667 micros
VCycle: 725.556 micros


BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.7778 micros
Restriction: fw: 13.8889 micros
Interpolation + add: 19.4444 micros
VCycle: 725.556 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 19.375 micros
Residual calculation: 19.375 micros
Restriction: fw: 13.875 micros
Interpolation + add: 19.625 micros
VCycle: 1005 micros


BENCHMARK
Jacobi iteration: 19.375 micros
Residual calculation: 19.5 micros
Restriction: fw: 13.75 micros
Interpolation + add: 19.5 micros
VCycle: 1020 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 19.75 micros
Restriction: fw: 13.75 micros
Interpolation + add: 19.25 micros
VCycle: 1005 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 19.5714 micros
Residual calculation: 19.7143 micros
Restriction: fw: 14 micros
Interpolation + add: 19.2857 micros
VCycle: 1294.29 micros


BENCHMARK
Jacobi iteration: 19.7143 micros
Residual calculation: 19.7143 micros
Restriction: fw: 14 micros
Interpolation + add: 19.7143 micros
VCycle: 1290 micros


BENCHMARK
Jacobi iteration: 19.7143 micros
Residual calculation: 19.7143 micros
Restriction: fw: 14 micros
Interpolation + add: 19.2857 micros
VCycle: 1290 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.6667 micros
Restriction: fw: 13.8333 micros
Interpolation + add: 19.6667 micros
VCycle: 1606.67 micros


BENCHMARK
Jacobi iteration: 20.3333 micros
Residual calculation: 19.6667 micros
Restriction: fw: 13.8333 micros
Interpolation + add: 19.6667 micros
VCycle: 1576.67 micros


BENCHMARK
Jacobi iteration: 19.8333 micros
Residual calculation: 19.5 micros
Restriction: fw: 14 micros
Interpolation + add: 19.5 micros
VCycle: 1598.33 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 19.8 micros
Restriction: fw: 14.4 micros
Interpolation + add: 19.8 micros
VCycle: 1894 micros


BENCHMARK
Jacobi iteration: 19.6 micros
Residual calculation: 19.6 micros
Restriction: fw: 14 micros
Interpolation + add: 19.4 micros
VCycle: 1874 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20 micros
Restriction: fw: 14.4 micros
Interpolation + add: 19.8 micros
VCycle: 1910 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 20.75 micros
Residual calculation: 20.5 micros
Restriction: fw: 14.75 micros
Interpolation + add: 20.75 micros
VCycle: 4360 micros


BENCHMARK
Jacobi iteration: 19.75 micros
Residual calculation: 19 micros
Restriction: fw: 13.75 micros
Interpolation + add: 19.5 micros
VCycle: 4380 micros


BENCHMARK
Jacobi iteration: 19.5 micros
Residual calculation: 19.5 micros
Restriction: fw: 14 micros
Interpolation + add: 19 micros
VCycle: 4427.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 19.3333 micros
Residual calculation: 19.3333 micros
Restriction: fw: 13.6667 micros
Interpolation + add: 20 micros
VCycle: 20176.7 micros


BENCHMARK
Jacobi iteration: 19.3333 micros
Residual calculation: 19.6667 micros
Restriction: fw: 14 micros
Interpolation + add: 19.6667 micros
VCycle: 22086.7 micros


BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.6667 micros
Restriction: fw: 13.6667 micros
Interpolation + add: 19.3333 micros
VCycle: 22066.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 19 micros
Residual calculation: 19 micros
Restriction: fw: 13.5 micros
Interpolation + add: 19.5 micros
VCycle: 81530 micros


BENCHMARK
Jacobi iteration: 39.5 micros
Residual calculation: 38 micros
Restriction: fw: 41.5 micros
Interpolation + add: 41.5 micros
VCycle: 89600 micros


BENCHMARK
Jacobi iteration: 41.5 micros
Residual calculation: 38 micros
Restriction: fw: 41.5 micros
Interpolation + add: 42.5 micros
VCycle: 89620 micros



------------------
-- size: 2047

caffeinated
29-May-2007, 21:53
Vista 32-bit / Forceware 158.18
AMD X2 4200+ Windsor @ 2.2 Ghz (stock)
Corsair XMS PC2-5400 @ 667
Geforce 8600 GTS @ 720/2100

***************************************

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.1 micros
Residual calculation: 3.2 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.1 micros
VCycle: 6443 micros


BENCHMARK
Jacobi iteration: 3.1 micros
Residual calculation: 4.7 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.1 micros
VCycle: 6505 micros


BENCHMARK
Jacobi iteration: 4.7 micros
Residual calculation: 3.1 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.1 micros
VCycle: 6428 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 3.44444 micros
Restriction: fw: 3.55556 micros
Interpolation + add: 3.44444 micros
VCycle: 10850 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 3.44444 micros
Restriction: fw: 3.55556 micros
Interpolation + add: 3.44444 micros
VCycle: 10850 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 3.44444 micros
Restriction: fw: 3.55556 micros
Interpolation + add: 3.44444 micros
VCycle: 10920 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 3.875 micros
Residual calculation: 3.875 micros
Restriction: fw: 2 micros
Interpolation + add: 3.875 micros
VCycle: 15521.3 micros


BENCHMARK
Jacobi iteration: 3.875 micros
Residual calculation: 4 micros
Restriction: fw: 1.875 micros
Interpolation + add: 3.875 micros
VCycle: 15093.8 micros


BENCHMARK
Jacobi iteration: 3.875 micros
Residual calculation: 2 micros
Restriction: fw: 3.875 micros
Interpolation + add: 3.875 micros
VCycle: 15073.8 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.14286 micros
Residual calculation: 4.57143 micros
Restriction: fw: 4.42857 micros
Interpolation + add: 2.14286 micros
VCycle: 19545.7 micros


BENCHMARK
Jacobi iteration: 2.14286 micros
Residual calculation: 4.57143 micros
Restriction: fw: 4.42857 micros
Interpolation + add: 2.14286 micros
VCycle: 19657.1 micros


BENCHMARK
Jacobi iteration: 4.42857 micros
Residual calculation: 2.28571 micros
Restriction: fw: 4.42857 micros
Interpolation + add: 2.28571 micros
VCycle: 19611.4 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 2.5 micros
Residual calculation: 5.33333 micros
Restriction: fw: 2.5 micros
Interpolation + add: 7.83333 micros
VCycle: 24258.3 micros


BENCHMARK
Jacobi iteration: 2.5 micros
Residual calculation: 2.66667 micros
Restriction: fw: 5.16667 micros
Interpolation + add: 5.16667 micros
VCycle: 24336.7 micros


BENCHMARK
Jacobi iteration: 5.16667 micros
Residual calculation: 2.5 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 7.83333 micros
VCycle: 24595 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 3.2 micros
Residual calculation: 6.2 micros
Restriction: fw: 37.4 micros
Interpolation + add: 37.6 micros
VCycle: 29952 micros


BENCHMARK
Jacobi iteration: 3.2 micros
Residual calculation: 6.2 micros
Restriction: fw: 37.4 micros
Interpolation + add: 34.4 micros
VCycle: 29764 micros


BENCHMARK
Jacobi iteration: 3 micros
Residual calculation: 9.4 micros
Restriction: fw: 34.4 micros
Interpolation + add: 34.2 micros
VCycle: 30296 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 4 micros
Residual calculation: 3.75 micros
Restriction: fw: 101.5 micros
Interpolation + add: 152 micros
VCycle: 40170 micros


BENCHMARK
Jacobi iteration: 3.75 micros
Residual calculation: 0 micros
Restriction: fw: 97.5 micros
Interpolation + add: 152.25 micros
VCycle: 40092.5 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 4 micros
Restriction: fw: 97.5 micros
Interpolation + add: 152 micros
VCycle: 40325 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 5 micros
Restriction: fw: 151 micros
Interpolation + add: 353.333 micros
VCycle: 71086.7 micros


BENCHMARK
Jacobi iteration: 5.33333 micros
Residual calculation: 5 micros
Restriction: fw: 151 micros
Interpolation + add: 358.667 micros
VCycle: 70200 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 5.33333 micros
Restriction: fw: 150.667 micros
Interpolation + add: 359 micros
VCycle: 70096.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 7.5 micros
Interpolation + add: 304.5 micros
VCycle: 191490 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 7.5 micros
Restriction: fw: 0 micros
Interpolation + add: 304.5 micros
VCycle: 191565 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 8 micros
Restriction: fw: 0 micros
Interpolation + add: 304 micros
VCycle: 191410 micros



------------------
-- size: 2047


*************************

It did not complete the 2047 test; I tried it twice and both times the display driver reset itself in the middle of the test (it likely ran out of VRAM since it only has 256MB).

neliz
29-May-2007, 21:54
Peter, do you have an older version of the App (the January one perhaps?) that we could test with? don't know if much has changed though but I'm curious why all the Radeon's bug out at 2047

P.S. now I know why I bought a HD2900, to be a master at theoretical benchmarking! w00t!

PeterT
29-May-2007, 21:55
It crashed at 2047 with me too, display driver recovered but application seemed to hang.Interesting. It showing signs of instability (as well as your V-cycle numbers) suggests that it's actually doing something, even though you're running Catalyst on Vista. But the other numbers just don't work. Now I'm quite confused.

AlexV
29-May-2007, 21:57
It crashed at 2047 with me too, display driver recovered but application seemed to hang.

HD2900 Cat 8.38RC7
Opteron 165
1GB DDR-3200
Vista32

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 20.1 micros
Residual calculation: 19.9 micros
Restriction: fw: 14.2 micros
Interpolation + add: 19.7 micros
VCycle: 445 micros


BENCHMARK
Jacobi iteration: 19.5 micros
Residual calculation: 19.3 micros
Restriction: fw: 13.9 micros
Interpolation + add: 19.7 micros
VCycle: 457 micros


BENCHMARK
Jacobi iteration: 20.2 micros
Residual calculation: 19.3 micros
Restriction: fw: 14 micros
Interpolation + add: 19.1 micros
VCycle: 446 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.6667 micros
Restriction: fw: 14 micros
Interpolation + add: 19.5556 micros
VCycle: 724.444 micros


BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.7778 micros
Restriction: fw: 13.8889 micros
Interpolation + add: 19.6667 micros
VCycle: 725.556 micros


BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.7778 micros
Restriction: fw: 13.8889 micros
Interpolation + add: 19.4444 micros
VCycle: 725.556 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 19.375 micros
Residual calculation: 19.375 micros
Restriction: fw: 13.875 micros
Interpolation + add: 19.625 micros
VCycle: 1005 micros


BENCHMARK
Jacobi iteration: 19.375 micros
Residual calculation: 19.5 micros
Restriction: fw: 13.75 micros
Interpolation + add: 19.5 micros
VCycle: 1020 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 19.75 micros
Restriction: fw: 13.75 micros
Interpolation + add: 19.25 micros
VCycle: 1005 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 19.5714 micros
Residual calculation: 19.7143 micros
Restriction: fw: 14 micros
Interpolation + add: 19.2857 micros
VCycle: 1294.29 micros


BENCHMARK
Jacobi iteration: 19.7143 micros
Residual calculation: 19.7143 micros
Restriction: fw: 14 micros
Interpolation + add: 19.7143 micros
VCycle: 1290 micros


BENCHMARK
Jacobi iteration: 19.7143 micros
Residual calculation: 19.7143 micros
Restriction: fw: 14 micros
Interpolation + add: 19.2857 micros
VCycle: 1290 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.6667 micros
Restriction: fw: 13.8333 micros
Interpolation + add: 19.6667 micros
VCycle: 1606.67 micros


BENCHMARK
Jacobi iteration: 20.3333 micros
Residual calculation: 19.6667 micros
Restriction: fw: 13.8333 micros
Interpolation + add: 19.6667 micros
VCycle: 1576.67 micros


BENCHMARK
Jacobi iteration: 19.8333 micros
Residual calculation: 19.5 micros
Restriction: fw: 14 micros
Interpolation + add: 19.5 micros
VCycle: 1598.33 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 19.8 micros
Restriction: fw: 14.4 micros
Interpolation + add: 19.8 micros
VCycle: 1894 micros


BENCHMARK
Jacobi iteration: 19.6 micros
Residual calculation: 19.6 micros
Restriction: fw: 14 micros
Interpolation + add: 19.4 micros
VCycle: 1874 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20 micros
Restriction: fw: 14.4 micros
Interpolation + add: 19.8 micros
VCycle: 1910 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 20.75 micros
Residual calculation: 20.5 micros
Restriction: fw: 14.75 micros
Interpolation + add: 20.75 micros
VCycle: 4360 micros


BENCHMARK
Jacobi iteration: 19.75 micros
Residual calculation: 19 micros
Restriction: fw: 13.75 micros
Interpolation + add: 19.5 micros
VCycle: 4380 micros


BENCHMARK
Jacobi iteration: 19.5 micros
Residual calculation: 19.5 micros
Restriction: fw: 14 micros
Interpolation + add: 19 micros
VCycle: 4427.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 19.3333 micros
Residual calculation: 19.3333 micros
Restriction: fw: 13.6667 micros
Interpolation + add: 20 micros
VCycle: 20176.7 micros


BENCHMARK
Jacobi iteration: 19.3333 micros
Residual calculation: 19.6667 micros
Restriction: fw: 14 micros
Interpolation + add: 19.6667 micros
VCycle: 22086.7 micros


BENCHMARK
Jacobi iteration: 19.6667 micros
Residual calculation: 19.6667 micros
Restriction: fw: 13.6667 micros
Interpolation + add: 19.3333 micros
VCycle: 22066.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 19 micros
Residual calculation: 19 micros
Restriction: fw: 13.5 micros
Interpolation + add: 19.5 micros
VCycle: 81530 micros


BENCHMARK
Jacobi iteration: 39.5 micros
Residual calculation: 38 micros
Restriction: fw: 41.5 micros
Interpolation + add: 41.5 micros
VCycle: 89600 micros


BENCHMARK
Jacobi iteration: 41.5 micros
Residual calculation: 38 micros
Restriction: fw: 41.5 micros
Interpolation + add: 42.5 micros
VCycle: 89620 micros



------------------
-- size: 2047


But at least it seems to work right for you. And may I add that I hate you for being able to use the 8.38s(Nah, just kidding:) ). I`d like to second what Dave said:it certainly is NOT running at 3D speeds-the fan doesn`t spin-up at all.

PeterT
29-May-2007, 21:59
Peter, do you have an older version of the App (the January one perhaps?) that we could test with? don't know if much has changed though but I'm curious why all the Radeon's bug out at 2047

P.S. now I know why I bought a HD2900, to be a master at theoretical benchmarking! w00t!A lot has changed, basically the January version took completely wrong measurements. There have also been lots of optimizations. I'll try to whip together a functionality testing app that will produce (or fail to) a result showing whether the computation actually was carried out.

Tim Murray
29-May-2007, 22:04
Full screen rendering.
So no windowed app will trigger 3D clocks?

neliz
29-May-2007, 22:04
Interesting. It showing signs of instability (as well as your V-cycle numbers) suggests that it's actually doing something, even though you're running Catalyst on Vista. But the other numbers just don't work. Now I'm quite confused.

I ran your old fbench too, don't know if it helps anything...


GL filter framework 1.4 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
- DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
Results for BufferCreateINT: msecs: 457 || ms/i: 76.1667 || i/s: 13.1291
No suitable INT format found. Trying FP... (Flaky 6x00 workaround)

Results for BufferCreateINT16: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
Results for BufferCreateFP16: msecs: 388 || ms/i: 64.6667 || i/s: 15.4639
Results for BufferCreateFP32: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
Results for JustCopy: msecs: 1511 || ms/i: 0.7555 || i/s: 1323.63
Results for SimpleSmooth: msecs: 1500 || ms/i: 0.75 || i/s: 1333.33
Results for TexNoise: msecs: 1537 || ms/i: 0.7685 || i/s: 1301.24
Results for 3x3Conv: msecs: 773 || ms/i: 0.773 || i/s: 1293.66
Results for TEncode: msecs: 754 || ms/i: 0.754 || i/s: 1326.26
Results for TDecode: msecs: 774 || ms/i: 0.774 || i/s: 1291.99
Results for LinDiffINT: msecs: 1508 || ms/i: 0.754 || i/s: 1326.26
Results for LinDiffINT16: msecs: 1510 || ms/i: 0.755 || i/s: 1324.5
Results for LinDiffFP16: msecs: 1507 || ms/i: 0.7535 || i/s: 1327.14
Results for LinDiffFP32: msecs: 1534 || ms/i: 0.767 || i/s: 1303.78
Results for LD_INT->FP16: msecs: 772 || ms/i: 0.772 || i/s: 1295.34
Results for LD_INT->FP32: msecs: 747 || ms/i: 0.747 || i/s: 1338.69
Results for LD_FP16->INT: msecs: 749 || ms/i: 0.749 || i/s: 1335.11
Results for LD_FP32->INT: msecs: 769 || ms/i: 0.769 || i/s: 1300.39
Results for PMTEncoded: msecs: 2352 || ms/i: 2.352 || i/s: 425.17
Results for PMStandard: msecs: 2350 || ms/i: 2.35 || i/s: 425.532
Results for PMBuffered: msecs: 96 || ms/i: 0.192 || i/s: 5208.33

Testing 64x64 image:
Results for BufferCreateINT: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
Results for BufferCreateINT16: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
Results for BufferCreateFP16: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
Results for BufferCreateFP32: msecs: 380 || ms/i: 63.3333 || i/s: 15.7895
Results for JustCopy: msecs: 1487 || ms/i: 0.7435 || i/s: 1344.99
Results for SimpleSmooth: msecs: 1525 || ms/i: 0.7625 || i/s: 1311.48
Results for TexNoise: msecs: 1524 || ms/i: 0.762 || i/s: 1312.34
Results for 3x3Conv: msecs: 768 || ms/i: 0.768 || i/s: 1302.08
Results for TEncode: msecs: 757 || ms/i: 0.757 || i/s: 1321
Results for TDecode: msecs: 759 || ms/i: 0.759 || i/s: 1317.52
Results for LinDiffINT: msecs: 1527 || ms/i: 0.7635 || i/s: 1309.76
Results for LinDiffINT16: msecs: 1559 || ms/i: 0.7795 || i/s: 1282.87
Results for LinDiffFP16: msecs: 1511 || ms/i: 0.7555 || i/s: 1323.63
Results for LinDiffFP32: msecs: 1513 || ms/i: 0.7565 || i/s: 1321.88
Results for LD_INT->FP16: msecs: 763 || ms/i: 0.763 || i/s: 1310.62
Results for LD_INT->FP32: msecs: 762 || ms/i: 0.762 || i/s: 1312.34
Results for LD_FP16->INT: msecs: 747 || ms/i: 0.747 || i/s: 1338.69
Results for LD_FP32->INT: msecs: 748 || ms/i: 0.748 || i/s: 1336.9
Results for PMTEncoded: msecs: 2341 || ms/i: 2.341 || i/s: 427.168
Results for PMStandard: msecs: 2349 || ms/i: 2.349 || i/s: 425.713
Results for PMBuffered: msecs: 98 || ms/i: 0.196 || i/s: 5102.04

Testing 128x128 image:
Results for BufferCreateINT: msecs: 384 || ms/i: 64 || i/s: 15.625
Results for BufferCreateINT16: msecs: 383 || ms/i: 63.8333 || i/s: 15.6658
Results for BufferCreateFP16: msecs: 381 || ms/i: 63.5 || i/s: 15.748
Results for BufferCreateFP32: msecs: 380 || ms/i: 63.3333 || i/s: 15.7895
Results for JustCopy: msecs: 1503 || ms/i: 0.7515 || i/s: 1330.67
Results for SimpleSmooth: msecs: 1494 || ms/i: 0.747 || i/s: 1338.69
Results for TexNoise: msecs: 1540 || ms/i: 0.77 || i/s: 1298.7
Results for 3x3Conv: msecs: 761 || ms/i: 0.761 || i/s: 1314.06
Results for TEncode: msecs: 759 || ms/i: 0.759 || i/s: 1317.52
Results for TDecode: msecs: 775 || ms/i: 0.775 || i/s: 1290.32
Results for LinDiffINT: msecs: 1558 || ms/i: 0.779 || i/s: 1283.7
Results for LinDiffINT16: msecs: 1523 || ms/i: 0.7615 || i/s: 1313.2
Results for LinDiffFP16: msecs: 1553 || ms/i: 0.7765 || i/s: 1287.83
Results for LinDiffFP32: msecs: 1537 || ms/i: 0.7685 || i/s: 1301.24
Results for LD_INT->FP16: msecs: 773 || ms/i: 0.773 || i/s: 1293.66
Results for LD_INT->FP32: msecs: 782 || ms/i: 0.782 || i/s: 1278.77
Results for LD_FP16->INT: msecs: 766 || ms/i: 0.766 || i/s: 1305.48
Results for LD_FP32->INT: msecs: 760 || ms/i: 0.76 || i/s: 1315.79
Results for PMTEncoded: msecs: 2374 || ms/i: 2.374 || i/s: 421.23
Results for PMStandard: msecs: 2348 || ms/i: 2.348 || i/s: 425.894
Results for PMBuffered: msecs: 97 || ms/i: 0.194 || i/s: 5154.64

Testing 256x256 image:
Results for BufferCreateINT: msecs: 387 || ms/i: 64.5 || i/s: 15.5039
Results for BufferCreateINT16: msecs: 383 || ms/i: 63.8333 || i/s: 15.6658
Results for BufferCreateFP16: msecs: 382 || ms/i: 63.6667 || i/s: 15.7068
Results for BufferCreateFP32: msecs: 381 || ms/i: 63.5 || i/s: 15.748
Results for JustCopy: msecs: 1519 || ms/i: 0.7595 || i/s: 1316.66
Results for SimpleSmooth: msecs: 1487 || ms/i: 0.7435 || i/s: 1344.99
Results for TexNoise: msecs: 1507 || ms/i: 0.7535 || i/s: 1327.14
Results for 3x3Conv: msecs: 758 || ms/i: 0.758 || i/s: 1319.26
Results for TEncode: msecs: 767 || ms/i: 0.767 || i/s: 1303.78
Results for TDecode: msecs: 769 || ms/i: 0.769 || i/s: 1300.39
Results for LinDiffINT: msecs: 1539 || ms/i: 0.7695 || i/s: 1299.55
Results for LinDiffINT16: msecs: 1526 || ms/i: 0.763 || i/s: 1310.62
Results for LinDiffFP16: msecs: 1526 || ms/i: 0.763 || i/s: 1310.62
Results for LinDiffFP32: msecs: 1557 || ms/i: 0.7785 || i/s: 1284.52
Results for LD_INT->FP16: msecs: 762 || ms/i: 0.762 || i/s: 1312.34
Results for LD_INT->FP32: msecs: 762 || ms/i: 0.762 || i/s: 1312.34
Results for LD_FP16->INT: msecs: 761 || ms/i: 0.761 || i/s: 1314.06
Results for LD_FP32->INT: msecs: 758 || ms/i: 0.758 || i/s: 1319.26
Results for PMTEncoded: msecs: 2368 || ms/i: 2.368 || i/s: 422.297
Results for PMStandard: msecs: 2407 || ms/i: 2.407 || i/s: 415.455
Results for PMBuffered: msecs: 118 || ms/i: 0.236 || i/s: 4237.29

Testing 512x512 image:
Results for BufferCreateINT: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
Results for BufferCreateINT16: msecs: 385 || ms/i: 64.1667 || i/s: 15.5844
Results for BufferCreateFP16: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
Results for BufferCreateFP32: msecs: 383 || ms/i: 63.8333 || i/s: 15.6658
Results for JustCopy: msecs: 752 || ms/i: 0.752 || i/s: 1329.79
Results for SimpleSmooth: msecs: 776 || ms/i: 0.776 || i/s: 1288.66
Results for TexNoise: msecs: 782 || ms/i: 0.782 || i/s: 1278.77
Results for 3x3Conv: msecs: 391 || ms/i: 0.782 || i/s: 1278.77
Results for TEncode: msecs: 391 || ms/i: 0.782 || i/s: 1278.77
Results for TDecode: msecs: 394 || ms/i: 0.788 || i/s: 1269.04
Results for LinDiffINT: msecs: 798 || ms/i: 0.798 || i/s: 1253.13
Results for LinDiffINT16: msecs: 786 || ms/i: 0.786 || i/s: 1272.26
Results for LinDiffFP16: msecs: 780 || ms/i: 0.78 || i/s: 1282.05
Results for LinDiffFP32: msecs: 781 || ms/i: 0.781 || i/s: 1280.41
Results for LD_INT->FP16: msecs: 401 || ms/i: 0.802 || i/s: 1246.88
Results for LD_INT->FP32: msecs: 393 || ms/i: 0.786 || i/s: 1272.26
Results for LD_FP16->INT: msecs: 391 || ms/i: 0.782 || i/s: 1278.77
Results for LD_FP32->INT: msecs: 390 || ms/i: 0.78 || i/s: 1282.05
Results for PMTEncoded: msecs: 1245 || ms/i: 2.49 || i/s: 401.606
Results for PMStandard: msecs: 1248 || ms/i: 2.496 || i/s: 400.641
Results for PMBuffered: msecs: 77 || ms/i: 0.308 || i/s: 3246.75

Testing 1024x1024 image:
Results for BufferCreateINT: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
Results for BufferCreateINT16: msecs: 395 || ms/i: 65.8333 || i/s: 15.1899
Results for BufferCreateFP16: msecs: 389 || ms/i: 64.8333 || i/s: 15.4242
Results for BufferCreateFP32: msecs: 392 || ms/i: 65.3333 || i/s: 15.3061
Results for JustCopy: msecs: 765 || ms/i: 0.765 || i/s: 1307.19
Results for SimpleSmooth: msecs: 768 || ms/i: 0.768 || i/s: 1302.08
Results for TexNoise: msecs: 777 || ms/i: 0.777 || i/s: 1287
Results for 3x3Conv: msecs: 496 || ms/i: 0.992 || i/s: 1008.06
Results for TEncode: msecs: 398 || ms/i: 0.796 || i/s: 1256.28
Results for TDecode: msecs: 403 || ms/i: 0.806 || i/s: 1240.69
Results for LinDiffINT: msecs: 798 || ms/i: 0.798 || i/s: 1253.13
Results for LinDiffINT16: msecs: 784 || ms/i: 0.784 || i/s: 1275.51
Results for LinDiffFP16: msecs: 785 || ms/i: 0.785 || i/s: 1273.89
Results for LinDiffFP32: msecs: 877 || ms/i: 0.877 || i/s: 1140.25
Results for LD_INT->FP16: msecs: 405 || ms/i: 0.81 || i/s: 1234.57
Results for LD_INT->FP32: msecs: 405 || ms/i: 0.81 || i/s: 1234.57
Results for LD_FP16->INT: msecs: 397 || ms/i: 0.794 || i/s: 1259.45
Results for LD_FP32->INT: msecs: 390 || ms/i: 0.78 || i/s: 1282.05
Results for PMTEncoded: msecs: 1251 || ms/i: 2.502 || i/s: 399.68
Results for PMStandard: msecs: 1549 || ms/i: 3.098 || i/s: 322.789
Results for PMBuffered: msecs: 91 || ms/i: 0.364 || i/s: 2747.25

neliz
29-May-2007, 22:09
But at least it seems to work right for you. And may I add that I hate you for being able to use the 8.38s(Nah, just kidding:) ). I`d like to second what Dave said:it certainly is NOT running at 3D speeds-the fan doesn`t spin-up at all.

Uhm.. my fan going *HNGGGGGGGGRRRRRRRRRRRRRRRHHH* suggests otherwise... (no 3D mode though, but there is an increase in fan speed)

FrameBuffer
29-May-2007, 22:16
System Info:

CPU: AMD Opteron 165 (2x1MB)
Mem: 2GB PC3200
OS: Windows Vista
Driver: 7.4 Vista
Graphics: ATI Radeon 1950 XT 512MB (@Default)

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 19.9 micros
Residual calculation: 32.8 micros
Restriction: fw: 28.7 micros
Interpolation + add: 33.9 micros
VCycle: 549 micros


BENCHMARK
Jacobi iteration: 34.2 micros
Residual calculation: 46.4 micros
Restriction: fw: 43.3 micros
Interpolation + add: 40.8 micros
VCycle: 476 micros


BENCHMARK
Jacobi iteration: 22.7 micros
Residual calculation: 25.6 micros
Restriction: fw: 24.1 micros
Interpolation + add: 40.4 micros
VCycle: 908 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 27.1111 micros
Residual calculation: 44.4444 micros
Restriction: fw: 22.7778 micros
Interpolation + add: 22 micros
VCycle: 823.333 micros


BENCHMARK
Jacobi iteration: 22 micros
Residual calculation: 24.4444 micros
Restriction: fw: 16.3333 micros
Interpolation + add: 38.8889 micros
VCycle: 893.333 micros


BENCHMARK
Jacobi iteration: 20.5556 micros
Residual calculation: 20.4444 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 20.1111 micros
VCycle: 703.333 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 19.75 micros
Restriction: fw: 15.125 micros
Interpolation + add: 19.875 micros
VCycle: 972.5 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20.5 micros
Restriction: fw: 15 micros
Interpolation + add: 20.25 micros
VCycle: 962.5 micros


BENCHMARK
Jacobi iteration: 19.875 micros
Residual calculation: 21.625 micros
Restriction: fw: 15.375 micros
Interpolation + add: 20.5 micros
VCycle: 977.5 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 19.8571 micros
Residual calculation: 19.7143 micros
Restriction: fw: 15 micros
Interpolation + add: 21.8571 micros
VCycle: 1221.43 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20.2857 micros
Restriction: fw: 15 micros
Interpolation + add: 20.1429 micros
VCycle: 1497.14 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 19.8571 micros
Restriction: fw: 15 micros
Interpolation + add: 20.1429 micros
VCycle: 1241.43 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 20.6667 micros
Residual calculation: 21 micros
Restriction: fw: 15.8333 micros
Interpolation + add: 20.3333 micros
VCycle: 1535 micros


BENCHMARK
Jacobi iteration: 20.5 micros
Residual calculation: 20.5 micros
Restriction: fw: 15.3333 micros
Interpolation + add: 21 micros
VCycle: 2961.67 micros


BENCHMARK
Jacobi iteration: 22.1667 micros
Residual calculation: 22.1667 micros
Restriction: fw: 15.5 micros
Interpolation + add: 21.1667 micros
VCycle: 1541.67 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20.2 micros
Restriction: fw: 14.6 micros
Interpolation + add: 20.4 micros
VCycle: 1762 micros


BENCHMARK
Jacobi iteration: 20.2 micros
Residual calculation: 19.8 micros
Restriction: fw: 14.6 micros
Interpolation + add: 19.8 micros
VCycle: 1800 micros


BENCHMARK
Jacobi iteration: 20.2 micros
Residual calculation: 20.4 micros
Restriction: fw: 15.2 micros
Interpolation + add: 20.2 micros
VCycle: 1806 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 20.25 micros
Residual calculation: 19.75 micros
Restriction: fw: 15.5 micros
Interpolation + add: 19.5 micros
VCycle: 2030 micros


BENCHMARK
Jacobi iteration: 19.75 micros
Residual calculation: 20 micros
Restriction: fw: 14.5 micros
Interpolation + add: 20.5 micros
VCycle: 2040 micros


BENCHMARK
Jacobi iteration: 20.25 micros
Residual calculation: 20 micros
Restriction: fw: 15 micros
Interpolation + add: 19.75 micros
VCycle: 2067.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 20.3333 micros
Residual calculation: 20.6667 micros
Restriction: fw: 15 micros
Interpolation + add: 20 micros
VCycle: 2343.33 micros


BENCHMARK
Jacobi iteration: 20.3333 micros
Residual calculation: 20.6667 micros
Restriction: fw: 15 micros
Interpolation + add: 20.3333 micros
VCycle: 2330 micros


BENCHMARK
Jacobi iteration: 20.6667 micros
Residual calculation: 20 micros
Restriction: fw: 15.3333 micros
Interpolation + add: 20 micros
VCycle: 2330 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 22.5 micros
Restriction: fw: 14.5 micros
Interpolation + add: 20 micros
VCycle: 2575 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20 micros
Restriction: fw: 15 micros
Interpolation + add: 19.5 micros
VCycle: 2575 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20.5 micros
Restriction: fw: 15 micros
Interpolation + add: 20 micros
VCycle: 2580 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 40 micros
Residual calculation: 423 micros
Restriction: fw: 19 micros
Interpolation + add: 23 micros
VCycle: 3890 micros


BENCHMARK
Jacobi iteration: 21 micros
Residual calculation: 20 micros
Restriction: fw: 15 micros
Interpolation + add: 20 micros
VCycle: 3090 micros


BENCHMARK
Jacobi iteration: 20 micros
Residual calculation: 20 micros
Restriction: fw: 14 micros
Interpolation + add: 20 micros
VCycle: 2910 micros

neliz
29-May-2007, 22:17
btw, the error code

Faulting application GPUMGbench.exe,
version 0.0.0.0,
time stamp 0x465c4e20,
faulting module atioglxx.dll,
version 6.14.10.6590,
time stamp 0x464d02ab,
exception code 0xc0000005,
fault offset 0x0050256c,
process id 0x590,
application start time 0x01c7a231789f2b57.

AlexV
29-May-2007, 22:17
Uhm.. my fan going *HNGGGGGGGGRRRRRRRRRRRRRRRHHH* suggests otherwise... (no 3D mode though, but there is an increase in fan speed)

You`re also the only one getting results that are in line with what they should be(mostly). And I recall there was talk of a bug in the 8.38 making the fan spin-up even in desktop mode so...;)

neliz
29-May-2007, 22:19
You`re also the only one getting results that are in line with what they should be(mostly). And I recall there was talk of a bug in the 8.38 making the fan spin-up even in desktop mode so...;)

It sure is spinning in DT! making it much less comfortable than my first experience with the card

Dave Baumann
29-May-2007, 22:30
So no windowed app will trigger 3D clocks?
On the standard desktop Radeons this is the case (including things like SPECviewperf).

AlexV
29-May-2007, 22:34
Anybody know what`s the deviceID for an 8800GTX?As in one that would be useable in 3DAnalyze?

FrameBuffer
29-May-2007, 22:58
System Info:

CPU: AMD Opteron 165 (2x1MB)
Mem: 2GB PC3200
OS: Windows XP 32bit
Driver: 7.4 32bit
Graphics: ATI Radeon 1950 XT 512MB (@Default)

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 4.6 micros
Residual calculation: 6.3 micros
Restriction: fw: 6.2 micros
Interpolation + add: 4.7 micros
VCycle: 672 micros


BENCHMARK
Jacobi iteration: 6.2 micros
Residual calculation: 6.3 micros
Restriction: fw: 4.6 micros
Interpolation + add: 6.3 micros
VCycle: 656 micros


BENCHMARK
Jacobi iteration: 6.3 micros
Residual calculation: 4.7 micros
Restriction: fw: 6.2 micros
Interpolation + add: 6.3 micros
VCycle: 656 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 5.22222 micros
Residual calculation: 7 micros
Restriction: fw: 5.22222 micros
Interpolation + add: 6.88889 micros
VCycle: 1076.67 micros


BENCHMARK
Jacobi iteration: 7 micros
Residual calculation: 5.22222 micros
Restriction: fw: 5.22222 micros
Interpolation + add: 5.11111 micros
VCycle: 1094.44 micros


BENCHMARK
Jacobi iteration: 7 micros
Residual calculation: 5.22222 micros
Restriction: fw: 5.11111 micros
Interpolation + add: 7 micros
VCycle: 1093.33 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 5.875 micros
Residual calculation: 5.875 micros
Restriction: fw: 5.875 micros
Interpolation + add: 5.75 micros
VCycle: 1505 micros


BENCHMARK
Jacobi iteration: 5.875 micros
Residual calculation: 5.875 micros
Restriction: fw: 5.875 micros
Interpolation + add: 3.875 micros
VCycle: 1523.75 micros


BENCHMARK
Jacobi iteration: 5.875 micros
Residual calculation: 5.875 micros
Restriction: fw: 5.875 micros
Interpolation + add: 3.875 micros
VCycle: 1523.75 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 6.57143 micros
Residual calculation: 6.71429 micros
Restriction: fw: 4.57143 micros
Interpolation + add: 6.57143 micros
VCycle: 1942.86 micros


BENCHMARK
Jacobi iteration: 6.57143 micros
Residual calculation: 4.57143 micros
Restriction: fw: 8.85714 micros
Interpolation + add: 6.71429 micros
VCycle: 1941.43 micros


BENCHMARK
Jacobi iteration: 6.71429 micros
Residual calculation: 6.71429 micros
Restriction: fw: 4.42857 micros
Interpolation + add: 6.71429 micros
VCycle: 1941.43 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 13.1667 micros
Residual calculation: 13 micros
Restriction: fw: 15.5 micros
Interpolation + add: 10.5 micros
VCycle: 2448.33 micros


BENCHMARK
Jacobi iteration: 7.83333 micros
Residual calculation: 15.6667 micros
Restriction: fw: 13 micros
Interpolation + add: 10.3333 micros
VCycle: 2448.33 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 5.16667 micros
Restriction: fw: 13 micros
Interpolation + add: 10.5 micros
VCycle: 2446.67 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 71.8 micros
Residual calculation: 25 micros
Restriction: fw: 59.4 micros
Interpolation + add: 34.4 micros
VCycle: 3250 micros


BENCHMARK
Jacobi iteration: 46.8 micros
Residual calculation: 68.8 micros
Restriction: fw: 40.6 micros
Interpolation + add: 34.4 micros
VCycle: 3220 micros


BENCHMARK
Jacobi iteration: 46.8 micros
Residual calculation: 109.4 micros
Restriction: fw: 6.2 micros
Interpolation + add: 28.2 micros
VCycle: 3218 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 449.25 micros
Residual calculation: 15.5 micros
Restriction: fw: 140.75 micros
Interpolation + add: 234.25 micros
VCycle: 4532.5 micros


BENCHMARK
Jacobi iteration: 86 micros
Residual calculation: 50.75 micros
Restriction: fw: 390.5 micros
Interpolation + add: 4 micros
VCycle: 5272.5 micros


BENCHMARK
Jacobi iteration: 421.75 micros
Residual calculation: 7.75 micros
Restriction: fw: 363.25 micros
Interpolation + add: 15.75 micros
VCycle: 4805 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 1682.33 micros
Residual calculation: 1093.67 micros
Restriction: fw: 5.33333 micros
Interpolation + add: 1260.33 micros
VCycle: 8123.33 micros


BENCHMARK
Jacobi iteration: 2005.33 micros
Residual calculation: 5.33333 micros
Restriction: fw: 1380 micros
Interpolation + add: 343.667 micros
VCycle: 11250 micros


BENCHMARK
Jacobi iteration: 1937.67 micros
Residual calculation: 5 micros
Restriction: fw: 1380.33 micros
Interpolation + add: 390.667 micros
VCycle: 11510 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 8523.5 micros
Residual calculation: 3156.5 micros
Restriction: fw: 2390.5 micros
Interpolation + add: 2594 micros
VCycle: 30155 micros


BENCHMARK
Jacobi iteration: 8352 micros
Residual calculation: 3070 micros
Restriction: fw: 2367.5 micros
Interpolation + add: 2390.5 micros
VCycle: 33200 micros


BENCHMARK
Jacobi iteration: 8453 micros
Residual calculation: 3312.5 micros
Restriction: fw: 2258 micros
Interpolation + add: 2710.5 micros
VCycle: 29535 micros



------------------

BlizzardOne
30-May-2007, 00:03
On the standard desktop Radeons this is the case (including things like SPECviewperf).

I wish there was a toggle for this in the CCC.

Geeforcer
30-May-2007, 00:16
Peter, I see that the program is single-threaded, any way you can make it take advantage of multiple cores?

Sytem: E6600@3200Mhz, 2GB RAM
8800 GTX, Forceware 160.03
WinXP Pro

GL multigrid benchmark 0.5 by Peter Thoman 2007
DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

------------------
-- size: 3
BENCHMARK
Jacobi iteration: 3.1 micros
Residual calculation: 1.6 micros
Restriction: fw: 1.5 micros
Interpolation + add: 1.6 micros
VCycle: 391 micros

BENCHMARK
Jacobi iteration: 1.5 micros
Residual calculation: 1.6 micros
Restriction: fw: 1.5 micros
Interpolation + add: 1.6 micros
VCycle: 391 micros

BENCHMARK
Jacobi iteration: 1.5 micros
Residual calculation: 1.6 micros
Restriction: fw: 1.5 micros
Interpolation + add: 1.6 micros
VCycle: 391 micros

------------------
-- size: 7
BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 0 micros
Restriction: fw: 1.77778 micros
Interpolation + add: 3.44444 micros
VCycle: 660 micros

BENCHMARK
Jacobi iteration: 1.66667 micros
Residual calculation: 1.77778 micros
Restriction: fw: 1.77778 micros
Interpolation + add: 1.66667 micros
VCycle: 660 micros

BENCHMARK
Jacobi iteration: 1.77778 micros
Residual calculation: 1.66667 micros
Restriction: fw: 1.77778 micros
Interpolation + add: 1.66667 micros
VCycle: 660 micros

------------------
-- size: 15
BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 1.875 micros
Restriction: fw: 2 micros
Interpolation + add: 2 micros
VCycle: 937.5 micros

BENCHMARK
Jacobi iteration: 2 micros
Residual calculation: 1.875 micros
Restriction: fw: 2 micros
Interpolation + add: 0 micros
VCycle: 937.5 micros

BENCHMARK
Jacobi iteration: 2 micros
Residual calculation: 1.875 micros
Restriction: fw: 2 micros
Interpolation + add: 2 micros
VCycle: 917.5 micros

------------------
-- size: 31
BENCHMARK
Jacobi iteration: 2.14286 micros
Residual calculation: 2.28571 micros
Restriction: fw: 2.14286 micros
Interpolation + add: 0 micros
VCycle: 1205.71 micros

BENCHMARK
Jacobi iteration: 2.28571 micros
Residual calculation: 2.28571 micros
Restriction: fw: 2.14286 micros
Interpolation + add: 0 micros
VCycle: 1205.71 micros

BENCHMARK
Jacobi iteration: 2.28571 micros
Residual calculation: 2.28571 micros
Restriction: fw: 2.14286 micros
Interpolation + add: 0 micros
VCycle: 1205.71 micros

------------------
-- size: 63
BENCHMARK
Jacobi iteration: 2.66667 micros
Residual calculation: 5.16667 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 2.5 micros
VCycle: 1458.33 micros

BENCHMARK
Jacobi iteration: 2.66667 micros
Residual calculation: 5.16667 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 2.5 micros
VCycle: 1485 micros

BENCHMARK
Jacobi iteration: 5.16667 micros
Residual calculation: 2.66667 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 2.5 micros
VCycle: 1485 micros

------------------
-- size: 127
BENCHMARK
Jacobi iteration: 12.6 micros
Residual calculation: 12.4 micros
Restriction: fw: 9.4 micros
Interpolation + add: 6.2 micros
VCycle: 1782 micros

BENCHMARK
Jacobi iteration: 12.4 micros
Residual calculation: 12.6 micros
Restriction: fw: 9.4 micros
Interpolation + add: 6.2 micros
VCycle: 1782 micros

BENCHMARK
Jacobi iteration: 12.4 micros
Residual calculation: 12.6 micros
Restriction: fw: 9.4 micros
Interpolation + add: 6.2 micros
VCycle: 1782 micros

------------------
-- size: 255
BENCHMARK
Jacobi iteration: 50.75 micros
Residual calculation: 47 micros
Restriction: fw: 50.75 micros
Interpolation + add: 31.25 micros
VCycle: 2227.5 micros

BENCHMARK
Jacobi iteration: 50.75 micros
Residual calculation: 46.75 micros
Restriction: fw: 47 micros
Interpolation + add: 35 micros
VCycle: 2227.5 micros

BENCHMARK
Jacobi iteration: 50.75 micros
Residual calculation: 47 micros
Restriction: fw: 50.75 micros
Interpolation + add: 31.25 micros
VCycle: 2225 micros

------------------
-- size: 511
BENCHMARK
Jacobi iteration: 192.667 micros
Residual calculation: 197.667 micros
Restriction: fw: 187.667 micros
Interpolation + add: 125 micros
VCycle: 3853.33 micros

BENCHMARK
Jacobi iteration: 192.667 micros
Residual calculation: 192.667 micros
Restriction: fw: 187.333 micros
Interpolation + add: 130.333 micros
VCycle: 3906.67 micros

BENCHMARK
Jacobi iteration: 192.667 micros
Residual calculation: 192.667 micros
Restriction: fw: 187.667 micros
Interpolation + add: 130 micros
VCycle: 3856.67 micros

------------------
-- size: 1023
BENCHMARK
Jacobi iteration: 812.5 micros
Residual calculation: 789 micros
Restriction: fw: 851.5 micros
Interpolation + add: 554.5 micros
VCycle: 15785 micros

BENCHMARK
Jacobi iteration: 812.5 micros
Residual calculation: 789 micros
Restriction: fw: 859.5 micros
Interpolation + add: 539 micros
VCycle: 15860 micros

BENCHMARK
Jacobi iteration: 804.5 micros
Residual calculation: 789 micros
Restriction: fw: 859.5 micros
Interpolation + add: 547 micros
VCycle: 15855 micros

------------------
-- size: 2047
BENCHMARK
Jacobi iteration: 468 micros
Residual calculation: 2625 micros
Restriction: fw: 3797 micros
Interpolation + add: 2938 micros
VCycle: 94840 micros

BENCHMARK
Jacobi iteration: 469 micros
Residual calculation: 2703 micros
Restriction: fw: 3797 micros
Interpolation + add: 2890 micros
VCycle: 94850 micros

BENCHMARK
Jacobi iteration: 469 micros
Residual calculation: 2672 micros
Restriction: fw: 3750 micros
Interpolation + add: 2891 micros
VCycle: 95150 micros

BlizzardOne
30-May-2007, 00:28
C2D X6800 @ Stock (2.93ghz)
2GB RAM @ DDR800
Crossfired X1950XTX's @ Stock, Cat 7.4's
Win XP Pro

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.1 micros
Residual calculation: 3.2 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.1 micros
VCycle: 375 micros


BENCHMARK
Jacobi iteration: 3.1 micros
Residual calculation: 3.1 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.2 micros
VCycle: 375 micros


BENCHMARK
Jacobi iteration: 3.1 micros
Residual calculation: 3.2 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.1 micros
VCycle: 375 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 3.55556 micros
Residual calculation: 1.66667 micros
Restriction: fw: 3.44444 micros
Interpolation + add: 3.55556 micros
VCycle: 607.778 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 3.55556 micros
Restriction: fw: 3.44444 micros
Interpolation + add: 3.44444 micros
VCycle: 607.778 micros


BENCHMARK
Jacobi iteration: 3.55556 micros
Residual calculation: 3.44444 micros
Restriction: fw: 1.77778 micros
Interpolation + add: 3.44444 micros
VCycle: 607.778 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 1.875 micros
Residual calculation: 4 micros
Restriction: fw: 3.875 micros
Interpolation + add: 3.875 micros
VCycle: 840 micros


BENCHMARK
Jacobi iteration: 3.875 micros
Residual calculation: 1.875 micros
Restriction: fw: 5.875 micros
Interpolation + add: 2 micros
VCycle: 858.75 micros


BENCHMARK
Jacobi iteration: 4 micros
Residual calculation: 3.875 micros
Restriction: fw: 3.875 micros
Interpolation + add: 2 micros
VCycle: 858.75 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 4.42857 micros
Residual calculation: 2.28571 micros
Restriction: fw: 6.71429 micros
Interpolation + add: 6.71429 micros
VCycle: 1115.71 micros


BENCHMARK
Jacobi iteration: 4.42857 micros
Residual calculation: 2.28571 micros
Restriction: fw: 6.71429 micros
Interpolation + add: 6.71429 micros
VCycle: 1115.71 micros


BENCHMARK
Jacobi iteration: 4.42857 micros
Residual calculation: 2.28571 micros
Restriction: fw: 6.71429 micros
Interpolation + add: 6.71429 micros
VCycle: 1115.71 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 15.6667 micros
Residual calculation: 13 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 7.66667 micros
VCycle: 1406.67 micros


BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 15.6667 micros
Restriction: fw: 13 micros
Interpolation + add: 10.5 micros
VCycle: 1406.67 micros


BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 13 micros
Restriction: fw: 15.6667 micros
Interpolation + add: 10.3333 micros
VCycle: 1406.67 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 46.8 micros
Residual calculation: 50 micros
Restriction: fw: 59.4 micros
Interpolation + add: 34.4 micros
VCycle: 1970 micros


BENCHMARK
Jacobi iteration: 46.8 micros
Residual calculation: 50 micros
Restriction: fw: 59.4 micros
Interpolation + add: 34.4 micros
VCycle: 2000 micros


BENCHMARK
Jacobi iteration: 46.8 micros
Residual calculation: 50 micros
Restriction: fw: 59.4 micros
Interpolation + add: 34.4 micros
VCycle: 2000 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 164 micros
Residual calculation: 195.5 micros
Restriction: fw: 234.25 micros
Interpolation + add: 152.25 micros
VCycle: 4650 micros


BENCHMARK
Jacobi iteration: 160.25 micros
Residual calculation: 199.25 micros
Restriction: fw: 234.25 micros
Interpolation + add: 152.25 micros
VCycle: 4610 micros


BENCHMARK
Jacobi iteration: 160.25 micros
Residual calculation: 199.25 micros
Restriction: fw: 234.25 micros
Interpolation + add: 152.5 micros
VCycle: 4647.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 505.333 micros
Residual calculation: 770.667 micros
Restriction: fw: 942.667 micros
Interpolation + add: 739.667 micros
VCycle: 18750 micros


BENCHMARK
Jacobi iteration: 505.333 micros
Residual calculation: 781.333 micros
Restriction: fw: 932.333 micros
Interpolation + add: 729 micros
VCycle: 18853.3 micros


BENCHMARK
Jacobi iteration: 500 micros
Residual calculation: 786.333 micros
Restriction: fw: 927 micros
Interpolation + add: 750 micros
VCycle: 18803.3 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 609 micros
Residual calculation: 1969 micros
Restriction: fw: 3765.5 micros
Interpolation + add: 3367.5 micros
VCycle: 98200 micros


BENCHMARK
Jacobi iteration: 601.5 micros
Residual calculation: 2055 micros
Restriction: fw: 3679.5 micros
Interpolation + add: 3289 micros
VCycle: 98985 micros


BENCHMARK
Jacobi iteration: 554.5 micros
Residual calculation: 2023.5 micros
Restriction: fw: 3758 micros
Interpolation + add: 3367 micros
VCycle: 98205 micros



------------------
-- size: 2047

PeterT
30-May-2007, 01:29
Interesting. The 2 Radeon results from XP seem to confirm my suspicion that the issue only appears on Vista. Now it would be nice to have someone with HD 2900XT and XP run the app. But probably everyone with such cutting-edge HW also runs the bleeding-edge OS. (Oh I'm killing myself)

Peter, I see that the program is single-threaded, any way you can make it take advantage of multiple cores?It's completely GPU limited - or at least the parallelizable part of it is - so that wouldn't help much. I have a very experimental GPU/CPU combined solver, but that also only runs the very coarse grid levels on the CPU. All the parts that are easily parallelized are faster on GPUs anyway.

Geeforcer
30-May-2007, 02:22
But probably everyone with such cutting-edge HW also runs the bleeding-edge OS. (Oh I'm killing myself)

LOL, some of us are still pretty happy in the XP land.

It's completely GPU limited - or at least the parallelizable part of it is - so that wouldn't help much. I have a very experimental GPU/CPU combined solver, but that also only runs the very coarse grid levels on the CPU. All the parts that are easily parallelized are faster on GPUs anyway.

I see... I was just asking because it drove CPU utilization on core 0 to 100%.

banksie
30-May-2007, 02:35
I'll try running it on my XP machine with a 2900 tonight when I get home from work. Going to be interesting as I have been having odd stability issues and I am wondering if my PSU isn't holding up....

This should test it nicely.

Anarchist4000
30-May-2007, 03:36
Not sure how good these results are. Might have interfered with the benchmarking a bit.

1600XT
WindowsXP

------------------
-- size: 3

BENCHMARK
Jacobi iteration: 4.6 micros
Residual calculation: 6.3 micros
Restriction: fw: 4.7 micros
Interpolation + add: 4.7 micros
VCycle: 562 micros


BENCHMARK
Jacobi iteration: 4.6 micros
Residual calculation: 4.7 micros
Restriction: fw: 6.3 micros
Interpolation + add: 4.7 micros
VCycle: 547 micros


BENCHMARK
Jacobi iteration: 6.2 micros
Residual calculation: 4.7 micros
Restriction: fw: 4.7 micros
Interpolation + add: 4.7 micros
VCycle: 563 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 5.11111 micros
Residual calculation: 5.22222 micros
Restriction: fw: 5.22222 micros
Interpolation + add: 5.22222 micros
VCycle: 937.778 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 5.22222 micros
Restriction: fw: 5.22222 micros
Interpolation + add: 5.22222 micros
VCycle: 972.222 micros


BENCHMARK
Jacobi iteration: 5.22222 micros
Residual calculation: 5.22222 micros
Restriction: fw: 5.22222 micros
Interpolation + add: 5.22222 micros
VCycle: 920 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 5.875 micros
Residual calculation: 5.875 micros
Restriction: fw: 3.875 micros
Interpolation + add: 5.875 micros
VCycle: 1406.25 micros


BENCHMARK
Jacobi iteration: 5.875 micros
Residual calculation: 7.75 micros
Restriction: fw: 5.875 micros
Interpolation + add: 3.875 micros
VCycle: 1308.75 micros


BENCHMARK
Jacobi iteration: 5.875 micros
Residual calculation: 5.875 micros
Restriction: fw: 3.875 micros
Interpolation + add: 5.875 micros
VCycle: 1347.5 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 11.1429 micros
Residual calculation: 11.1429 micros
Restriction: fw: 13.4286 micros
Interpolation + add: 6.71429 micros
VCycle: 1718.57 micros


BENCHMARK
Jacobi iteration: 13.2857 micros
Residual calculation: 11.1429 micros
Restriction: fw: 11.2857 micros
Interpolation + add: 11.1429 micros
VCycle: 1762.86 micros


BENCHMARK
Jacobi iteration: 11.1429 micros
Residual calculation: 11.1429 micros
Restriction: fw: 13.4286 micros
Interpolation + add: 6.71429 micros
VCycle: 1718.57 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 44.3333 micros
Residual calculation: 44.1667 micros
Restriction: fw: 47 micros
Interpolation + add: 41.6667 micros
VCycle: 2213.33 micros


BENCHMARK
Jacobi iteration: 44.1667 micros
Residual calculation: 44.3333 micros
Restriction: fw: 46.8333 micros
Interpolation + add: 28.6667 micros
VCycle: 2318.33 micros


BENCHMARK
Jacobi iteration: 44.3333 micros
Residual calculation: 41.6667 micros
Restriction: fw: 49.5 micros
Interpolation + add: 28.6667 micros
VCycle: 2370 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 334.2 micros
Residual calculation: 3.2 micros
Restriction: fw: 190.6 micros
Interpolation + add: 112.6 micros
VCycle: 3874 micros


BENCHMARK
Jacobi iteration: 162.6 micros
Residual calculation: 171.8 micros
Restriction: fw: 197 micros
Interpolation + add: 112.4 micros
VCycle: 3812 micros


BENCHMARK
Jacobi iteration: 162.6 micros
Residual calculation: 168.8 micros
Restriction: fw: 200 micros
Interpolation + add: 112.4 micros
VCycle: 3844 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 3.75 micros
Residual calculation: 468.75 micros
Restriction: fw: 804.75 micros
Interpolation + add: 511.75 micros
VCycle: 13672.5 micros


BENCHMARK
Jacobi iteration: 582 micros
Residual calculation: 668 micros
Restriction: fw: 816.5 micros
Interpolation + add: 500 micros
VCycle: 13672.5 micros


BENCHMARK
Jacobi iteration: 550.75 micros
Residual calculation: 1453.25 micros
Restriction: fw: 43 micros
Interpolation + add: 507.75 micros
VCycle: 13710 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 6510.67 micros
Residual calculation: 1812.33 micros
Restriction: fw: 2619.67 micros
Interpolation + add: 10.6667 micros
VCycle: 53123.3 micros


BENCHMARK
Jacobi iteration: 21 micros
Residual calculation: 343.667 micros
Restriction: fw: 6718.67 micros
Interpolation + add: 1010.33 micros
VCycle: 40676.7 micros


BENCHMARK
Jacobi iteration: 6755.33 micros
Residual calculation: 2843.67 micros
Restriction: fw: 1739.67 micros
Interpolation + add: 2343.67 micros
VCycle: 25833.3 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 12453 micros
Residual calculation: 14906 micros
Restriction: fw: 4328.5 micros
Interpolation + add: 8679.5 micros
VCycle: 145155 micros


BENCHMARK
Jacobi iteration: 12203 micros
Residual calculation: 10297 micros
Restriction: fw: 11539 micros
Interpolation + add: 9875 micros
VCycle: 101640 micros


BENCHMARK
Jacobi iteration: 8414.5 micros
Residual calculation: 12203 micros
Restriction: fw: 9882.5 micros
Interpolation + add: 5930 micros
VCycle: 135470 micros

With a 1900XT and Vista32 all the tests ran in about 15 micros. One thing I might suggest is to make sure VPU recover is disabled for the ATI systems. Doubt it will be an issue but it doesn't always like long running shaders.

RickCain
30-May-2007, 04:14
HD2900 running Vista 32-bit


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 10.8 micros
Residual calculation: 10.4 micros
Restriction: fw: 8 micros
Interpolation + add: 10.4 micros
VCycle: 206 micros


BENCHMARK
Jacobi iteration: 10.3 micros
Residual calculation: 10.5 micros
Restriction: fw: 7.7 micros
Interpolation + add: 10.4 micros
VCycle: 206 micros


BENCHMARK
Jacobi iteration: 10.3 micros
Residual calculation: 10.3 micros
Restriction: fw: 7.7 micros
Interpolation + add: 10.1 micros
VCycle: 202 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 10.5556 micros
Residual calculation: 10.4444 micros
Restriction: fw: 7.55556 micros
Interpolation + add: 10.6667 micros
VCycle: 326.667 micros


BENCHMARK
Jacobi iteration: 10.3333 micros
Residual calculation: 10.1111 micros
Restriction: fw: 7.66667 micros
Interpolation + add: 10.6667 micros
VCycle: 323.333 micros


BENCHMARK
Jacobi iteration: 10.7778 micros
Residual calculation: 10.1111 micros
Restriction: fw: 7.55556 micros
Interpolation + add: 10.2222 micros
VCycle: 343.333 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 10.75 micros
Residual calculation: 10.625 micros
Restriction: fw: 7.875 micros
Interpolation + add: 10.25 micros
VCycle: 460 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 10.25 micros
Restriction: fw: 7.625 micros
Interpolation + add: 10.875 micros
VCycle: 458.75 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 11.75 micros
Restriction: fw: 7.875 micros
Interpolation + add: 10.25 micros
VCycle: 450 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 10.4286 micros
Residual calculation: 10.4286 micros
Restriction: fw: 8.14286 micros
Interpolation + add: 10.7143 micros
VCycle: 584.286 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8.57143 micros
Interpolation + add: 10.5714 micros
VCycle: 585.714 micros


BENCHMARK
Jacobi iteration: 10.2857 micros
Residual calculation: 10.7143 micros
Restriction: fw: 7.71429 micros
Interpolation + add: 10.2857 micros
VCycle: 581.429 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 10.1667 micros
Restriction: fw: 7.83333 micros
Interpolation + add: 10 micros
VCycle: 695 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 10.3333 micros
Restriction: fw: 7.83333 micros
Interpolation + add: 10.1667 micros
VCycle: 698.333 micros


BENCHMARK
Jacobi iteration: 10.1667 micros
Residual calculation: 10.1667 micros
Restriction: fw: 7.33333 micros
Interpolation + add: 10.6667 micros
VCycle: 681.667 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 10.4 micros
Residual calculation: 10 micros
Restriction: fw: 8.2 micros
Interpolation + add: 10.2 micros
VCycle: 880 micros


BENCHMARK
Jacobi iteration: 10.4 micros
Residual calculation: 11.2 micros
Restriction: fw: 7.8 micros
Interpolation + add: 10.2 micros
VCycle: 838 micros


BENCHMARK
Jacobi iteration: 11.2 micros
Residual calculation: 10.4 micros
Restriction: fw: 7.6 micros
Interpolation + add: 12 micros
VCycle: 852 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 10.5 micros
Restriction: fw: 7.75 micros
Interpolation + add: 9.75 micros
VCycle: 952.5 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 10.5 micros
Restriction: fw: 8 micros
Interpolation + add: 10.25 micros
VCycle: 975 micros


BENCHMARK
Jacobi iteration: 13.75 micros
Residual calculation: 13 micros
Restriction: fw: 8.25 micros
Interpolation + add: 10.5 micros
VCycle: 1010 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 10.3333 micros
Residual calculation: 10.6667 micros
Restriction: fw: 8.66667 micros
Interpolation + add: 10.3333 micros
VCycle: 1100 micros


BENCHMARK
Jacobi iteration: 10 micros
Residual calculation: 11 micros
Restriction: fw: 7.33333 micros
Interpolation + add: 11 micros
VCycle: 1086.67 micros


BENCHMARK
Jacobi iteration: 10.3333 micros
Residual calculation: 11.3333 micros
Restriction: fw: 8 micros
Interpolation + add: 10.6667 micros
VCycle: 1053.33 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 10 micros
VCycle: 1235 micros


BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 10.5 micros
Restriction: fw: 9 micros
Interpolation + add: 11.5 micros
VCycle: 1280 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 10 micros
Restriction: fw: 8.5 micros
Interpolation + add: 10.5 micros
VCycle: 1220 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 10 micros
Residual calculation: 13 micros
Restriction: fw: 7 micros
Interpolation + add: 11 micros
VCycle: 1300 micros


BENCHMARK
Jacobi iteration: 10 micros
Residual calculation: 10 micros
Restriction: fw: 7 micros
Interpolation + add: 11 micros
VCycle: 1350 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 9 micros
Restriction: fw: 10 micros
Interpolation + add: 11 micros
VCycle: 1360 micros

Silent_Buddha
30-May-2007, 04:33
You`re also the only one getting results that are in line with what they should be(mostly). And I recall there was talk of a bug in the 8.38 making the fan spin-up even in desktop mode so...;)

Yeah, apparently there's a bug in the beta 8.38 release 7 driver that I tested where after running a 3d application the card would remain in HIGH 3D mode and fan would stay at 100% even on the desktop. (LOTS of heat coming out of the rear exhaust.)

I've since reverted to 8.37 and everything is fine again with it properly reverting to 2D speeds at the desktop.

I'll run this for you Peter in a moment with both 8.37 and 8.38. Not looking forward to putting 8.38 back on again but Neliz's run appeared to be going maybe it'll complete for me.

And damn? No 3d clocks in windowed modes? /sigh. I run any game I can in windowed mode when possible. Vanguard, EQ2, Supreme Commander...

Regards,
SB

BRiT
30-May-2007, 05:02
Gave it a run on an intermeddiate system while I sort out issues on the new system.

Vista 64bit run in "Test mode" for experimental drivers
MSI 8800 GTX HD OC (air cooled model) running stock clocks
Forceware 158.43 drivers
AMD X2 4400+ @ 2.5Ghz
2Gig Mem

Not sure what to make of these results, as I'm getting some odd behavior with respect to performance in Lost Planet DX10 demo.


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 4.7 micros
Residual calculation: 3.1 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.2 micros
VCycle: 6359 micros


BENCHMARK
Jacobi iteration: 3.2 micros
Residual calculation: 3.1 micros
Restriction: fw: 4.7 micros
Interpolation + add: 3.1 micros
VCycle: 6469 micros


BENCHMARK
Jacobi iteration: 3.2 micros
Residual calculation: 3.1 micros
Restriction: fw: 3.1 micros
Interpolation + add: 3.1 micros
VCycle: 6391 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 5.22222 micros
Residual calculation: 1.77778 micros
Restriction: fw: 3.44444 micros
Interpolation + add: 3.44444 micros
VCycle: 10555.6 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 3.44444 micros
Restriction: fw: 3.55556 micros
Interpolation + add: 3.44444 micros
VCycle: 10660 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 3.44444 micros
Restriction: fw: 3.44444 micros
Interpolation + add: 3.55556 micros
VCycle: 10972.2 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 2 micros
Residual calculation: 3.875 micros
Restriction: fw: 3.875 micros
Interpolation + add: 2 micros
VCycle: 15195 micros


BENCHMARK
Jacobi iteration: 4 micros
Residual calculation: 3.875 micros
Restriction: fw: 3.875 micros
Interpolation + add: 2 micros
VCycle: 15605 micros


BENCHMARK
Jacobi iteration: 3.875 micros
Residual calculation: 2 micros
Restriction: fw: 3.875 micros
Interpolation + add: 3.875 micros
VCycle: 15821.3 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 4.42857 micros
Residual calculation: 2.28571 micros
Restriction: fw: 4.42857 micros
Interpolation + add: 2.14286 micros
VCycle: 19888.6 micros


BENCHMARK
Jacobi iteration: 2.28571 micros
Residual calculation: 4.42857 micros
Restriction: fw: 4.42857 micros
Interpolation + add: 2.28571 micros
VCycle: 19420 micros


BENCHMARK
Jacobi iteration: 2.28571 micros
Residual calculation: 4.42857 micros
Restriction: fw: 2.28571 micros
Interpolation + add: 4.42857 micros
VCycle: 19732.9 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 2.66667 micros
Residual calculation: 5.16667 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 2.5 micros
VCycle: 24350 micros


BENCHMARK
Jacobi iteration: 2.5 micros
Residual calculation: 5.33333 micros
Restriction: fw: 2.5 micros
Interpolation + add: 2.66667 micros
VCycle: 23671.7 micros


BENCHMARK
Jacobi iteration: 2.66667 micros
Residual calculation: 5.16667 micros
Restriction: fw: 2.5 micros
Interpolation + add: 5.33333 micros
VCycle: 24661.7 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 3.2 micros
Residual calculation: 3.2 micros
Restriction: fw: 6.2 micros
Interpolation + add: 12.4 micros
VCycle: 29250 micros


BENCHMARK
Jacobi iteration: 3.2 micros
Residual calculation: 3 micros
Restriction: fw: 6.2 micros
Interpolation + add: 12.6 micros
VCycle: 28718 micros


BENCHMARK
Jacobi iteration: 3 micros
Residual calculation: 3.2 micros
Restriction: fw: 9.4 micros
Interpolation + add: 9.4 micros
VCycle: 28406 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 4 micros
Residual calculation: 11.75 micros
Restriction: fw: 46.75 micros
Interpolation + add: 43 micros
VCycle: 34882.5 micros


BENCHMARK
Jacobi iteration: 4 micros
Residual calculation: 15.5 micros
Restriction: fw: 43 micros
Interpolation + add: 43 micros
VCycle: 35235 micros


BENCHMARK
Jacobi iteration: 4 micros
Residual calculation: 11.75 micros
Restriction: fw: 46.75 micros
Interpolation + add: 43 micros
VCycle: 34220 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 5.33333 micros
Residual calculation: 5.33333 micros
Restriction: fw: 135.333 micros
Interpolation + add: 192.667 micros
VCycle: 44896.7 micros


BENCHMARK
Jacobi iteration: 5 micros
Residual calculation: 5.33333 micros
Restriction: fw: 135.333 micros
Interpolation + add: 192.667 micros
VCycle: 44686.7 micros


BENCHMARK
Jacobi iteration: 5 micros
Residual calculation: 0 micros
Restriction: fw: 140.667 micros
Interpolation + add: 192.667 micros
VCycle: 45260 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 8 micros
Interpolation + add: 578 micros
VCycle: 82970 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 7.5 micros
Restriction: fw: 0 micros
Interpolation + add: 586 micros
VCycle: 83515 micros


BENCHMARK
Jacobi iteration: 7.5 micros
Residual calculation: 0 micros
Restriction: fw: 8 micros
Interpolation + add: 586 micros
VCycle: 82810 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 237190 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 16 micros
VCycle: 236410 micros


BENCHMARK
Jacobi iteration: 15 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 238220 micros

Silent_Buddha
30-May-2007, 05:17
Good news and Bad news. 8.37 is obviously bugged, I'm including the scores anyway just for completeness. 8.38 release 7 however your program appears to work fine.

All runs were done on a core2duo, 4 gigs mem, and Vista 64.

[Edit - ooops HD 2900 XT]

8.37.4.3 - default 2d clocks
GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 11.2 micros
Residual calculation: 10.9 micros
Restriction: fw: 8.1 micros
Interpolation + add: 10.8 micros
VCycle: 212 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 10.9 micros
Restriction: fw: 8.1 micros
Interpolation + add: 11 micros
VCycle: 211 micros


BENCHMARK
Jacobi iteration: 11.2 micros
Residual calculation: 11.3 micros
Restriction: fw: 8.1 micros
Interpolation + add: 10.8 micros
VCycle: 210 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 11.1111 micros
Residual calculation: 11.2222 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11.2222 micros
VCycle: 343.333 micros


BENCHMARK
Jacobi iteration: 11.1111 micros
Residual calculation: 11 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11 micros
VCycle: 344.444 micros


BENCHMARK
Jacobi iteration: 11.1111 micros
Residual calculation: 11.1111 micros
Restriction: fw: 8.44444 micros
Interpolation + add: 11 micros
VCycle: 344.444 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 11.125 micros
Residual calculation: 10.875 micros
Restriction: fw: 8.125 micros
Interpolation + add: 10.875 micros
VCycle: 475 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.125 micros
Restriction: fw: 8.125 micros
Interpolation + add: 10.875 micros
VCycle: 481.25 micros


BENCHMARK
Jacobi iteration: 11.125 micros
Residual calculation: 11.375 micros
Restriction: fw: 8.375 micros
Interpolation + add: 10.875 micros
VCycle: 475 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 11.2857 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8.28571 micros
Interpolation + add: 10.7143 micros
VCycle: 605.714 micros


BENCHMARK
Jacobi iteration: 11.1429 micros
Residual calculation: 11 micros
Restriction: fw: 8.57143 micros
Interpolation + add: 10.8571 micros
VCycle: 605.714 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8.28571 micros
Interpolation + add: 10.7143 micros
VCycle: 618.571 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8.16667 micros
Interpolation + add: 11.1667 micros
VCycle: 750 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 751.667 micros


BENCHMARK
Jacobi iteration: 11.1667 micros
Residual calculation: 11 micros
Restriction: fw: 8.16667 micros
Interpolation + add: 11 micros
VCycle: 740 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 10.8 micros
VCycle: 868 micros


BENCHMARK
Jacobi iteration: 10.8 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 11 micros
VCycle: 870 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.4 micros
Interpolation + add: 10.8 micros
VCycle: 878 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1012.5 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11 micros
VCycle: 1017.5 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 10.75 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11 micros
VCycle: 1010 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.3333 micros
Restriction: fw: 8.66667 micros
Interpolation + add: 12 micros
VCycle: 1173.33 micros


BENCHMARK
Jacobi iteration: 12.3333 micros
Residual calculation: 11.6667 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1143.33 micros


BENCHMARK
Jacobi iteration: 11.3333 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1140 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11 micros
VCycle: 1295 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 11.5 micros
Restriction: fw: 7.5 micros
Interpolation + add: 11 micros
VCycle: 1275 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 11 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11 micros
VCycle: 1280 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 10 micros
Residual calculation: 12 micros
Restriction: fw: 9 micros
Interpolation + add: 11 micros
VCycle: 1400 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 9 micros
Interpolation + add: 10 micros
VCycle: 1440 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 7 micros
Interpolation + add: 12 micros
VCycle: 1400 micros


Beta 8.38 release 7 - default 2d clocks.
GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.9 micros
Restriction: fw: 8 micros
Interpolation + add: 11.1 micros
VCycle: 245 micros


BENCHMARK
Jacobi iteration: 11.1 micros
Residual calculation: 10.9 micros
Restriction: fw: 8 micros
Interpolation + add: 10.9 micros
VCycle: 247 micros


BENCHMARK
Jacobi iteration: 11.1 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 10.7 micros
VCycle: 246 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 10.8889 micros
Residual calculation: 10.6667 micros
Restriction: fw: 8.11111 micros
Interpolation + add: 10.8889 micros
VCycle: 400 micros


BENCHMARK
Jacobi iteration: 10.7778 micros
Residual calculation: 10.6667 micros
Restriction: fw: 7.88889 micros
Interpolation + add: 10.8889 micros
VCycle: 401.111 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.7778 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8889 micros
VCycle: 402.222 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 10.875 micros
Residual calculation: 10.875 micros
Restriction: fw: 7.875 micros
Interpolation + add: 10.875 micros
VCycle: 552.5 micros


BENCHMARK
Jacobi iteration: 10.75 micros
Residual calculation: 10.75 micros
Restriction: fw: 8 micros
Interpolation + add: 10.875 micros
VCycle: 555 micros


BENCHMARK
Jacobi iteration: 10.75 micros
Residual calculation: 10.875 micros
Restriction: fw: 8.125 micros
Interpolation + add: 10.75 micros
VCycle: 551.25 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 10.7143 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8571 micros
VCycle: 725.714 micros


BENCHMARK
Jacobi iteration: 11.2857 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 712.857 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2857 micros
Restriction: fw: 8.14286 micros
Interpolation + add: 11 micros
VCycle: 714.286 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8 micros
Interpolation + add: 10.6667 micros
VCycle: 861.667 micros


BENCHMARK
Jacobi iteration: 10.3333 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8 micros
Interpolation + add: 11.1667 micros
VCycle: 860 micros


BENCHMARK
Jacobi iteration: 11.1667 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8333 micros
VCycle: 888.333 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11.6 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.6 micros
Interpolation + add: 11.2 micros
VCycle: 1090 micros


BENCHMARK
Jacobi iteration: 10.8 micros
Residual calculation: 10.6 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8 micros
VCycle: 1056 micros


BENCHMARK
Jacobi iteration: 10.6 micros
Residual calculation: 10.8 micros
Restriction: fw: 7.6 micros
Interpolation + add: 10.8 micros
VCycle: 1046 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 7.75 micros
Interpolation + add: 10.75 micros
VCycle: 4290 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 10.75 micros
Restriction: fw: 8 micros
Interpolation + add: 10.75 micros
VCycle: 4590 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 10.75 micros
Restriction: fw: 7.75 micros
Interpolation + add: 10.75 micros
VCycle: 4525 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.6667 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 20886.7 micros


BENCHMARK
Jacobi iteration: 16.3333 micros
Residual calculation: 13.6667 micros
Restriction: fw: 7.66667 micros
Interpolation + add: 10.6667 micros
VCycle: 22576.7 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.6667 micros
Restriction: fw: 11.6667 micros
Interpolation + add: 12.3333 micros
VCycle: 22606.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 15.5 micros
Residual calculation: 16.5 micros
Restriction: fw: 11.5 micros
Interpolation + add: 12 micros
VCycle: 82325 micros


BENCHMARK
Jacobi iteration: 40 micros
Residual calculation: 37 micros
Restriction: fw: 40 micros
Interpolation + add: 41.5 micros
VCycle: 89840 micros


BENCHMARK
Jacobi iteration: 40 micros
Residual calculation: 36.5 micros
Restriction: fw: 40.5 micros
Interpolation + add: 41 micros
VCycle: 89760 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 10 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1600 micros


BENCHMARK
Jacobi iteration: 6648 micros
Residual calculation: 6009 micros
Restriction: fw: 2761 micros
Interpolation + add: 3047 micros
VCycle: 50800 micros


BENCHMARK
Jacobi iteration: 6804 micros
Residual calculation: 5847 micros
Restriction: fw: 2271 micros
Interpolation + add: 3460 micros
VCycle: 50760 micros


Regards,
SB

SugarCoat
30-May-2007, 05:44
Windows Server 2003 64Bit (build 3790, Service Pack 2)
Intel(R) Core(TM)2 CPU 6300 @ 2.407GHz
NVIDIA GeForce 8800 GTX 6.14.11.5819 768Mb


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.1 micros
Residual calculation: 3.1 micros
Restriction: fw: 1.6 micros
Interpolation + add: 1.5 micros
VCycle: 547 micros


BENCHMARK
Jacobi iteration: 1.5 micros
Residual calculation: 3.2 micros
Restriction: fw: 1.5 micros
Interpolation + add: 3.2 micros
VCycle: 546 micros


BENCHMARK
Jacobi iteration: 3.2 micros
Residual calculation: 3.1 micros
Restriction: fw: 1.5 micros
Interpolation + add: 3.2 micros
VCycle: 547 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 1.77778 micros
Restriction: fw: 3.44444 micros
Interpolation + add: 1.77778 micros
VCycle: 920 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 1.77778 micros
Restriction: fw: 1.66667 micros
Interpolation + add: 3.55556 micros
VCycle: 920 micros


BENCHMARK
Jacobi iteration: 3.44444 micros
Residual calculation: 1.66667 micros
Restriction: fw: 3.55556 micros
Interpolation + add: 1.66667 micros
VCycle: 937.778 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 3.875 micros
Residual calculation: 1.875 micros
Restriction: fw: 2 micros
Interpolation + add: 2 micros
VCycle: 1288.75 micros


BENCHMARK
Jacobi iteration: 1.875 micros
Residual calculation: 4 micros
Restriction: fw: 1.875 micros
Interpolation + add: 2 micros
VCycle: 1288.75 micros


BENCHMARK
Jacobi iteration: 1.875 micros
Residual calculation: 2 micros
Restriction: fw: 2 micros
Interpolation + add: 1.875 micros
VCycle: 1308.75 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.28571 micros
Residual calculation: 4.42857 micros
Restriction: fw: 2.28571 micros
Interpolation + add: 2.14286 micros
VCycle: 1652.86 micros


BENCHMARK
Jacobi iteration: 2.28571 micros
Residual calculation: 4.42857 micros
Restriction: fw: 2.28571 micros
Interpolation + add: 2.28571 micros
VCycle: 1672.86 micros


BENCHMARK
Jacobi iteration: 2.28571 micros
Residual calculation: 2.28571 micros
Restriction: fw: 2.14286 micros
Interpolation + add: 2.28571 micros
VCycle: 1674.29 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 5.33333 micros
Residual calculation: 2.5 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 2.66667 micros
VCycle: 2056.67 micros


BENCHMARK
Jacobi iteration: 2.5 micros
Residual calculation: 5.33333 micros
Restriction: fw: 2.5 micros
Interpolation + add: 2.66667 micros
VCycle: 2056.67 micros


BENCHMARK
Jacobi iteration: 2.5 micros
Residual calculation: 2.66667 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 2.5 micros
VCycle: 2058.33 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 12.4 micros
Residual calculation: 9.4 micros
Restriction: fw: 12.6 micros
Interpolation + add: 6.2 micros
VCycle: 2468 micros


BENCHMARK
Jacobi iteration: 9.4 micros
Residual calculation: 12.6 micros
Restriction: fw: 9.2 micros
Interpolation + add: 6.4 micros
VCycle: 2468 micros


BENCHMARK
Jacobi iteration: 9.4 micros
Residual calculation: 12.4 micros
Restriction: fw: 9.4 micros
Interpolation + add: 6.2 micros
VCycle: 2470 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 46.75 micros
Residual calculation: 43 micros
Restriction: fw: 43 micros
Interpolation + add: 31.25 micros
VCycle: 3085 micros


BENCHMARK
Jacobi iteration: 46.75 micros
Residual calculation: 47 micros
Restriction: fw: 39 micros
Interpolation + add: 31.25 micros
VCycle: 3047.5 micros


BENCHMARK
Jacobi iteration: 47 micros
Residual calculation: 43 micros
Restriction: fw: 42.75 micros
Interpolation + add: 31.25 micros
VCycle: 3047.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 182.333 micros
Residual calculation: 182.333 micros
Restriction: fw: 182.333 micros
Interpolation + add: 114.333 micros
VCycle: 4533.33 micros


BENCHMARK
Jacobi iteration: 187.333 micros
Residual calculation: 182.333 micros
Restriction: fw: 177.333 micros
Interpolation + add: 119.667 micros
VCycle: 4530 micros


BENCHMARK
Jacobi iteration: 187.333 micros
Residual calculation: 177 micros
Restriction: fw: 182.333 micros
Interpolation + add: 120 micros
VCycle: 4476.67 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 750 micros
Residual calculation: 726.5 micros
Restriction: fw: 797 micros
Interpolation + add: 523.5 micros
VCycle: 15390 micros


BENCHMARK
Jacobi iteration: 750 micros
Residual calculation: 719 micros
Restriction: fw: 804.5 micros
Interpolation + add: 515.5 micros
VCycle: 15390 micros


BENCHMARK
Jacobi iteration: 742 micros
Residual calculation: 734.5 micros
Restriction: fw: 789 micros
Interpolation + add: 523.5 micros
VCycle: 15390 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 437 micros
Residual calculation: 2406 micros
Restriction: fw: 3375 micros
Interpolation + add: 2704 micros
VCycle: 92650 micros


BENCHMARK
Jacobi iteration: 422 micros
Residual calculation: 2391 micros
Restriction: fw: 3422 micros
Interpolation + add: 2734 micros
VCycle: 92660 micros


BENCHMARK
Jacobi iteration: 422 micros
Residual calculation: 2422 micros
Restriction: fw: 3390 micros
Interpolation + add: 2781 micros
VCycle: 92500 micros



But at least it seems to work right for you. And may I add that I hate you for being able to use the 8.38s(Nah, just kidding:) ). I`d like to second what Dave said:it certainly is NOT running at 3D speeds-the fan doesn`t spin-up at all.

They havent run in 3D clocks without fullscreen since they started doing dynamic clocks. Its something that has annoyed me since the introduction of the X1800 and i've commented on a couple times. The fact that the card stays at 2D clocks even with a windowed 3D app strikes me as lazy.

Silent_Buddha
30-May-2007, 06:02
They havent run in 3D clocks without fullscreen since they started doing dynamic clocks. Its something that has annoyed me since the introduction of the X1800 and i've commented on a couple times. The fact that the card stays at 2D clocks even with a windowed 3D app strikes me as lazy.

Yeah, not to happy about that. Especially now with Vista and the fact that a window that spans multiple displays remains fully 3d accelerated.

Regards,
SB

SugarCoat
30-May-2007, 06:33
Yeah, not to happy about that. Especially now with Vista and the fact that a window that spans multiple displays remains fully 3d accelerated.

Regards,
SB

i use to run a few instances of games windowed when i first noticed it by running rivatuner in the background monitoring the hardware. I didnt even think to check to be honest, you'd think that they wouldnt be so lazy not to fix/include something like that, yet here we are, coming on the 2 year mark since the introduction of the R520...apparently nobody cares. At least not enough to make a fuss and get it fixed in a release or two. As it stands its problably something we wont see resolved until the R7xx refresh cards, it will be a small, size 2 font, notation in the driver release notes. :lol:

AlexV
30-May-2007, 07:41
Good news and Bad news. 8.37 is obviously bugged, I'm including the scores anyway just for completeness. 8.38 release 7 however your program appears to work fine.

All runs were done on a core2duo, 4 gigs mem, and Vista 64.

[Edit - ooops HD 2900 XT]

8.37.4.3 - default 2d clocks
GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 11.2 micros
Residual calculation: 10.9 micros
Restriction: fw: 8.1 micros
Interpolation + add: 10.8 micros
VCycle: 212 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 10.9 micros
Restriction: fw: 8.1 micros
Interpolation + add: 11 micros
VCycle: 211 micros


BENCHMARK
Jacobi iteration: 11.2 micros
Residual calculation: 11.3 micros
Restriction: fw: 8.1 micros
Interpolation + add: 10.8 micros
VCycle: 210 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 11.1111 micros
Residual calculation: 11.2222 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11.2222 micros
VCycle: 343.333 micros


BENCHMARK
Jacobi iteration: 11.1111 micros
Residual calculation: 11 micros
Restriction: fw: 8.22222 micros
Interpolation + add: 11 micros
VCycle: 344.444 micros


BENCHMARK
Jacobi iteration: 11.1111 micros
Residual calculation: 11.1111 micros
Restriction: fw: 8.44444 micros
Interpolation + add: 11 micros
VCycle: 344.444 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 11.125 micros
Residual calculation: 10.875 micros
Restriction: fw: 8.125 micros
Interpolation + add: 10.875 micros
VCycle: 475 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.125 micros
Restriction: fw: 8.125 micros
Interpolation + add: 10.875 micros
VCycle: 481.25 micros


BENCHMARK
Jacobi iteration: 11.125 micros
Residual calculation: 11.375 micros
Restriction: fw: 8.375 micros
Interpolation + add: 10.875 micros
VCycle: 475 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 11.2857 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8.28571 micros
Interpolation + add: 10.7143 micros
VCycle: 605.714 micros


BENCHMARK
Jacobi iteration: 11.1429 micros
Residual calculation: 11 micros
Restriction: fw: 8.57143 micros
Interpolation + add: 10.8571 micros
VCycle: 605.714 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8.28571 micros
Interpolation + add: 10.7143 micros
VCycle: 618.571 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8.16667 micros
Interpolation + add: 11.1667 micros
VCycle: 750 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 751.667 micros


BENCHMARK
Jacobi iteration: 11.1667 micros
Residual calculation: 11 micros
Restriction: fw: 8.16667 micros
Interpolation + add: 11 micros
VCycle: 740 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 10.8 micros
VCycle: 868 micros


BENCHMARK
Jacobi iteration: 10.8 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.2 micros
Interpolation + add: 11 micros
VCycle: 870 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.4 micros
Interpolation + add: 10.8 micros
VCycle: 878 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1012.5 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11 micros
VCycle: 1017.5 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 10.75 micros
Restriction: fw: 8.25 micros
Interpolation + add: 11 micros
VCycle: 1010 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.3333 micros
Restriction: fw: 8.66667 micros
Interpolation + add: 12 micros
VCycle: 1173.33 micros


BENCHMARK
Jacobi iteration: 12.3333 micros
Residual calculation: 11.6667 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1143.33 micros


BENCHMARK
Jacobi iteration: 11.3333 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1140 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 11 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11 micros
VCycle: 1295 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 11.5 micros
Restriction: fw: 7.5 micros
Interpolation + add: 11 micros
VCycle: 1275 micros


BENCHMARK
Jacobi iteration: 10.5 micros
Residual calculation: 11 micros
Restriction: fw: 8.5 micros
Interpolation + add: 11 micros
VCycle: 1280 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 10 micros
Residual calculation: 12 micros
Restriction: fw: 9 micros
Interpolation + add: 11 micros
VCycle: 1400 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 9 micros
Interpolation + add: 10 micros
VCycle: 1440 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 7 micros
Interpolation + add: 12 micros
VCycle: 1400 micros


Beta 8.38 release 7 - default 2d clocks.
GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.9 micros
Restriction: fw: 8 micros
Interpolation + add: 11.1 micros
VCycle: 245 micros


BENCHMARK
Jacobi iteration: 11.1 micros
Residual calculation: 10.9 micros
Restriction: fw: 8 micros
Interpolation + add: 10.9 micros
VCycle: 247 micros


BENCHMARK
Jacobi iteration: 11.1 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 10.7 micros
VCycle: 246 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 10.8889 micros
Residual calculation: 10.6667 micros
Restriction: fw: 8.11111 micros
Interpolation + add: 10.8889 micros
VCycle: 400 micros


BENCHMARK
Jacobi iteration: 10.7778 micros
Residual calculation: 10.6667 micros
Restriction: fw: 7.88889 micros
Interpolation + add: 10.8889 micros
VCycle: 401.111 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.7778 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8889 micros
VCycle: 402.222 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 10.875 micros
Residual calculation: 10.875 micros
Restriction: fw: 7.875 micros
Interpolation + add: 10.875 micros
VCycle: 552.5 micros


BENCHMARK
Jacobi iteration: 10.75 micros
Residual calculation: 10.75 micros
Restriction: fw: 8 micros
Interpolation + add: 10.875 micros
VCycle: 555 micros


BENCHMARK
Jacobi iteration: 10.75 micros
Residual calculation: 10.875 micros
Restriction: fw: 8.125 micros
Interpolation + add: 10.75 micros
VCycle: 551.25 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 10.7143 micros
Residual calculation: 11.1429 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8571 micros
VCycle: 725.714 micros


BENCHMARK
Jacobi iteration: 11.2857 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 712.857 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.2857 micros
Restriction: fw: 8.14286 micros
Interpolation + add: 11 micros
VCycle: 714.286 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8 micros
Interpolation + add: 10.6667 micros
VCycle: 861.667 micros


BENCHMARK
Jacobi iteration: 10.3333 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8 micros
Interpolation + add: 11.1667 micros
VCycle: 860 micros


BENCHMARK
Jacobi iteration: 11.1667 micros
Residual calculation: 10.8333 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8333 micros
VCycle: 888.333 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11.6 micros
Residual calculation: 11.2 micros
Restriction: fw: 8.6 micros
Interpolation + add: 11.2 micros
VCycle: 1090 micros


BENCHMARK
Jacobi iteration: 10.8 micros
Residual calculation: 10.6 micros
Restriction: fw: 8 micros
Interpolation + add: 10.8 micros
VCycle: 1056 micros


BENCHMARK
Jacobi iteration: 10.6 micros
Residual calculation: 10.8 micros
Restriction: fw: 7.6 micros
Interpolation + add: 10.8 micros
VCycle: 1046 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11 micros
Restriction: fw: 7.75 micros
Interpolation + add: 10.75 micros
VCycle: 4290 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 10.75 micros
Restriction: fw: 8 micros
Interpolation + add: 10.75 micros
VCycle: 4590 micros


BENCHMARK
Jacobi iteration: 11.25 micros
Residual calculation: 10.75 micros
Restriction: fw: 7.75 micros
Interpolation + add: 10.75 micros
VCycle: 4525 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 10.6667 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 20886.7 micros


BENCHMARK
Jacobi iteration: 16.3333 micros
Residual calculation: 13.6667 micros
Restriction: fw: 7.66667 micros
Interpolation + add: 10.6667 micros
VCycle: 22576.7 micros


BENCHMARK
Jacobi iteration: 11 micros
Residual calculation: 11.6667 micros
Restriction: fw: 11.6667 micros
Interpolation + add: 12.3333 micros
VCycle: 22606.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 15.5 micros
Residual calculation: 16.5 micros
Restriction: fw: 11.5 micros
Interpolation + add: 12 micros
VCycle: 82325 micros


BENCHMARK
Jacobi iteration: 40 micros
Residual calculation: 37 micros
Restriction: fw: 40 micros
Interpolation + add: 41.5 micros
VCycle: 89840 micros


BENCHMARK
Jacobi iteration: 40 micros
Residual calculation: 36.5 micros
Restriction: fw: 40.5 micros
Interpolation + add: 41 micros
VCycle: 89760 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 10 micros
Residual calculation: 11 micros
Restriction: fw: 8 micros
Interpolation + add: 11 micros
VCycle: 1600 micros


BENCHMARK
Jacobi iteration: 6648 micros
Residual calculation: 6009 micros
Restriction: fw: 2761 micros
Interpolation + add: 3047 micros
VCycle: 50800 micros


BENCHMARK
Jacobi iteration: 6804 micros
Residual calculation: 5847 micros
Restriction: fw: 2271 micros
Interpolation + add: 3460 micros
VCycle: 50760 micros


Regards,
SB

It`s still screwy for 2047...too fast. Otherwise everything seems in line with what(I guess) should be happening. Interesting.

banksie
30-May-2007, 09:12
Here ya go :-

Windows XP Pro
AMD A64 3000+ (Socket 754)
Radeon HD 2900 XT
Catalyst 8.37.4.3-070510a-047096E-ATI

All stock clocks.


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 14 micros
Residual calculation: 14.1 micros
Restriction: fw: 9.4 micros
Interpolation + add: 14 micros
VCycle: 282 micros


BENCHMARK
Jacobi iteration: 14 micros
Residual calculation: 14.1 micros
Restriction: fw: 9.4 micros
Interpolation + add: 14 micros
VCycle: 282 micros


BENCHMARK
Jacobi iteration: 14.1 micros
Residual calculation: 14 micros
Restriction: fw: 11 micros
Interpolation + add: 14 micros
VCycle: 282 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 13.8889 micros
Residual calculation: 13.8889 micros
Restriction: fw: 10.4444 micros
Interpolation + add: 15.5556 micros
VCycle: 434.444 micros


BENCHMARK
Jacobi iteration: 13.8889 micros
Residual calculation: 15.5556 micros
Restriction: fw: 10.4444 micros
Interpolation + add: 13.8889 micros
VCycle: 451.111 micros


BENCHMARK
Jacobi iteration: 15.6667 micros
Residual calculation: 13.8889 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 13.8889 micros
VCycle: 468.889 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 13.75 micros
Residual calculation: 13.625 micros
Restriction: fw: 11.75 micros
Interpolation + add: 13.625 micros
VCycle: 606.25 micros


BENCHMARK
Jacobi iteration: 13.75 micros
Residual calculation: 13.625 micros
Restriction: fw: 11.75 micros
Interpolation + add: 13.625 micros
VCycle: 606.25 micros


BENCHMARK
Jacobi iteration: 13.75 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.75 micros
Interpolation + add: 13.625 micros
VCycle: 606.25 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 15.5714 micros
Residual calculation: 13.4286 micros
Restriction: fw: 11.1429 micros
Interpolation + add: 13.4286 micros
VCycle: 781.429 micros


BENCHMARK
Jacobi iteration: 13.4286 micros
Residual calculation: 15.5714 micros
Restriction: fw: 11.2857 micros
Interpolation + add: 13.2857 micros
VCycle: 781.429 micros


BENCHMARK
Jacobi iteration: 15.5714 micros
Residual calculation: 13.4286 micros
Restriction: fw: 11.1429 micros
Interpolation + add: 13.4286 micros
VCycle: 781.429 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 15.6667 micros
Restriction: fw: 10.5 micros
Interpolation + add: 13 micros
VCycle: 963.333 micros


BENCHMARK
Jacobi iteration: 13 micros
Residual calculation: 13 micros
Restriction: fw: 10.5 micros
Interpolation + add: 15.6667 micros
VCycle: 988.333 micros


BENCHMARK
Jacobi iteration: 15.6667 micros
Residual calculation: 15.6667 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 15.6667 micros
VCycle: 963.333 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 12.6 micros
Residual calculation: 15.6 micros
Restriction: fw: 9.4 micros
Interpolation + add: 15.6 micros
VCycle: 1124 micros


BENCHMARK
Jacobi iteration: 15.8 micros
Residual calculation: 12.4 micros
Restriction: fw: 12.6 micros
Interpolation + add: 12.4 micros
VCycle: 1156 micros


BENCHMARK
Jacobi iteration: 15.6 micros
Residual calculation: 12.6 micros
Restriction: fw: 12.4 micros
Interpolation + add: 12.6 micros
VCycle: 1124 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 11.75 micros
Residual calculation: 15.5 micros
Restriction: fw: 8 micros
Interpolation + add: 15.5 micros
VCycle: 1290 micros


BENCHMARK
Jacobi iteration: 15.75 micros
Residual calculation: 15.5 micros
Restriction: fw: 7.75 micros
Interpolation + add: 15.75 micros
VCycle: 1290 micros


BENCHMARK
Jacobi iteration: 11.5 micros
Residual calculation: 15.75 micros
Restriction: fw: 7.75 micros
Interpolation + add: 15.75 micros
VCycle: 1327.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 10.6667 micros
Residual calculation: 15.6667 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 15.6667 micros
VCycle: 1456.67 micros


BENCHMARK
Jacobi iteration: 15.6667 micros
Residual calculation: 15.6667 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 15.6667 micros
VCycle: 1460 micros


BENCHMARK
Jacobi iteration: 10.6667 micros
Residual calculation: 15.6667 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 15.6667 micros
VCycle: 1456.67 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 8 micros
Residual calculation: 15.5 micros
Restriction: fw: 16 micros
Interpolation + add: 7.5 micros
VCycle: 1640 micros


BENCHMARK
Jacobi iteration: 15.5 micros
Residual calculation: 15.5 micros
Restriction: fw: 8 micros
Interpolation + add: 15.5 micros
VCycle: 1640 micros


BENCHMARK
Jacobi iteration: 15.5 micros
Residual calculation: 15.5 micros
Restriction: fw: 8 micros
Interpolation + add: 15.5 micros
VCycle: 1720 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 16 micros
Residual calculation: 16 micros
Restriction: fw: 0 micros
Interpolation + add: 15 micros
VCycle: 1880 micros


BENCHMARK
Jacobi iteration: 15 micros
Residual calculation: 16 micros
Restriction: fw: 0 micros
Interpolation + add: 15 micros
VCycle: 1880 micros


BENCHMARK
Jacobi iteration: 16 micros
Residual calculation: 15 micros
Restriction: fw: 16 micros
Interpolation + add: 16 micros
VCycle: 1870 micros

AlexV
30-May-2007, 09:42
It seems the 8.37 drivers are the culprits, with the 8.38s doing what should be done...so ATi, now you actually have a valid reason for releasing new Catalysts...this man's master thesis must have valid benchmarks backing it up!:D

SuperBORG
30-May-2007, 10:48
HD 2900XT 512MB Stock Clocks
Catalyst 8.39
Vista 64bit
4GB RAM


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 7.8 micros
Residual calculation: 7.8 micros
Restriction: fw: 6.2 micros
Interpolation + add: 7.8 micros
VCycle: 827 micros


BENCHMARK
Jacobi iteration: 7.8 micros
Residual calculation: 6.2 micros
Restriction: fw: 7.8 micros
Interpolation + add: 6.2 micros
VCycle: 827 micros


BENCHMARK
Jacobi iteration: 7.8 micros
Residual calculation: 6.2 micros
Restriction: fw: 6.2 micros
Interpolation + add: 7.8 micros
VCycle: 827 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 7 micros
Residual calculation: 6.88889 micros
Restriction: fw: 7 micros
Interpolation + add: 6.88889 micros
VCycle: 1560 micros


BENCHMARK
Jacobi iteration: 7 micros
Residual calculation: 6.88889 micros
Restriction: fw: 7 micros
Interpolation + add: 6.88889 micros
VCycle: 1542.22 micros


BENCHMARK
Jacobi iteration: 6.88889 micros
Residual calculation: 7 micros
Restriction: fw: 6.88889 micros
Interpolation + add: 6.88889 micros
VCycle: 1560 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 5.75 micros
Residual calculation: 7.875 micros
Restriction: fw: 5.875 micros
Interpolation + add: 7.75 micros
VCycle: 2301.25 micros


BENCHMARK
Jacobi iteration: 7.75 micros
Residual calculation: 5.875 micros
Restriction: fw: 7.75 micros
Interpolation + add: 5.875 micros
VCycle: 2320 micros


BENCHMARK
Jacobi iteration: 5.875 micros
Residual calculation: 7.75 micros
Restriction: fw: 5.875 micros
Interpolation + add: 7.75 micros
VCycle: 2301.25 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 6.71429 micros
Residual calculation: 8.85714 micros
Restriction: fw: 4.57143 micros
Interpolation + add: 8.85714 micros
VCycle: 3052.86 micros


BENCHMARK
Jacobi iteration: 6.57143 micros
Residual calculation: 6.71429 micros
Restriction: fw: 6.71429 micros
Interpolation + add: 6.71429 micros
VCycle: 3075.71 micros


BENCHMARK
Jacobi iteration: 9 micros
Residual calculation: 6.71429 micros
Restriction: fw: 6.57143 micros
Interpolation + add: 6.71429 micros
VCycle: 3075.71 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 7.66667 micros
Residual calculation: 7.83333 micros
Restriction: fw: 7.83333 micros
Interpolation + add: 5.16667 micros
VCycle: 3821.67 micros


BENCHMARK
Jacobi iteration: 5.16667 micros
Residual calculation: 7.83333 micros
Restriction: fw: 7.83333 micros
Interpolation + add: 5.16667 micros
VCycle: 3821.67 micros


BENCHMARK
Jacobi iteration: 5.16667 micros
Residual calculation: 7.83333 micros
Restriction: fw: 7.83333 micros
Interpolation + add: 5.16667 micros
VCycle: 3821.67 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 6.2 micros
Residual calculation: 9.4 micros
Restriction: fw: 6.2 micros
Interpolation + add: 9.4 micros
VCycle: 4616 micros


BENCHMARK
Jacobi iteration: 6.2 micros
Residual calculation: 6.4 micros
Restriction: fw: 6.2 micros
Interpolation + add: 9.4 micros
VCycle: 4554 micros


BENCHMARK
Jacobi iteration: 6.2 micros
Residual calculation: 6.2 micros
Restriction: fw: 6.4 micros
Interpolation + add: 9.2 micros
VCycle: 4556 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 7.75 micros
Residual calculation: 4 micros
Restriction: fw: 7.75 micros
Interpolation + add: 7.75 micros
VCycle: 8815 micros


BENCHMARK
Jacobi iteration: 7.75 micros
Residual calculation: 4 micros
Restriction: fw: 7.75 micros
Interpolation + add: 7.75 micros
VCycle: 8892.5 micros


BENCHMARK
Jacobi iteration: 8 micros
Residual calculation: 7.75 micros
Restriction: fw: 7.75 micros
Interpolation + add: 7.75 micros
VCycle: 8852.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 5.33333 micros
Residual calculation: 5 micros
Restriction: fw: 10.3333 micros
Interpolation + add: 5.33333 micros
VCycle: 30836.7 micros


BENCHMARK
Jacobi iteration: 5.33333 micros
Residual calculation: 10.3333 micros
Restriction: fw: 5 micros
Interpolation + add: 10.6667 micros
VCycle: 32913.3 micros


BENCHMARK
Jacobi iteration: 10.3333 micros
Residual calculation: 5.33333 micros
Restriction: fw: 5 micros
Interpolation + add: 15.6667 micros
VCycle: 32863.3 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 8 micros
Residual calculation: 7.5 micros
Restriction: fw: 8 micros
Interpolation + add: 8 micros
VCycle: 119340 micros


BENCHMARK
Jacobi iteration: 62.5 micros
Residual calculation: 54.5 micros
Restriction: fw: 62.5 micros
Interpolation + add: 54.5 micros
VCycle: 129405 micros


BENCHMARK
Jacobi iteration: 62 micros
Residual calculation: 55 micros
Restriction: fw: 62 micros
Interpolation + add: 55 micros
VCycle: 129480 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 16 micros
Restriction: fw: 0 micros
Interpolation + add: 15 micros
VCycle: 7340 micros


BENCHMARK
Jacobi iteration: 5382 micros
Residual calculation: 4836 micros
Restriction: fw: 2745 micros
Interpolation + add: 203 micros
VCycle: 68800 micros


BENCHMARK
Jacobi iteration: 5507 micros
Residual calculation: 4898 micros
Restriction: fw: 2106 micros
Interpolation + add: 203 micros
VCycle: 68800 micros

neliz
30-May-2007, 11:09
borg, remove the spaces in the [ code ]

Blacklash
30-May-2007, 11:42
Here's three run in a row-

8800GTX Driver 158.22 OS WinXP SP-2

661|1050 (2100 effective)
http://members.cox.net/malficar2/GPUMGbench1.txt

666|1050 (2100 effective)
http://members.cox.net/malficar2/GPUMGbench2.txt

670|1050 (2100 effective)
http://members.cox.net/malficar2/GPUMGbench3.txt

PeterT
30-May-2007, 11:48
Thanks everybody for all the results, especially Silent_Buddha for running 2 different driver versions. From what I can see, the post-8.37 results are more plausible at least than those at 8.37, but still very strange:
- The V-cycle times (those are the most important ones by far) seem to be in line with expectations up to 1023^2, if a bit disappointing compared to G80 results.
- The other 4 values are all still far too good to be true.
- at 2047^2, all bets are off. I have no idea what's going on there.

I will try to release a functionality testing app later today to see which (if any) driver versions actually perform the computations correctly.

Silent_Buddha
30-May-2007, 18:29
PeterT - I'm not sure what's happening however, when actually running mgbench under 2047^2 it took a little over twice as long to finish as it did for 1023^2. I'm not sure why it then recorded cycle times that were lower than 1023^2.

Regards,
SB

Sxotty
31-May-2007, 01:56
Well my x1900 AIW just crashed and it is at stock clocks... ah well

PeterT
31-May-2007, 04:41
I have finished a new version. It will now start with a trial run to see if the right calculations are carried out. The procedure is as follows: a 511x511 system is solved with a few V-cycles, and the result is displayed. The program then asks you to either abort the benchmark or continue.

The output shown should look like this: (It's a height-colored representation of the solution)
http://peter.metaclassofnil.com/uni/result.jpg

Additionally, I changed the time measurement method to hopefully be more accurate. Please download the new version and try it:
http://peter.metaclassofnil.com/uni/mgbench310507.zip

PeterT - I'm not sure what's happening however, when actually running mgbench under 2047^2 it took a little over twice as long to finish as it did for 1023^2. I'm not sure why it then recorded cycle times that were lower than 1023^2.Thanks for this report. "A little over twice as long" is exactly as long as 2047^2 should take - the workload is a little over 4 times as high, but I only run half as many iterations. This means that most likely my time measurement method was at fault, which inspired the changes in the new version above.

Silent_Buddha
31-May-2007, 06:25
You're lucky I'm running Vista and don't have to reboot after each driver change. ;) Otherwise I might be disinclined to install 8.38 to run this again.

Also, one other thing. While running the the original MGbench that I ran, on the 2047^2 test, the system would become completely unresponsive at certain intervals. Not sure if this is part of the reason it wasn't timing it correctly or not.

[edit - adding scores]

Same system as before...8.37 obviously didn't display anything in the preview window.

8.38 displayed the preview window exactly how you have it, not sure what you did, but it's apparently running faster now. :P


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 10.8391 micros
Residual calculation: 10.7415 micros
Restriction: fw: 7.7144 micros
Interpolation + add: 10.6438 micros
VCycle: 249.984 micros


BENCHMARK
Jacobi iteration: 10.8391 micros
Residual calculation: 10.8392 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 248.031 micros


BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.5462 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.8392 micros
VCycle: 249.984 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 11.067 micros
Residual calculation: 10.7416 micros
Restriction: fw: 8.029 micros
Interpolation + add: 10.85 micros
VCycle: 406.874 micros


BENCHMARK
Jacobi iteration: 11.067 micros
Residual calculation: 10.85 micros
Restriction: fw: 7.812 micros
Interpolation + add: 11.067 micros
VCycle: 403.62 micros


BENCHMARK
Jacobi iteration: 11.067 micros
Residual calculation: 10.85 micros
Restriction: fw: 8.029 micros
Interpolation + add: 11.067 micros
VCycle: 407.96 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 10.6195 micros
Residual calculation: 10.9856 micros
Restriction: fw: 7.68987 micros
Interpolation + add: 10.9856 micros
VCycle: 563.929 micros


BENCHMARK
Jacobi iteration: 10.8635 micros
Residual calculation: 10.6195 micros
Restriction: fw: 8.05612 micros
Interpolation + add: 10.7415 micros
VCycle: 557.825 micros


BENCHMARK
Jacobi iteration: 10.8636 micros
Residual calculation: 10.7415 micros
Restriction: fw: 7.812 micros
Interpolation + add: 11.1076 micros
VCycle: 561.487 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 10.881 micros
Residual calculation: 10.4624 micros
Restriction: fw: 7.95157 micros
Interpolation + add: 11.5784 micros
VCycle: 717.03 micros


BENCHMARK
Jacobi iteration: 10.602 micros
Residual calculation: 10.7414 micros
Restriction: fw: 7.95157 micros
Interpolation + add: 10.7414 micros
VCycle: 715.636 micros


BENCHMARK
Jacobi iteration: 11.0206 micros
Residual calculation: 10.881 micros
Restriction: fw: 7.95143 micros
Interpolation + add: 11.0206 micros
VCycle: 722.61 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.9042 micros
Restriction: fw: 8.1375 micros
Interpolation + add: 10.9043 micros
VCycle: 882.105 micros


BENCHMARK
Jacobi iteration: 11.067 micros
Residual calculation: 10.7415 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 883.732 micros


BENCHMARK
Jacobi iteration: 10.9042 micros
Residual calculation: 10.9043 micros
Restriction: fw: 8.30017 micros
Interpolation + add: 11.067 micros
VCycle: 878.85 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11.5226 micros
Residual calculation: 11.9134 micros
Restriction: fw: 8.7884 micros
Interpolation + add: 12.304 micros
VCycle: 1107.35 micros


BENCHMARK
Jacobi iteration: 10.9368 micros
Residual calculation: 10.7416 micros
Restriction: fw: 8.2026 micros
Interpolation + add: 10.7414 micros
VCycle: 1105.4 micros


BENCHMARK
Jacobi iteration: 10.7414 micros
Residual calculation: 10.7416 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7414 micros
VCycle: 1111.26 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.7415 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.9857 micros
VCycle: 4321.01 micros


BENCHMARK
Jacobi iteration: 11.4738 micros
Residual calculation: 10.9855 micros
Restriction: fw: 7.568 micros
Interpolation + add: 10.9855 micros
VCycle: 4591.99 micros


BENCHMARK
Jacobi iteration: 10.9857 micros
Residual calculation: 10.7415 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 4530.96 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 11.067 micros
Residual calculation: 10.7413 micros
Restriction: fw: 8.13767 micros
Interpolation + add: 11.718 micros
VCycle: 20740.9 micros


BENCHMARK
Jacobi iteration: 12.6943 micros
Residual calculation: 11.067 micros
Restriction: fw: 8.463 micros
Interpolation + add: 11.067 micros
VCycle: 22492 micros


BENCHMARK
Jacobi iteration: 16.926 micros
Residual calculation: 11.067 micros
Restriction: fw: 8.13767 micros
Interpolation + add: 11.067 micros
VCycle: 22469.3 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.2535 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 82445.9 micros


BENCHMARK
Jacobi iteration: 40.525 micros
Residual calculation: 40.5245 micros
Restriction: fw: 36.1305 micros
Interpolation + add: 40.525 micros
VCycle: 89530.4 micros


BENCHMARK
Jacobi iteration: 41.013 micros
Residual calculation: 40.0365 micros
Restriction: fw: 36.1305 micros
Interpolation + add: 40.525 micros
VCycle: 89535.3 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 15.624 micros
Residual calculation: 16.6 micros
Restriction: fw: 12.695 micros
Interpolation + add: 10.741 micros
VCycle: 1601.46 micros


BENCHMARK
Jacobi iteration: 6255.46 micros
Residual calculation: 5349.27 micros
Restriction: fw: 2939.26 micros
Interpolation + add: 2506.68 micros
VCycle: 48610.2 micros


BENCHMARK
Jacobi iteration: 6435.14 micros
Residual calculation: 5215.49 micros
Restriction: fw: 3029.1 micros
Interpolation + add: 2398.28 micros
VCycle: 48961.7 micros


Regards,
SB

snk
31-May-2007, 07:34
These are with a C2D E6600 and 7900 GTX running on 64-bit Vista. Driver version 158.24. Once the test reached 2047 the display driver reset itself, and program seemed to hang. I'm pretty sure that it didn't use 3D-clocks, since at least one other OpenGL-based, windowed, benchmark doesn't use 3D-clocks.


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 2.9 micros
Residual calculation: 2.9 micros
Restriction: fw: 2.4 micros
Interpolation + add: 2.8 micros
VCycle: 2804 micros


BENCHMARK
Jacobi iteration: 2.9 micros
Residual calculation: 2.9 micros
Restriction: fw: 2.3 micros
Interpolation + add: 2.8 micros
VCycle: 2819 micros


BENCHMARK
Jacobi iteration: 2.9 micros
Residual calculation: 2.9 micros
Restriction: fw: 2.4 micros
Interpolation + add: 2.8 micros
VCycle: 2838 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 2.88889 micros
Residual calculation: 2.88889 micros
Restriction: fw: 2.77778 micros
Interpolation + add: 2.88889 micros
VCycle: 4681.11 micros


BENCHMARK
Jacobi iteration: 3 micros
Residual calculation: 2.77778 micros
Restriction: fw: 2.66667 micros
Interpolation + add: 2.88889 micros
VCycle: 4690 micros


BENCHMARK
Jacobi iteration: 2.88889 micros
Residual calculation: 2.88889 micros
Restriction: fw: 2.77778 micros
Interpolation + add: 2.88889 micros
VCycle: 4713.33 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 2.875 micros
Residual calculation: 2.75 micros
Restriction: fw: 2.75 micros
Interpolation + add: 2.875 micros
VCycle: 6596.25 micros


BENCHMARK
Jacobi iteration: 3 micros
Residual calculation: 2.875 micros
Restriction: fw: 2.75 micros
Interpolation + add: 2.875 micros
VCycle: 6608.75 micros


BENCHMARK
Jacobi iteration: 2.875 micros
Residual calculation: 3 micros
Restriction: fw: 2.75 micros
Interpolation + add: 2.875 micros
VCycle: 6613.75 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 3 micros
Residual calculation: 3 micros
Restriction: fw: 2.71429 micros
Interpolation + add: 2.85714 micros
VCycle: 8515.71 micros


BENCHMARK
Jacobi iteration: 2.85714 micros
Residual calculation: 3 micros
Restriction: fw: 2.85714 micros
Interpolation + add: 2.85714 micros
VCycle: 8451.43 micros


BENCHMARK
Jacobi iteration: 3 micros
Residual calculation: 2.85714 micros
Restriction: fw: 2.71429 micros
Interpolation + add: 2.85714 micros
VCycle: 8561.43 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 11.6667 micros
Residual calculation: 13 micros
Restriction: fw: 9.33333 micros
Interpolation + add: 7 micros
VCycle: 10390 micros


BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 8.5 micros
Interpolation + add: 7.16667 micros
VCycle: 10380 micros


BENCHMARK
Jacobi iteration: 12 micros
Residual calculation: 12.6667 micros
Restriction: fw: 9.33333 micros
Interpolation + add: 7.83333 micros
VCycle: 10383.3 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 49.4 micros
Residual calculation: 53.6 micros
Restriction: fw: 39.6 micros
Interpolation + add: 28.2 micros
VCycle: 12364 micros


BENCHMARK
Jacobi iteration: 50 micros
Residual calculation: 53.4 micros
Restriction: fw: 37.2 micros
Interpolation + add: 27.6 micros
VCycle: 12374 micros


BENCHMARK
Jacobi iteration: 49.2 micros
Residual calculation: 51.4 micros
Restriction: fw: 37.8 micros
Interpolation + add: 27.8 micros
VCycle: 12372 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 194.75 micros
Residual calculation: 215.5 micros
Restriction: fw: 162 micros
Interpolation + add: 110.5 micros
VCycle: 15917.5 micros


BENCHMARK
Jacobi iteration: 187.25 micros
Residual calculation: 216 micros
Restriction: fw: 164 micros
Interpolation + add: 109.75 micros
VCycle: 15930 micros


BENCHMARK
Jacobi iteration: 194.25 micros
Residual calculation: 215.5 micros
Restriction: fw: 162 micros
Interpolation + add: 110.5 micros
VCycle: 15872.5 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 725 micros
Residual calculation: 881.333 micros
Restriction: fw: 672.667 micros
Interpolation + add: 454.667 micros
VCycle: 26426.7 micros


BENCHMARK
Jacobi iteration: 727.667 micros
Residual calculation: 883.333 micros
Restriction: fw: 673.667 micros
Interpolation + add: 459.333 micros
VCycle: 26463.3 micros


BENCHMARK
Jacobi iteration: 725.667 micros
Residual calculation: 884 micros
Restriction: fw: 674.333 micros
Interpolation + add: 460 micros
VCycle: 26433.3 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 2741.5 micros
Residual calculation: 3469 micros
Restriction: fw: 2981 micros
Interpolation + add: 2642 micros
VCycle: 76140 micros


BENCHMARK
Jacobi iteration: 2740 micros
Residual calculation: 3469 micros
Restriction: fw: 2981 micros
Interpolation + add: 2642.5 micros
VCycle: 76145 micros


BENCHMARK
Jacobi iteration: 2740.5 micros
Residual calculation: 3468.5 micros
Restriction: fw: 2981 micros
Interpolation + add: 2642.5 micros
VCycle: 76195 micros



------------------
-- size: 2047

neliz
31-May-2007, 08:19
Peter, looks like the App is a fine rebooter! :)

Tried it on my X1600Mobile XP (cat7.4) and it seems it rebooted at 1023 (I walked away because I was pretty confident that would happen.)

Results nonetheless..

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 6.2502 micros
Residual calculation: 6.2501 micros
Restriction: fw: 4.6876 micros
Interpolation + add: 6.2502 micros
VCycle: 625.016 micros


BENCHMARK
Jacobi iteration: 6.2502 micros
Residual calculation: 6.2501 micros
Restriction: fw: 6.2502 micros
Interpolation + add: 6.2502 micros
VCycle: 640.641 micros


BENCHMARK
Jacobi iteration: 6.2502 micros
Residual calculation: 6.2501 micros
Restriction: fw: 6.2502 micros
Interpolation + add: 4.6876 micros
VCycle: 640.642 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 6.94467 micros
Residual calculation: 5.20844 micros
Restriction: fw: 6.94467 micros
Interpolation + add: 5.20844 micros
VCycle: 1059.05 micros


BENCHMARK
Jacobi iteration: 5.20856 micros
Residual calculation: 6.94456 micros
Restriction: fw: 5.20844 micros
Interpolation + add: 6.94467 micros
VCycle: 1041.69 micros


BENCHMARK
Jacobi iteration: 6.94456 micros
Residual calculation: 5.20856 micros
Restriction: fw: 6.94456 micros
Interpolation + add: 5.20844 micros
VCycle: 1076.42 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 5.8595 micros
Residual calculation: 5.8595 micros
Restriction: fw: 5.8595 micros
Interpolation + add: 5.85963 micros
VCycle: 1464.88 micros


BENCHMARK
Jacobi iteration: 5.8595 micros
Residual calculation: 5.8595 micros
Restriction: fw: 5.8595 micros
Interpolation + add: 5.85963 micros
VCycle: 1464.88 micros


BENCHMARK
Jacobi iteration: 5.8595 micros
Residual calculation: 5.8595 micros
Restriction: fw: 5.8595 micros
Interpolation + add: 5.85963 micros
VCycle: 1445.35 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 15.6254 micros
Residual calculation: 15.6254 micros
Restriction: fw: 17.8576 micros
Interpolation + add: 11.161 micros
VCycle: 1942.01 micros


BENCHMARK
Jacobi iteration: 15.6254 micros
Residual calculation: 17.8576 micros
Restriction: fw: 15.6254 micros
Interpolation + add: 11.161 micros
VCycle: 1964.34 micros


BENCHMARK
Jacobi iteration: 17.8576 micros
Residual calculation: 15.6254 micros
Restriction: fw: 17.8576 micros
Interpolation + add: 11.161 micros
VCycle: 1942.01 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 59.8973 micros
Residual calculation: 59.8973 micros
Restriction: fw: 67.7102 micros
Interpolation + add: 39.0635 micros
VCycle: 2734.45 micros


BENCHMARK
Jacobi iteration: 59.8973 micros
Residual calculation: 59.8973 micros
Restriction: fw: 67.7102 micros
Interpolation + add: 39.0635 micros
VCycle: 2708.4 micros


BENCHMARK
Jacobi iteration: 59.8973 micros
Residual calculation: 59.8973 micros
Restriction: fw: 65.1058 micros
Interpolation + add: 39.0635 micros
VCycle: 2708.4 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 481.262 micros
Residual calculation: 3.125 micros
Restriction: fw: 246.881 micros
Interpolation + add: 153.129 micros
VCycle: 5281.39 micros


BENCHMARK
Jacobi iteration: 221.881 micros
Residual calculation: 234.381 micros
Restriction: fw: 275.007 micros
Interpolation + add: 153.129 micros
VCycle: 5312.64 micros


BENCHMARK
Jacobi iteration: 221.881 micros
Residual calculation: 240.631 micros
Restriction: fw: 271.882 micros
Interpolation + add: 153.129 micros
VCycle: 5281.38 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 699.237 micros
Residual calculation: 1160.19 micros
Restriction: fw: 234.381 micros
Interpolation + add: 277.351 micros
VCycle: 19258.3 micros


BENCHMARK
Jacobi iteration: 781.27 micros
Residual calculation: 937.524 micros
Restriction: fw: 1113.31 micros
Interpolation + add: 703.143 micros
VCycle: 19336.4 micros


BENCHMARK
Jacobi iteration: 781.27 micros
Residual calculation: 937.524 micros
Restriction: fw: 1113.31 micros
Interpolation + add: 703.143 micros
VCycle: 19336.4 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 2005.26 micros
Residual calculation: 5.20833 micros
Restriction: fw: 6552.25 micros
Interpolation + add: 2250.06 micros
VCycle: 59272.4 micros


BENCHMARK
Jacobi iteration: 2494.86 micros
Residual calculation: 3744.89 micros
Restriction: fw: 4421.99 micros
Interpolation + add: 3520.92 micros
VCycle: 84012.6 micros


BENCHMARK
Jacobi iteration: 2432.35 micros
Residual calculation: 3739.68 micros
Restriction: fw: 4427.2 micros
Interpolation + add: 3541.76 micros
VCycle: 84377.2 micros



------------------
-- size: 1023

hoom
31-May-2007, 12:23
C2D 6600 @ stock
X1900GT @ stock
Cat 7.2
XP Home

Crashed hard half way through the 1023 both times I ran it (2nd version)
Screen went blank & system unresponsive :(
First time I was running Winamp & that was running audio fine after the blank for several mins so wasn't full hard lock but couldn't use x to abort or use reboot key combo.
2nd time nothing running in background but I ran task manager, CPU sat at 50-53% with almost all being Kernel time.

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 4.6875 micros
Restriction: fw: 3.125 micros
Interpolation + add: 4.6875 micros
VCycle: 421.875 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 4.6875 micros
Interpolation + add: 3.125 micros
VCycle: 421.875 micros


BENCHMARK
Jacobi iteration: 4.6875 micros
Residual calculation: 3.125 micros
Restriction: fw: 4.6875 micros
Interpolation + add: 3.125 micros
VCycle: 421.875 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 3.47222 micros
VCycle: 694.444 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 3.47222 micros
VCycle: 694.444 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 711.806 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 976.563 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 1.95313 micros
VCycle: 996.094 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 1.95313 micros
VCycle: 996.094 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 8.92857 micros
Residual calculation: 6.69643 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 6.69643 micros
VCycle: 1294.64 micros


BENCHMARK
Jacobi iteration: 8.92857 micros
Residual calculation: 6.69643 micros
Restriction: fw: 13.3929 micros
Interpolation + add: 4.46429 micros
VCycle: 1272.32 micros


BENCHMARK
Jacobi iteration: 8.92857 micros
Residual calculation: 6.69643 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 6.69643 micros
VCycle: 1294.64 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 31.25 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 18.2292 micros
VCycle: 1640.63 micros


BENCHMARK
Jacobi iteration: 31.25 micros
Residual calculation: 7.8125 micros
Restriction: fw: 23.4375 micros
Interpolation + add: 15.625 micros
VCycle: 1666.67 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 18.2292 micros
Restriction: fw: 23.4375 micros
Interpolation + add: 15.625 micros
VCycle: 1692.71 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 125 micros
Residual calculation: 9.375 micros
Restriction: fw: 90.625 micros
Interpolation + add: 78.125 micros
VCycle: 2031.25 micros


BENCHMARK
Jacobi iteration: 9.375 micros
Residual calculation: 106.25 micros
Restriction: fw: 78.125 micros
Interpolation + add: 15.625 micros
VCycle: 2437.5 micros


BENCHMARK
Jacobi iteration: 109.375 micros
Residual calculation: 100 micros
Restriction: fw: 6.25 micros
Interpolation + add: 68.75 micros
VCycle: 2218.75 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 480.469 micros
Restriction: fw: 343.75 micros
Interpolation + add: 42.9688 micros
VCycle: 5195.31 micros


BENCHMARK
Jacobi iteration: 7.8125 micros
Residual calculation: 484.375 micros
Restriction: fw: 343.75 micros
Interpolation + add: 46.875 micros
VCycle: 5117.19 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 480.469 micros
Restriction: fw: 343.75 micros
Interpolation + add: 50.7813 micros
VCycle: 5039.06 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 916.667 micros
Residual calculation: 1182.29 micros
Restriction: fw: 677.083 micros
Interpolation + add: 973.958 micros
VCycle: 10416.7 micros


BENCHMARK
Jacobi iteration: 1197.92 micros
Residual calculation: 937.5 micros
Restriction: fw: 703.125 micros
Interpolation + add: 1010.42 micros
VCycle: 10833.3 micros


BENCHMARK
Jacobi iteration: 1250 micros
Residual calculation: 953.125 micros
Restriction: fw: 713.542 micros
Interpolation + add: 1010.42 micros
VCycle: 10729.2 micros



------------------
-- size: 1023

PeterT
31-May-2007, 13:42
Thanks for the results. It's amazing how many graphics cards are unstable at stock settings when faced with some unusual load.

snk
31-May-2007, 18:13
Thanks for the results. It's amazing how many graphics cards are unstable at stock settings when faced with some unusual load.

To be honest, I don't think that this is necessarily the case. First of all, I confirmed that at least on my rig the benchmark runs with 2D-clocks, since there was negligible difference in times between core clock of 580 and 680 MHz on 7900 GTX (default is 650). Secondly, 2047 reset the display driver at both clock speeds.

IIRC 2D clocks are somewhere around 300 MHz for this card, so I doubt that it would be unstable :)

GPU temperature during the 1023 test was around 70 °C, but it's even higher during a 3DMark06 run, so that shouldn't play a role either.

Rys
31-May-2007, 19:28
8800 GTX + 8600 GTS (app starts on a monitor being driven by the 8800 GTX), Vista x64, 8GB mem, Core 2 X6800, 158.42 driver.

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

------------------
-- size: 3

BENCHMARK
Jacobi iteration: 2.4898 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.4899 micros
Interpolation + add: 2.4898 micros
VCycle: 6415.44 micros


BENCHMARK
Jacobi iteration: 2.5386 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.4899 micros
Interpolation + add: 2.2945 micros
VCycle: 6446.68 micros


BENCHMARK
Jacobi iteration: 2.4898 micros
Residual calculation: 2.5386 micros
Restriction: fw: 2.4898 micros
Interpolation + add: 2.3434 micros
VCycle: 6436.43 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 2.54944 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.49533 micros
Interpolation + add: 2.38667 micros
VCycle: 10857 micros


BENCHMARK
Jacobi iteration: 2.54956 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.441 micros
Interpolation + add: 2.33244 micros
VCycle: 10822.3 micros


BENCHMARK
Jacobi iteration: 2.49533 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.441 micros
Interpolation + add: 2.38667 micros
VCycle: 10822.9 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 2.502 micros
Residual calculation: 2.502 micros
Restriction: fw: 2.441 micros
Interpolation + add: 2.319 micros
VCycle: 15284.3 micros


BENCHMARK
Jacobi iteration: 2.502 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.441 micros
Interpolation + add: 2.319 micros
VCycle: 15225.1 micros


BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.502 micros
Restriction: fw: 2.563 micros
Interpolation + add: 2.319 micros
VCycle: 15280 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.51071 micros
Restriction: fw: 2.441 micros
Interpolation + add: 2.441 micros
VCycle: 19661.9 micros


BENCHMARK
Jacobi iteration: 2.51071 micros
Residual calculation: 2.51086 micros
Restriction: fw: 2.58043 micros
Interpolation + add: 2.58043 micros
VCycle: 19643.8 micros


BENCHMARK
Jacobi iteration: 2.65029 micros
Residual calculation: 2.51071 micros
Restriction: fw: 2.51071 micros
Interpolation + add: 2.30157 micros
VCycle: 19621.5 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 2.60367 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.52233 micros
Interpolation + add: 2.27833 micros
VCycle: 24185.4 micros


BENCHMARK
Jacobi iteration: 2.35967 micros
Residual calculation: 2.60383 micros
Restriction: fw: 2.3595 micros
Interpolation + add: 2.27833 micros
VCycle: 24112.2 micros


BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.3595 micros
Interpolation + add: 2.197 micros
VCycle: 24174 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.5386 micros
Restriction: fw: 10.057 micros
Interpolation + add: 12.0096 micros
VCycle: 28820.4 micros


BENCHMARK
Jacobi iteration: 2.5386 micros
Residual calculation: 2.441 micros
Restriction: fw: 10.1546 micros
Interpolation + add: 12.1074 micros
VCycle: 28782.3 micros


BENCHMARK
Jacobi iteration: 2.5386 micros
Residual calculation: 2.5386 micros
Restriction: fw: 10.057 micros
Interpolation + add: 12.0096 micros
VCycle: 28935.6 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.441 micros
Restriction: fw: 33.0757 micros
Interpolation + add: 49.5523 micros
VCycle: 34689.1 micros


BENCHMARK
Jacobi iteration: 2.31875 micros
Residual calculation: 2.441 micros
Restriction: fw: 33.0757 micros
Interpolation + add: 49.6743 micros
VCycle: 34759.8 micros


BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.441 micros
Restriction: fw: 32.9535 micros
Interpolation + add: 49.6743 micros
VCycle: 34707.4 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 2.27833 micros
Residual calculation: 2.441 micros
Restriction: fw: 93.5717 micros
Interpolation + add: 146.785 micros
VCycle: 46203.2 micros


BENCHMARK
Jacobi iteration: 2.27833 micros
Residual calculation: 2.441 micros
Restriction: fw: 93.5717 micros
Interpolation + add: 146.297 micros
VCycle: 46242.3 micros


BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.441 micros
Restriction: fw: 94.711 micros
Interpolation + add: 147.274 micros
VCycle: 46523.8 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 2.685 micros
Residual calculation: 2.197 micros
Restriction: fw: 2.441 micros
Interpolation + add: 351.26 micros
VCycle: 89816.6 micros


BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.929 micros
Restriction: fw: 2.441 micros
Interpolation + add: 349.307 micros
VCycle: 89618.9 micros


BENCHMARK
Jacobi iteration: 2.6855 micros
Residual calculation: 2.1965 micros
Restriction: fw: 2.441 micros
Interpolation + add: 347.599 micros
VCycle: 89384.5 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 4.394 micros
Residual calculation: 3.417 micros
Restriction: fw: 3.418 micros
Interpolation + add: 3.417 micros
VCycle: 253195 micros


BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.441 micros
Restriction: fw: 2.929 micros
Interpolation + add: 2.441 micros
VCycle: 252985 micros


BENCHMARK
Jacobi iteration: 2.441 micros
Residual calculation: 2.929 micros
Restriction: fw: 2.929 micros
Interpolation + add: 2.441 micros
VCycle: 253122 micros

fellix
31-May-2007, 20:08
Opteron 165 @ 2700MHz;
x1900XT 256MB @ 655/1764MHz;
Vista-32 & Catalyst 7.4;Nothing is being displayed in the Output window and the FPS counter (from ATi Tray Tools) says 0 frames.
In the console window, the last three runs finished really quickly.
GL multigrid benchmark 0.5 by Peter Thoman 2007
DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

------------------
-- size: 3
BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 265.625 micros

BENCHMARK
Jacobi iteration: 14.0625 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 250 micros

BENCHMARK
Jacobi iteration: 14.0625 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 265.625 micros

------------------
-- size: 7
BENCHMARK
Jacobi iteration: 12.1528 micros
Residual calculation: 12.1528 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 12.1528 micros
VCycle: 399.306 micros

BENCHMARK
Jacobi iteration: 13.8889 micros
Residual calculation: 12.1528 micros
Restriction: fw: 8.68056 micros
Interpolation + add: 12.1528 micros
VCycle: 416.667 micros

BENCHMARK
Jacobi iteration: 12.1528 micros
Residual calculation: 12.1528 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 12.1528 micros
VCycle: 416.667 micros

------------------
-- size: 15
BENCHMARK
Jacobi iteration: 13.6719 micros
Residual calculation: 13.6719 micros
Restriction: fw: 9.76563 micros
Interpolation + add: 11.7188 micros
VCycle: 566.406 micros

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 13.6719 micros
Restriction: fw: 9.76563 micros
Interpolation + add: 11.7188 micros
VCycle: 566.406 micros

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 13.6719 micros
Restriction: fw: 9.76563 micros
Interpolation + add: 11.7188 micros
VCycle: 566.406 micros

------------------
-- size: 31
BENCHMARK
Jacobi iteration: 13.3929 micros
Residual calculation: 13.3929 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 13.3929 micros
VCycle: 714.286 micros

BENCHMARK
Jacobi iteration: 13.3929 micros
Residual calculation: 11.1607 micros
Restriction: fw: 11.1607 micros
Interpolation + add: 11.1607 micros
VCycle: 736.607 micros

BENCHMARK
Jacobi iteration: 13.3929 micros
Residual calculation: 13.3929 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 13.3929 micros
VCycle: 714.286 micros

------------------
-- size: 63
BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 13.0208 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 13.0208 micros
VCycle: 885.417 micros

BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 13.0208 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 13.0208 micros
VCycle: 885.417 micros

BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 13.0208 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 13.0208 micros
VCycle: 885.417 micros

------------------
-- size: 127
BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 1000 micros

BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 1031.25 micros

BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 1031.25 micros

------------------
-- size: 255
BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 11.7188 micros
Restriction: fw: 11.7188 micros
Interpolation + add: 11.7188 micros
VCycle: 1171.88 micros

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 15.625 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 11.7188 micros
VCycle: 1210.94 micros

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 15.625 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 11.7188 micros
VCycle: 1210.94 micros

------------------
-- size: 511
BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 10.4167 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 10.4167 micros
VCycle: 1406.25 micros

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 10.4167 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 10.4167 micros
VCycle: 1354.17 micros

BENCHMARK
Jacobi iteration: 10.4167 micros
Residual calculation: 15.625 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 15.625 micros
VCycle: 1354.17 micros

------------------
-- size: 1023
BENCHMARK
Jacobi iteration: 7.8125 micros
Residual calculation: 15.625 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 15.625 micros
VCycle: 1562.5 micros

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 15.625 micros
VCycle: 1484.38 micros

BENCHMARK
Jacobi iteration: 7.8125 micros
Residual calculation: 15.625 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 15.625 micros
VCycle: 1562.5 micros

------------------
-- size: 2047
BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 15.625 micros
VCycle: 1562.5 micros

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 0 micros
Restriction: fw: 15.625 micros
Interpolation + add: 15.625 micros
VCycle: 1562.5 micros

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 0 micros
Restriction: fw: 15.625 micros
Interpolation + add: 15.625 micros
VCycle: 1562.5 micros

Arun
31-May-2007, 20:33
Thanks for the results. It's amazing how many graphics cards are unstable at stock settings when faced with some unusual load.The problem is not stability, afaik, but the driver model itself: if you let a graphics command run for too long, Windows will assume it hanged and it will ask the driver to reset itself. And yes, it's an horribly annoying and stupid problem! :(

PeterT
31-May-2007, 20:54
To be honest, I don't think that this is necessarily the case. [...]The problem is not stability, afaik, but the driver model itself: if you let a graphics command run for too long, Windows will assume it hanged and it will ask the driver to reset itself. And yes, it's an horribly annoying and stupid problem!Hmm, interesting. But most of the reports I got are of BSoDs. That can't be the expected driver behaviour, can it?

I assumed that it's a question of stability, as on my system, as soon as I overclock my GPU past a certain range (where everything else, e.g. looping 3DMark, still works perfectly) the benchmark causes a complete hang. This behaviour is perfectly reproducible. In fact, the range in which the benchmark manages to finish running even extends slightly if I fully turn up my case fans. It's quite natural to believe it to be a stability issue in such a case, don't you agree? But it could of course be a different issue that causes the crashes on other systems.

Anyway, this thread again goes to show why I'm happy with the emergence of APIs like CUDA and CTM, and why I'll use one of those for sure in case I do another GPGPU project. Graphics drivers are just too arbitrary at times.

Tim Murray
31-May-2007, 20:57
Hmm, interesting. But most of the reports I got are of BSoDs. That can't be the expected driver behaviour, can it?
From the CUDA v0.8 release notes:
Individual GPU program launches are limited to a run time of less than 5 seconds on the device. Exceeding this time limit usually causes a launch failure reported through the CUDA driver or the CUDA runtime, but in some cases hangs the entire machine, requiring a hard reset. Microsoft Windows has a "watchdog" timer that causes programs using the primary graphics adapter to time out if they run longer than the maximum allowed time. For this reason it is recommeded that CUDA is run on a G80 that is NOT attached to a display and does not have the Windows desktop extended onto it. In this case, the system must contain at least one NVIDIA GPU that serves as the primary graphics adapter.
I'm assuming that the five second watchdog timer affects your programs as well.

PeterT
31-May-2007, 21:38
I'm assuming that the five second watchdog timer affects your programs as well.Hmm, if I read that correctly it's 5 seconds per individual GPU operation. As my programs, even at the maximum size of 2047², are more in the range of 10 milliseconds per operation it can't be that specific timer event triggering (the benchmark runs thousands of those iterations).

Arun
31-May-2007, 22:30
Hmm, if I read that correctly it's 5 seconds per individual GPU operation. As my programs, even at the maximum size of 2047², are more in the range of 10 milliseconds per operation it can't be that specific timer event triggering (the benchmark runs thousands of those iterations).Areyou telling the GPU the frame is over after each iteration by presenting the backbuffer? Otherwise, all that time might accumulate... :(

PeterT
31-May-2007, 23:21
Are you telling the GPU the frame is over after each iteration by presenting the backbuffer? Otherwise, all that time might accumulate... :(Hmm, my inital benchmarking app didn't even present the backbuffer once. The new one does once in the beginning to show the confirmation picture. But if the watchdog did indeed count everything between 2 buffer flips as one "gpu operation" for the sake of that 5 second limit my benchmark should always cause it to call for a reset, no?

If it really works that way I can see why you called it an horribly annoying and stupid problem ;)

Geeforcer
31-May-2007, 23:56
Results using the latest version (8800 GTX, etc)

GL multigrid benchmark 0.5 by Peter Thoman 2007
DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

------------------
-- size: 3
BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 1.5625 micros
Restriction: fw: 3.125 micros
Interpolation + add: 1.5625 micros
VCycle: 390.625 micros

BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 3.125 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 1.5625 micros
VCycle: 390.625 micros

BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 1.5625 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 1.5625 micros
VCycle: 406.25 micros

------------------
-- size: 7
BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 1.73611 micros
VCycle: 659.722 micros

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 1.73611 micros
VCycle: 659.722 micros

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 1.73611 micros
VCycle: 659.722 micros

------------------
-- size: 15
BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 937.5 micros

BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 937.5 micros

BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 917.969 micros

------------------
-- size: 31
BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 0 micros
Interpolation + add: 2.23214 micros
VCycle: 1205.36 micros

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1205.36 micros

BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 0 micros
VCycle: 1227.68 micros

------------------
-- size: 63
BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 2.60417 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1484.38 micros

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 2.60417 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1484.38 micros

BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1510.42 micros

------------------
-- size: 127
BENCHMARK
Jacobi iteration: 9.375 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 6.25 micros
VCycle: 1781.25 micros

BENCHMARK
Jacobi iteration: 9.375 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 6.25 micros
VCycle: 1812.5 micros

BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 9.375 micros
Restriction: fw: 12.5 micros
Interpolation + add: 6.25 micros
VCycle: 1781.25 micros

------------------
-- size: 255
BENCHMARK
Jacobi iteration: 50.7813 micros
Residual calculation: 46.875 micros
Restriction: fw: 46.875 micros
Interpolation + add: 31.25 micros
VCycle: 2265.63 micros

BENCHMARK
Jacobi iteration: 50.7813 micros
Residual calculation: 46.875 micros
Restriction: fw: 42.9688 micros
Interpolation + add: 35.1563 micros
VCycle: 2265.63 micros

BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 50.7813 micros
Restriction: fw: 42.9688 micros
Interpolation + add: 31.25 micros
VCycle: 2265.63 micros

------------------
-- size: 511
BENCHMARK
Jacobi iteration: 197.917 micros
Residual calculation: 192.708 micros
Restriction: fw: 192.708 micros
Interpolation + add: 125 micros
VCycle: 3854.17 micros

BENCHMARK
Jacobi iteration: 197.917 micros
Residual calculation: 192.708 micros
Restriction: fw: 187.5 micros
Interpolation + add: 130.208 micros
VCycle: 3854.17 micros

BENCHMARK
Jacobi iteration: 197.917 micros
Residual calculation: 192.708 micros
Restriction: fw: 192.708 micros
Interpolation + add: 125 micros
VCycle: 3854.17 micros

------------------
-- size: 1023
BENCHMARK
Jacobi iteration: 812.5 micros
Residual calculation: 773.438 micros
Restriction: fw: 835.938 micros
Interpolation + add: 562.5 micros
VCycle: 16484.4 micros

BENCHMARK
Jacobi iteration: 796.875 micros
Residual calculation: 781.25 micros
Restriction: fw: 867.188 micros
Interpolation + add: 539.063 micros
VCycle: 16093.8 micros

BENCHMARK
Jacobi iteration: 804.688 micros
Residual calculation: 773.438 micros
Restriction: fw: 851.563 micros
Interpolation + add: 546.875 micros
VCycle: 16015.6 micros

------------------
-- size: 2047
BENCHMARK
Jacobi iteration: 468.75 micros
Residual calculation: 2718.75 micros
Restriction: fw: 3906.25 micros
Interpolation + add: 3031.25 micros
VCycle: 95312.5 micros

BENCHMARK
Jacobi iteration: 484.375 micros
Residual calculation: 2734.38 micros
Restriction: fw: 3875 micros
Interpolation + add: 3031.25 micros
VCycle: 95468.8 micros

BENCHMARK
Jacobi iteration: 437.5 micros
Residual calculation: 2718.75 micros
Restriction: fw: 3890.63 micros
Interpolation + add: 3031.25 micros
VCycle: 95625 micros

Is it just me or do people with similar systems have quite different results? (From looking at other 8800 GTX results here).

BRiT
01-Jun-2007, 03:05
Will install the latest Nvidia Vista x64 drivers and give it another go. Here's results from the 158.43 with the MSI 8800GTX HDOC at it's stock clocks.

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 4.6875 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 6390.63 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 4.6875 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 6328.13 micros


BENCHMARK
Jacobi iteration: 4.6875 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 6421.88 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 10816 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 10798.6 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 10711.8 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 15253.9 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 15195.3 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 15293 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 2.23214 micros
Restriction: fw: 4.46429 micros
Interpolation + add: 4.46429 micros
VCycle: 19508.9 micros


BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 2.23214 micros
Restriction: fw: 4.46429 micros
Interpolation + add: 2.23214 micros
VCycle: 19598.2 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 4.46429 micros
Restriction: fw: 4.46429 micros
Interpolation + add: 2.23214 micros
VCycle: 19843.8 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 24088.5 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 2.60417 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 2.60417 micros
VCycle: 24375 micros


BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 25045.5 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 6.25 micros
Interpolation + add: 9.375 micros
VCycle: 29558.6 micros


BENCHMARK
Jacobi iteration: 6.25 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 9.375 micros
VCycle: 30009.7 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 6.25 micros
Interpolation + add: 9.375 micros
VCycle: 29839.8 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 27.3438 micros
Interpolation + add: 46.875 micros
VCycle: 34843.8 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 27.3438 micros
Interpolation + add: 46.875 micros
VCycle: 34804.7 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 27.3438 micros
Interpolation + add: 46.875 micros
VCycle: 34765.6 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 5.20833 micros
Restriction: fw: 98.9583 micros
Interpolation + add: 140.625 micros
VCycle: 45572.9 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 93.75 micros
Interpolation + add: 140.625 micros
VCycle: 45677.1 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 0 micros
Restriction: fw: 98.9583 micros
Interpolation + add: 140.625 micros
VCycle: 45625 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 7.8125 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 421.875 micros
VCycle: 84765.6 micros


BENCHMARK
Jacobi iteration: 7.8125 micros
Residual calculation: 0 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 421.875 micros
VCycle: 84765.6 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 421.875 micros
VCycle: 84765.6 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 15.625 micros
VCycle: 235781 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 15.625 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 235938 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 15.625 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 236094 micros

BRiT
01-Jun-2007, 03:56
This time with Nvidia 158.45 Vista Ultimate x64 drivers on the MSI 8800GTX HDOC, X2 4400+ @ 2.5Ghz, 2Gig DDR 250Mhz 3-4-4-8. Results seem inline/similar to previous results with 158.43.

As for why the numbers seem off for people with similar 8800GTX cards, driver, and OS, I can only guess it's perhaps due to the system CPU.

Peter, can you explain the "0 micros" results on several of my results (starting at size 255)?

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 4.6875 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 5781.25 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 5734.38 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 4.6875 micros
VCycle: 5687.5 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 9878.47 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 9635.42 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 9652.78 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 13671.9 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 13730.5 micros


BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 13457 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 4.46429 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 4.46429 micros
VCycle: 17700.9 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 4.46429 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 4.46429 micros
VCycle: 17633.9 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 4.46429 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 4.46429 micros
VCycle: 17656.3 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 2.60417 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 5.20833 micros
VCycle: 21744.8 micros


BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 2.60417 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 2.60417 micros
VCycle: 21718.8 micros


BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 2.60417 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 2.60417 micros
VCycle: 21536.5 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 6.25 micros
Interpolation + add: 9.375 micros
VCycle: 25968.8 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 6.25 micros
Interpolation + add: 9.375 micros
VCycle: 25937.5 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 6.25 micros
Interpolation + add: 9.375 micros
VCycle: 25625 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 35.1563 micros
Interpolation + add: 42.9688 micros
VCycle: 31796.9 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 35.1563 micros
Interpolation + add: 42.9688 micros
VCycle: 31640.6 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 3.90625 micros
Restriction: fw: 35.1563 micros
Interpolation + add: 42.9688 micros
VCycle: 31992.2 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 0 micros
Restriction: fw: 119.792 micros
Interpolation + add: 145.833 micros
VCycle: 42500 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 5.20833 micros
Restriction: fw: 114.583 micros
Interpolation + add: 151.042 micros
VCycle: 42760.4 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 0 micros
Restriction: fw: 119.792 micros
Interpolation + add: 145.833 micros
VCycle: 42291.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 7.8125 micros
Restriction: fw: 0 micros
Interpolation + add: 539.063 micros
VCycle: 80937.5 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 7.8125 micros
Restriction: fw: 0 micros
Interpolation + add: 531.25 micros
VCycle: 80468.8 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 7.8125 micros
Restriction: fw: 0 micros
Interpolation + add: 531.25 micros
VCycle: 80703.1 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 233438 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 15.625 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 233750 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 15.625 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 233438 micros

BRiT
01-Jun-2007, 04:09
Anybody know what`s the deviceID for an 8800GTX?As in one that would be useable in 3DAnalyze?

Not sure if this is what you're looking for or what 3DAnalyze needs, but from the latest NV device drivers:

%NVIDIA_G80.DEV_0191.1% = nv_NV3x, PCI\VEN_10DE&DEV_0191
%NVIDIA_G80.DEV_0193.1% = nv_NV3x, PCI\VEN_10DE&DEV_0193
%NVIDIA_G80.DEV_0194.1% = nv_NV3x, PCI\VEN_10DE&DEV_0194
%NVIDIA_G84.DEV_0400.1% = nv_NV3x, PCI\VEN_10DE&DEV_0400
%NVIDIA_G84.DEV_0402.1% = nv_NV3x, PCI\VEN_10DE&DEV_0402
%NVIDIA_G86.DEV_0421.1% = nv_NV3x, PCI\VEN_10DE&DEV_0421
%NVIDIA_G86.DEV_0422.1% = nv_NV3x, PCI\VEN_10DE&DEV_0422
%NVIDIA_G86.DEV_0423.1% = nv_NV3x, PCI\VEN_10DE&DEV_0423

armchair_architect
01-Jun-2007, 05:45
The problem is not stability, afaik, but the driver model itself: if you let a graphics command run for too long, Windows will assume it hanged and it will ask the driver to reset itself. And yes, it's an horribly annoying and stupid problem! :(

Here's Microsoft's description of this timeout mechanism:
http://www.microsoft.com/whdc/device/display/wddm_timeout.mspx

Under XP the timeout was 15 seconds, and resulted in a BSOD with error code 0xEA.

banksie
01-Jun-2007, 07:56
Catalyst 7.5
Windows XP Pro
AMD A64 3000+ (Socket 754)
2 Gb of RAM
Radeon HD 2900 XT


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 14.0625 micros
Residual calculation: 15.625 micros
Restriction: fw: 10.9375 micros
Interpolation + add: 14.0625 micros
VCycle: 328.125 micros


BENCHMARK
Jacobi iteration: 14.0625 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.375 micros
Interpolation + add: 15.625 micros
VCycle: 328.125 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 14.0625 micros
Restriction: fw: 9.375 micros
Interpolation + add: 15.625 micros
VCycle: 328.125 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 13.8889 micros
Residual calculation: 13.8889 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 13.8889 micros
VCycle: 555.556 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.8889 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 15.625 micros
VCycle: 555.556 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.8889 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 13.8889 micros
VCycle: 555.556 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.6719 micros
Restriction: fw: 9.76563 micros
Interpolation + add: 15.625 micros
VCycle: 742.188 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.6719 micros
Restriction: fw: 11.7188 micros
Interpolation + add: 13.6719 micros
VCycle: 761.719 micros


BENCHMARK
Jacobi iteration: 13.6719 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.76563 micros
Interpolation + add: 15.625 micros
VCycle: 761.719 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.3929 micros
Restriction: fw: 11.1607 micros
Interpolation + add: 13.3929 micros
VCycle: 959.821 micros


BENCHMARK
Jacobi iteration: 13.3929 micros
Residual calculation: 13.3929 micros
Restriction: fw: 11.1607 micros
Interpolation + add: 15.625 micros
VCycle: 959.821 micros


BENCHMARK
Jacobi iteration: 13.3929 micros
Residual calculation: 15.625 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 15.625 micros
VCycle: 959.821 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 15.625 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 15.625 micros
VCycle: 1171.88 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.0208 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 15.625 micros
VCycle: 1171.88 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.0208 micros
Restriction: fw: 13.0208 micros
Interpolation + add: 13.0208 micros
VCycle: 1197.92 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.375 micros
Interpolation + add: 15.625 micros
VCycle: 1937.5 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.375 micros
Interpolation + add: 15.625 micros
VCycle: 1937.5 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 1937.5 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 121.094 micros
VCycle: 5546.88 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 11.7188 micros
Restriction: fw: 27.3438 micros
Interpolation + add: 128.906 micros
VCycle: 5546.88 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 11.7188 micros
Restriction: fw: 31.25 micros
Interpolation + add: 117.188 micros
VCycle: 5625 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 130.208 micros
VCycle: 30625 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 130.208 micros
VCycle: 31197.9 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 140.625 micros
VCycle: 31041.7 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 7.8125 micros
Restriction: fw: 15.625 micros
Interpolation + add: 15.625 micros
VCycle: 128047 micros


BENCHMARK
Jacobi iteration: 54.6875 micros
Residual calculation: 54.6875 micros
Restriction: fw: 62.5 micros
Interpolation + add: 54.6875 micros
VCycle: 133906 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 15.625 micros
VCycle: 128125 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 15.625 micros
VCycle: 1.50234e+006 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 359.375 micros
Interpolation + add: 111484 micros
VCycle: 432344 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 15.625 micros
VCycle: 1.58656e+006 micros


Amazing the difference in time the drivers make. So much slower now it is actually computing something... ;) Test image seemed identical to the one posted.

fellix
01-Jun-2007, 10:44
OK, looks like the new Catalyst unleashes the heavens for the thing:

Opteron 165 @ 2700 MHz;
x1900XT-256MB @ 655/1764 MHz;
Vista-32;GL multigrid benchmark 0.5 by Peter Thoman 2007
DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

------------------
-- size: 3
BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 281.25 micros

BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 14.0625 micros
VCycle: 265.625 micros

BENCHMARK
Jacobi iteration: 14.0625 micros
Residual calculation: 12.5 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 12.5 micros
VCycle: 281.25 micros

------------------
-- size: 7
BENCHMARK
Jacobi iteration: 12.1528 micros
Residual calculation: 13.8889 micros
Restriction: fw: 8.68056 micros
Interpolation + add: 13.8889 micros
VCycle: 451.389 micros

BENCHMARK
Jacobi iteration: 12.1528 micros
Residual calculation: 12.1528 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 12.1528 micros
VCycle: 468.75 micros

BENCHMARK
Jacobi iteration: 12.1528 micros
Residual calculation: 12.1528 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 12.1528 micros
VCycle: 468.75 micros

------------------
-- size: 15
BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 13.6719 micros
Restriction: fw: 9.76563 micros
Interpolation + add: 11.7188 micros
VCycle: 644.531 micros

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 13.6719 micros
Restriction: fw: 9.76563 micros
Interpolation + add: 11.7188 micros
VCycle: 644.531 micros

BENCHMARK
Jacobi iteration: 13.6719 micros
Residual calculation: 13.6719 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 13.6719 micros
VCycle: 644.531 micros

------------------
-- size: 31
BENCHMARK
Jacobi iteration: 13.3929 micros
Residual calculation: 13.3929 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 13.3929 micros
VCycle: 803.571 micros

BENCHMARK
Jacobi iteration: 11.1607 micros
Residual calculation: 13.3929 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 13.3929 micros
VCycle: 825.893 micros

BENCHMARK
Jacobi iteration: 13.3929 micros
Residual calculation: 11.1607 micros
Restriction: fw: 11.1607 micros
Interpolation + add: 11.1607 micros
VCycle: 825.893 micros

------------------
-- size: 63
BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 10.4167 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 13.0208 micros
VCycle: 989.583 micros

BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 13.0208 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 13.0208 micros
VCycle: 989.583 micros

BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 13.0208 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 13.0208 micros
VCycle: 989.583 micros

------------------
-- size: 127
BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 2093.75 micros

BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 12.5 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 2093.75 micros

BENCHMARK
Jacobi iteration: 12.5 micros
Residual calculation: 15.625 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 2062.5 micros

------------------
-- size: 255
BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 11.7188 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 15.625 micros
VCycle: 8398.44 micros

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 11.7188 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 11.7188 micros
VCycle: 9101.56 micros

BENCHMARK
Jacobi iteration: 11.7188 micros
Residual calculation: 15.625 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 11.7188 micros
VCycle: 9101.56 micros

------------------
-- size: 511
BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 10.4167 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 10.4167 micros
VCycle: 34583.3 micros

BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 26.0417 micros
Restriction: fw: 20.8333 micros
Interpolation + add: 26.0417 micros
VCycle: 36510.4 micros

BENCHMARK
Jacobi iteration: 26.0417 micros
Residual calculation: 20.8333 micros
Restriction: fw: 26.0417 micros
Interpolation + add: 20.8333 micros
VCycle: 36510.4 micros

------------------
-- size: 1023
BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 7.8125 micros
Restriction: fw: 15.625 micros
Interpolation + add: 7.8125 micros
VCycle: 136172 micros

BENCHMARK
Jacobi iteration: 78.125 micros
Residual calculation: 70.3125 micros
Restriction: fw: 78.125 micros
Interpolation + add: 78.125 micros
VCycle: 145469 micros

BENCHMARK
Jacobi iteration: 78.125 micros
Residual calculation: 78.125 micros
Restriction: fw: 78.125 micros
Interpolation + add: 70.3125 micros
VCycle: 145547 micros

------------------
-- size: 2047

During the last [2047] pass, the video driver resets itself -- a short blank screen flash -- and the program terminates the execution.

Anarchist4000
01-Jun-2007, 11:06
Opteron 165@2.25GHz
1900XT-512MB
Vista32
Cat 7.5

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 15.1358 micros
Residual calculation: 15.331 micros
Restriction: fw: 10.9368 micros
Interpolation + add: 15.2334 micros
VCycle: 333.963 micros


BENCHMARK
Jacobi iteration: 15.2334 micros
Residual calculation: 15.2334 micros
Restriction: fw: 10.9368 micros
Interpolation + add: 15.1358 micros
VCycle: 333.963 micros


BENCHMARK
Jacobi iteration: 15.3311 micros
Residual calculation: 15.2334 micros
Restriction: fw: 10.9368 micros
Interpolation + add: 15.331 micros
VCycle: 332.987 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 15.19 micros
Residual calculation: 15.2986 micros
Restriction: fw: 10.9584 micros
Interpolation + add: 15.2986 micros
VCycle: 543.584 micros


BENCHMARK
Jacobi iteration: 15.2984 micros
Residual calculation: 15.19 micros
Restriction: fw: 11.067 micros
Interpolation + add: 15.407 micros
VCycle: 542.5 micros


BENCHMARK
Jacobi iteration: 15.2984 micros
Residual calculation: 15.407 micros
Restriction: fw: 11.067 micros
Interpolation + add: 15.2986 micros
VCycle: 544.67 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 15.0138 micros
Residual calculation: 15.2578 micros
Restriction: fw: 10.8636 micros
Interpolation + add: 15.0136 micros
VCycle: 753.126 micros


BENCHMARK
Jacobi iteration: 15.2579 micros
Residual calculation: 15.2578 micros
Restriction: fw: 10.9856 micros
Interpolation + add: 15.0138 micros
VCycle: 750.684 micros


BENCHMARK
Jacobi iteration: 15.2579 micros
Residual calculation: 15.2578 micros
Restriction: fw: 10.9856 micros
Interpolation + add: 15.1358 micros
VCycle: 753.126 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 15.066 micros
Residual calculation: 15.2054 micros
Restriction: fw: 11.0206 micros
Interpolation + add: 15.066 micros
VCycle: 958.364 micros


BENCHMARK
Jacobi iteration: 15.066 micros
Residual calculation: 15.2054 micros
Restriction: fw: 10.881 micros
Interpolation + add: 15.066 micros
VCycle: 965.34 micros


BENCHMARK
Jacobi iteration: 15.2054 micros
Residual calculation: 15.066 micros
Restriction: fw: 11.0206 micros
Interpolation + add: 15.2054 micros
VCycle: 965.34 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 15.4612 micros
Residual calculation: 15.2985 micros
Restriction: fw: 10.9043 micros
Interpolation + add: 15.1357 micros
VCycle: 1176.68 micros


BENCHMARK
Jacobi iteration: 15.2985 micros
Residual calculation: 15.4612 micros
Restriction: fw: 10.9043 micros
Interpolation + add: 15.2985 micros
VCycle: 1178.31 micros


BENCHMARK
Jacobi iteration: 15.2985 micros
Residual calculation: 15.2985 micros
Restriction: fw: 10.9042 micros
Interpolation + add: 15.4613 micros
VCycle: 1181.57 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 15.4288 micros
Residual calculation: 15.038 micros
Restriction: fw: 11.718 micros
Interpolation + add: 15.8194 micros
VCycle: 2187.36 micros


BENCHMARK
Jacobi iteration: 15.0382 micros
Residual calculation: 14.8428 micros
Restriction: fw: 10.9368 micros
Interpolation + add: 15.2334 micros
VCycle: 2210.8 micros


BENCHMARK
Jacobi iteration: 15.0382 micros
Residual calculation: 15.2334 micros
Restriction: fw: 10.9368 micros
Interpolation + add: 15.038 micros
VCycle: 2216.66 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 15.1358 micros
Residual calculation: 15.38 micros
Restriction: fw: 10.7415 micros
Interpolation + add: 15.1358 micros
VCycle: 8815.35 micros


BENCHMARK
Jacobi iteration: 15.1358 micros
Residual calculation: 15.1358 micros
Restriction: fw: 10.9857 micros
Interpolation + add: 15.1358 micros
VCycle: 9594.11 micros


BENCHMARK
Jacobi iteration: 15.1358 micros
Residual calculation: 15.3798 micros
Restriction: fw: 10.7415 micros
Interpolation + add: 14.8918 micros
VCycle: 9594.11 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 14.973 micros
Residual calculation: 15.2983 micros
Restriction: fw: 11.067 micros
Interpolation + add: 14.973 micros
VCycle: 36472.3 micros


BENCHMARK
Jacobi iteration: 24.738 micros
Residual calculation: 25.389 micros
Restriction: fw: 24.738 micros
Interpolation + add: 25.0633 micros
VCycle: 38750.8 micros


BENCHMARK
Jacobi iteration: 25.0637 micros
Residual calculation: 24.738 micros
Restriction: fw: 24.738 micros
Interpolation + add: 25.389 micros
VCycle: 38747.5 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 15.624 micros
Residual calculation: 15.136 micros
Restriction: fw: 10.7415 micros
Interpolation + add: 15.1355 micros
VCycle: 144107 micros


BENCHMARK
Jacobi iteration: 83.0025 micros
Residual calculation: 83.0025 micros
Restriction: fw: 83.0025 micros
Interpolation + add: 86.9085 micros
VCycle: 154433 micros


BENCHMARK
Jacobi iteration: 83.0025 micros
Residual calculation: 83.4905 micros
Restriction: fw: 90.8145 micros
Interpolation + add: 83.491 micros
VCycle: 154385 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 14.648 micros
Residual calculation: 15.624 micros
Restriction: fw: 9.765 micros
Interpolation + add: 15.624 micros
VCycle: 7333.51 micros


BENCHMARK
Jacobi iteration: 7481.94 micros
Residual calculation: 279.279 micros
Restriction: fw: 278.303 micros
Interpolation + add: 280.255 micros
VCycle: 92894.4 micros


BENCHMARK
Jacobi iteration: 8299.27 micros
Residual calculation: 288.068 micros
Restriction: fw: 277.326 micros
Interpolation + add: 345.681 micros
VCycle: 94691.2 micros



Actually made it through all of the tests and the results seem reasonable. The newest drivers really seem to help because it was blank before.

PeterT
01-Jun-2007, 11:32
Peter, can you explain the "0 micros" results on several of my results (starting at size 255)?No. In fact, for most people, all the non-vcycle results after 255 are more "wrong" than right, and I have no idea why that is the case. I thought it was a problem with my measurement method, so I switched from using the standard C "clock()" method to kernel32's "GetSystemTime" for the new version, but that doesn't seem to have had any effect. I'm open to and happy about any ideas or suggestions regarding that. It's not deal-breaking, as the most important measurement by far is the V-cycle time, but it would still be interesting to know what's going on there.

At least the V-cycle times are mostly reasonable - except on ATI cards at 2047, where they have to be wrong. Again, I thought the problem was my measurement method, but apparently it is something more weird.

Here's Microsoft's description of this timeout mechanism:
http://www.microsoft.com/whdc/device/display/wddm_timeout.mspx
Under XP the timeout was 15 seconds, and resulted in a BSOD with error code 0xEA.Thanks. Sadly this still doesn't define what a "particular task" is that shouldn't take longer than t seconds. If it's just one operation, then my benchmark shouldn't run any risk at all of violating that limit.

Davros
01-Jun-2007, 12:29
7600gt standard clocks 83.90
p4 northwood 3ghz
1024mb ram
win xp

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 7.8125 micros
Residual calculation: 6.25 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 6.25 micros
VCycle: 609.375 micros


BENCHMARK
Jacobi iteration: 6.25 micros
Residual calculation: 6.25 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 6.25 micros
VCycle: 609.375 micros


BENCHMARK
Jacobi iteration: 6.25 micros
Residual calculation: 6.25 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 6.25 micros
VCycle: 625 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 6.94444 micros
Residual calculation: 6.94444 micros
Restriction: fw: 6.94444 micros
Interpolation + add: 6.94444 micros
VCycle: 1024.31 micros


BENCHMARK
Jacobi iteration: 6.94444 micros
Residual calculation: 6.94444 micros
Restriction: fw: 6.94444 micros
Interpolation + add: 6.94444 micros
VCycle: 1024.31 micros


BENCHMARK
Jacobi iteration: 6.94444 micros
Residual calculation: 6.94444 micros
Restriction: fw: 6.94444 micros
Interpolation + add: 6.94444 micros
VCycle: 1024.31 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 5.85938 micros
Residual calculation: 5.85938 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 7.8125 micros
VCycle: 1484.38 micros


BENCHMARK
Jacobi iteration: 5.85938 micros
Residual calculation: 5.85938 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 7.8125 micros
VCycle: 1484.38 micros


BENCHMARK
Jacobi iteration: 5.85938 micros
Residual calculation: 5.85938 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 7.8125 micros
VCycle: 1503.91 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 6.69643 micros
Residual calculation: 6.69643 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 6.69643 micros
VCycle: 1875 micros


BENCHMARK
Jacobi iteration: 6.69643 micros
Residual calculation: 8.92857 micros
Restriction: fw: 6.69643 micros
Interpolation + add: 6.69643 micros
VCycle: 1875 micros


BENCHMARK
Jacobi iteration: 8.92857 micros
Residual calculation: 6.69643 micros
Restriction: fw: 6.69643 micros
Interpolation + add: 6.69643 micros
VCycle: 1875 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 31.25 micros
Residual calculation: 31.25 micros
Restriction: fw: 15.625 micros
Interpolation + add: 18.2292 micros
VCycle: 2395.83 micros


BENCHMARK
Jacobi iteration: 31.25 micros
Residual calculation: 28.6458 micros
Restriction: fw: 18.2292 micros
Interpolation + add: 18.2292 micros
VCycle: 2395.83 micros


BENCHMARK
Jacobi iteration: 28.6458 micros
Residual calculation: 31.25 micros
Restriction: fw: 15.625 micros
Interpolation + add: 18.2292 micros
VCycle: 2395.83 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 121.875 micros
Residual calculation: 118.75 micros
Restriction: fw: 112.5 micros
Interpolation + add: 87.5 micros
VCycle: 3031.25 micros


BENCHMARK
Jacobi iteration: 121.875 micros
Residual calculation: 121.875 micros
Restriction: fw: 112.5 micros
Interpolation + add: 87.5 micros
VCycle: 3031.25 micros


BENCHMARK
Jacobi iteration: 121.875 micros
Residual calculation: 118.75 micros
Restriction: fw: 112.5 micros
Interpolation + add: 87.5 micros
VCycle: 3000 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 496.094 micros
Residual calculation: 496.094 micros
Restriction: fw: 312.5 micros
Interpolation + add: 289.063 micros
VCycle: 5820.31 micros


BENCHMARK
Jacobi iteration: 496.094 micros
Residual calculation: 496.094 micros
Restriction: fw: 312.5 micros
Interpolation + add: 289.063 micros
VCycle: 5820.31 micros


BENCHMARK
Jacobi iteration: 500 micros
Residual calculation: 496.094 micros
Restriction: fw: 312.5 micros
Interpolation + add: 289.063 micros
VCycle: 5820.31 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 2041.67 micros
Residual calculation: 2041.67 micros
Restriction: fw: 1463.54 micros
Interpolation + add: 1218.75 micros
VCycle: 23125 micros


BENCHMARK
Jacobi iteration: 2046.88 micros
Residual calculation: 2041.67 micros
Restriction: fw: 1458.33 micros
Interpolation + add: 1218.75 micros
VCycle: 23177.1 micros


BENCHMARK
Jacobi iteration: 2052.08 micros
Residual calculation: 2041.67 micros
Restriction: fw: 1458.33 micros
Interpolation + add: 1223.96 micros
VCycle: 23125 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 8804.69 micros
Residual calculation: 9437.5 micros
Restriction: fw: 7312.5 micros
Interpolation + add: 5968.75 micros
VCycle: 114453 micros


BENCHMARK
Jacobi iteration: 8781.25 micros
Residual calculation: 9421.88 micros
Restriction: fw: 7335.94 micros
Interpolation + add: 5968.75 micros
VCycle: 114453 micros


BENCHMARK
Jacobi iteration: 8765.63 micros
Residual calculation: 9453.13 micros
Restriction: fw: 7328.13 micros
Interpolation + add: 5976.56 micros
VCycle: 114375 micros



------------------
-- size: 2047
GLFilterStep -- Error doing makeCurrent on target.

PeterT
01-Jun-2007, 16:09
Those are some of the most well-behaved results I've seen so far (just like those on my system): Scaling factor around 4 across the board once GPU limitations set in, 1 before. VCycle times growing by a fixed additive factor until GPU limitations hit. No strange low results / zeros to be seen. Why can't all systems play so nice? Would make my life a lot easier ;)

BRiT
01-Jun-2007, 17:20
No. In fact, for most people, all the non-vcycle results after 255 are more "wrong" than right, and I have no idea why that is the case. I thought it was a problem with my measurement method, so I switched from using the standard C "clock()" method to kernel32's "GetSystemTime" for the new version, but that doesn't seem to have had any effect. I'm open to and happy about any ideas or suggestions regarding that. It's not deal-breaking, as the most important measurement by far is the V-cycle time, but it would still be interesting to know what's going on there.

I'm curious if it's a matter of processor timestamps being off from one another on multi-core systems. It shouldn't be using something like the RDTSC instruction which can and typically does vary from core to core. Under WinXP MS/AMD released a fix that keeps both cores' timestamp counter in lock-step.

neliz
01-Jun-2007, 21:38
Hi Pete,

Same 2900, this time with the official 7.5's (still crashes the display driver)

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 20.3125 micros
Residual calculation: 20.3125 micros
Restriction: fw: 14.0625 micros
Interpolation + add: 20.3125 micros
VCycle: 468.75 micros


BENCHMARK
Jacobi iteration: 20.3125 micros
Residual calculation: 21.875 micros
Restriction: fw: 14.0625 micros
Interpolation + add: 21.875 micros
VCycle: 453.125 micros


BENCHMARK
Jacobi iteration: 20.3125 micros
Residual calculation: 20.3125 micros
Restriction: fw: 15.625 micros
Interpolation + add: 20.3125 micros
VCycle: 484.375 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 20.8333 micros
Restriction: fw: 15.625 micros
Interpolation + add: 19.0972 micros
VCycle: 781.25 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 20.8333 micros
Restriction: fw: 13.8889 micros
Interpolation + add: 22.5694 micros
VCycle: 763.889 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 19.0972 micros
Restriction: fw: 15.625 micros
Interpolation + add: 20.8333 micros
VCycle: 781.25 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 21.4844 micros
Residual calculation: 19.5313 micros
Restriction: fw: 15.625 micros
Interpolation + add: 19.5313 micros
VCycle: 1074.22 micros


BENCHMARK
Jacobi iteration: 21.4844 micros
Residual calculation: 21.4844 micros
Restriction: fw: 13.6719 micros
Interpolation + add: 21.4844 micros
VCycle: 1074.22 micros


BENCHMARK
Jacobi iteration: 19.5313 micros
Residual calculation: 23.4375 micros
Restriction: fw: 13.6719 micros
Interpolation + add: 21.4844 micros
VCycle: 1074.22 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 20.0893 micros
Residual calculation: 20.0893 micros
Restriction: fw: 15.625 micros
Interpolation + add: 20.0893 micros
VCycle: 1361.61 micros


BENCHMARK
Jacobi iteration: 20.0893 micros
Residual calculation: 20.0893 micros
Restriction: fw: 15.625 micros
Interpolation + add: 20.0893 micros
VCycle: 1406.25 micros


BENCHMARK
Jacobi iteration: 22.3214 micros
Residual calculation: 20.0893 micros
Restriction: fw: 15.625 micros
Interpolation + add: 20.0893 micros
VCycle: 1406.25 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 20.8333 micros
Restriction: fw: 13.0208 micros
Interpolation + add: 20.8333 micros
VCycle: 1692.71 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 20.8333 micros
Restriction: fw: 13.0208 micros
Interpolation + add: 20.8333 micros
VCycle: 1692.71 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 20.8333 micros
Restriction: fw: 15.625 micros
Interpolation + add: 18.2292 micros
VCycle: 1770.83 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 21.875 micros
Restriction: fw: 15.625 micros
Interpolation + add: 18.75 micros
VCycle: 2000 micros


BENCHMARK
Jacobi iteration: 21.875 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 18.75 micros
VCycle: 1968.75 micros


BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 21.875 micros
Restriction: fw: 15.625 micros
Interpolation + add: 18.75 micros
VCycle: 1968.75 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 23.4375 micros
Residual calculation: 19.5313 micros
Restriction: fw: 15.625 micros
Interpolation + add: 19.5313 micros
VCycle: 4335.94 micros


BENCHMARK
Jacobi iteration: 19.5313 micros
Residual calculation: 23.4375 micros
Restriction: fw: 15.625 micros
Interpolation + add: 19.5313 micros
VCycle: 4375 micros


BENCHMARK
Jacobi iteration: 19.5313 micros
Residual calculation: 23.4375 micros
Restriction: fw: 15.625 micros
Interpolation + add: 19.5313 micros
VCycle: 4375 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 20.8333 micros
Restriction: fw: 15.625 micros
Interpolation + add: 20.8333 micros
VCycle: 20052.1 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 20.8333 micros
Restriction: fw: 10.4167 micros
Interpolation + add: 20.8333 micros
VCycle: 21927.1 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 20.8333 micros
VCycle: 21927.1 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 23.4375 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 23.4375 micros
VCycle: 81484.4 micros


BENCHMARK
Jacobi iteration: 39.0625 micros
Residual calculation: 39.0625 micros
Restriction: fw: 39.0625 micros
Interpolation + add: 46.875 micros
VCycle: 89140.6 micros


BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 39.0625 micros
Restriction: fw: 39.0625 micros
Interpolation + add: 39.0625 micros
VCycle: 89375 micros



------------------
-- size: 2047

Anarchist4000
01-Jun-2007, 21:58
I tried setting the affinity and priority and while it stabilized the times a bit the numbers are still off.

What exactly are you measuring for the first 4 tests under each benchmark? The VCycle numbers I'm guessing are about right and the first 4 are the ones that are off?

Davros
02-Jun-2007, 00:08
Those are some of the most well-behaved results I've seen so far (just like those on my system): Scaling factor around 4 across the board once GPU limitations set in, 1 before. VCycle times growing by a fixed additive factor until GPU limitations hit. No strange low results / zeros to be seen. Why can't all systems play so nice? Would make my life a lot easier ;)

was that adressed to me ?

Silent_Buddha
02-Jun-2007, 01:25
Interesting, I'll test with the new 7.5's here in a moment. Although I'm not expecting it to do any differently than the 8.38 release 7 betas. Hopefully the FAN issue is fixed however.

Also, I mention how on my card at least the desktop in Vista 64 goes completely unresponsive at certain points during the 2047^2 test. Is it possible that it isn't generating a "time" while this is happening?

It doesn't affect my system clock (IE - my system clock isn't suddenly running minutes slower than my other computer after multiple runs) so I'm not sure why it isn't getting a proper time on it.

Regards,
SB

XMAN26
02-Jun-2007, 01:27
Hi enthusiast community, I once again turn to you for your help. If you own one of these cards (I'm especially interested in the new ATI/AMD GPUs of course), use a windows OS, and have a bit of time then please follow the instructions below.

Please download this: (611 KiB)
http://peter.metaclassofnil.com/uni/mgbench310507.zip
It extracts to a folder called "mgbench".
Then close all GPU-intensive or high-priority CPU apps you may have running and run the .exe contained in the folder. Now, this can take 3 to ? (probably around 10) minutes. After it has finished, it should have generated a file called "GPUMGbench.txt". Either send me that file by email (peter@metaclassofnil.com), or post its contents here, or send it via forum PM. However, please don't forget to mention your graphics card type in detail, and if you use non-standard clock speeds. Driver version would also be nice but is not strictly required.

Some important points:
- If you're worried about this being malware, you can see in my posting history that I've already done something like this 2 times before. I've also been a member of this forum for over 5 years.
- Better not try to use the values reported by this program to compare GPU performance. It's very specific, somewhat overhead-limited and probably buggy.
- WARNING: If you're not 100% sure that your GPU is stable, save your data or return it to standard clocks before running this. After getting some reports of crashes I found out that this tool is a very good "GPU stability test" - even configurations that make it through hours of 3Dmark loops may well BSoD running this program. I even had one card crash at stock speeds and only be able to complete the run downclocked.

Thanks for your help!

[edit]
I updated the link above to point to the new version, for more information see my post later in this thread. (http://forum.beyond3d.com/showpost.php?p=1013723&postcount=61)


Mine stops in the cmd window stating that OGL windows needs to be focused. How the hell do you do that? 8800GTS BFG OC, X2 3800 @ 2.5, 158.18(9), XPPro Dell E228WFP 22" WS

neliz
02-Jun-2007, 01:39
click on the "graphical" (output) screen (the one with the OGL logo) to make it "active" and press enter... otherwise the dos window will be active and pressing enter doesn't do anything..

this post from peter shows what the window looks like

I have finished a new version. It will now start with a trial run to see if the right calculations are carried out. The procedure is as follows: a 511x511 system is solved with a few V-cycles, and the result is displayed. The program then asks you to either abort the benchmark or continue.

The output shown should look like this: (It's a height-colored representation of the solution)
http://peter.metaclassofnil.com/uni/result.jpg

Additionally, I changed the time measurement method to hopefully be more accurate. Please download the new version and try it:
http://peter.metaclassofnil.com/uni/mgbench310507.zip

Thanks for this report. "A little over twice as long" is exactly as long as 2047^2 should take - the workload is a little over 4 times as high, but I only run half as many iterations. This means that most likely my time measurement method was at fault, which inspired the changes in the new version above.

Silent_Buddha
02-Jun-2007, 03:58
I swear it gets faster everytime I run this.

With official 7.5 catalyst drivers, same system as previously.


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 11.1321 micros
Residual calculation: 10.8391 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 246.078 micros


BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.9368 micros
Restriction: fw: 7.9097 micros
Interpolation + add: 10.6438 micros
VCycle: 246.078 micros


BENCHMARK
Jacobi iteration: 10.5462 micros
Residual calculation: 10.6439 micros
Restriction: fw: 7.9096 micros
Interpolation + add: 10.7415 micros
VCycle: 247.055 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 11.067 micros
Residual calculation: 11.067 micros
Restriction: fw: 7.812 micros
Interpolation + add: 11.067 micros
VCycle: 403.62 micros


BENCHMARK
Jacobi iteration: 10.9584 micros
Residual calculation: 11.067 micros
Restriction: fw: 8.029 micros
Interpolation + add: 10.9586 micros
VCycle: 404.704 micros


BENCHMARK
Jacobi iteration: 11.067 micros
Residual calculation: 10.85 micros
Restriction: fw: 7.92044 micros
Interpolation + add: 10.9586 micros
VCycle: 404.704 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 10.8636 micros
Residual calculation: 10.8635 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.8636 micros
VCycle: 557.825 micros


BENCHMARK
Jacobi iteration: 10.6194 micros
Residual calculation: 10.6195 micros
Restriction: fw: 7.934 micros
Interpolation + add: 10.7415 micros
VCycle: 556.605 micros


BENCHMARK
Jacobi iteration: 10.8636 micros
Residual calculation: 10.8635 micros
Restriction: fw: 7.69 micros
Interpolation + add: 10.8635 micros
VCycle: 557.826 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 10.881 micros
Residual calculation: 10.602 micros
Restriction: fw: 7.95157 micros
Interpolation + add: 11.0204 micros
VCycle: 710.056 micros


BENCHMARK
Jacobi iteration: 10.881 micros
Residual calculation: 10.602 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.881 micros
VCycle: 707.266 micros


BENCHMARK
Jacobi iteration: 11.0204 micros
Residual calculation: 10.881 micros
Restriction: fw: 7.95157 micros
Interpolation + add: 10.881 micros
VCycle: 705.87 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.9042 micros
Restriction: fw: 7.97483 micros
Interpolation + add: 10.5787 micros
VCycle: 862.575 micros


BENCHMARK
Jacobi iteration: 10.9042 micros
Residual calculation: 10.5788 micros
Restriction: fw: 7.97467 micros
Interpolation + add: 10.7415 micros
VCycle: 860.948 micros


BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.7415 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 867.457 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 11.5228 micros
Residual calculation: 11.718 micros
Restriction: fw: 8.7884 micros
Interpolation + add: 11.5228 micros
VCycle: 1083.91 micros


BENCHMARK
Jacobi iteration: 10.9368 micros
Residual calculation: 10.7414 micros
Restriction: fw: 7.812 micros
Interpolation + add: 11.1322 micros
VCycle: 1050.71 micros


BENCHMARK
Jacobi iteration: 10.7414 micros
Residual calculation: 10.9368 micros
Restriction: fw: 8.0074 micros
Interpolation + add: 10.9368 micros
VCycle: 1037.04 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.4975 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 4350.31 micros


BENCHMARK
Jacobi iteration: 10.9857 micros
Residual calculation: 10.4972 micros
Restriction: fw: 7.812 micros
Interpolation + add: 10.7415 micros
VCycle: 4508.99 micros


BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.7415 micros
Restriction: fw: 7.56775 micros
Interpolation + add: 10.7415 micros
VCycle: 4609.08 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 10.7417 micros
Residual calculation: 10.7413 micros
Restriction: fw: 8.13767 micros
Interpolation + add: 10.7413 micros
VCycle: 20812.5 micros


BENCHMARK
Jacobi iteration: 11.718 micros
Residual calculation: 11.718 micros
Restriction: fw: 12.0433 micros
Interpolation + add: 11.718 micros
VCycle: 22501.8 micros


BENCHMARK
Jacobi iteration: 16.275 micros
Residual calculation: 10.7417 micros
Restriction: fw: 8.13733 micros
Interpolation + add: 10.416 micros
VCycle: 22475.8 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 10.7415 micros
Residual calculation: 10.253 micros
Restriction: fw: 7.812 micros
Interpolation + add: 13.671 micros
VCycle: 82543.5 micros


BENCHMARK
Jacobi iteration: 41.5015 micros
Residual calculation: 40.5245 micros
Restriction: fw: 36.1305 micros
Interpolation + add: 42.478 micros
VCycle: 89784.3 micros


BENCHMARK
Jacobi iteration: 41.013 micros
Residual calculation: 40.525 micros
Restriction: fw: 36.1305 micros
Interpolation + add: 42.4775 micros
VCycle: 89774.5 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 16.6 micros
Residual calculation: 15.624 micros
Restriction: fw: 11.718 micros
Interpolation + add: 15.624 micros
VCycle: 1699.11 micros


BENCHMARK
Jacobi iteration: 6390.22 micros
Residual calculation: 3730.23 micros
Restriction: fw: 3290.8 micros
Interpolation + add: 1469.63 micros
VCycle: 47155.2 micros


BENCHMARK
Jacobi iteration: 6320.89 micros
Residual calculation: 3620.86 micros
Restriction: fw: 3337.68 micros
Interpolation + add: 1327.06 micros
VCycle: 47291.9 micros


Also, I'm not sure why but 2047^2 actually seems faster now than the first time I successfully ran it in mgbench2.

Regards,
SB

XMAN26
02-Jun-2007, 04:00
My results
X2 3800+ @2.5
8800GTS BFGOC 640MB
2GB Ram
XPPro
158.19
GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 1.5625 micros
VCycle: 500 micros


BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 500 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 1.5625 micros
VCycle: 500 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 3.47222 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 3.47222 micros
VCycle: 833.333 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 3.47222 micros
VCycle: 815.972 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 833.333 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 1.95313 micros
VCycle: 1171.88 micros


BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 3.90625 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1171.88 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 3.90625 micros
VCycle: 1171.88 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 4.46429 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1517.86 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1517.86 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 4.46429 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1517.86 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 2.60417 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 2.60417 micros
VCycle: 1875 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1875 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1875 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 15.625 micros
Restriction: fw: 15.625 micros
Interpolation + add: 9.375 micros
VCycle: 2281.25 micros


BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 12.5 micros
Interpolation + add: 12.5 micros
VCycle: 2281.25 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 9.375 micros
VCycle: 2281.25 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 78.125 micros
Residual calculation: 70.3125 micros
Restriction: fw: 66.4063 micros
Interpolation + add: 46.875 micros
VCycle: 2890.63 micros


BENCHMARK
Jacobi iteration: 74.2188 micros
Residual calculation: 70.3125 micros
Restriction: fw: 66.4063 micros
Interpolation + add: 46.875 micros
VCycle: 2929.69 micros


BENCHMARK
Jacobi iteration: 78.125 micros
Residual calculation: 70.3125 micros
Restriction: fw: 66.4063 micros
Interpolation + add: 46.875 micros
VCycle: 2890.63 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 281.25 micros
Residual calculation: 281.25 micros
Restriction: fw: 265.625 micros
Interpolation + add: 177.083 micros
VCycle: 5364.58 micros


BENCHMARK
Jacobi iteration: 281.25 micros
Residual calculation: 281.25 micros
Restriction: fw: 265.625 micros
Interpolation + add: 171.875 micros
VCycle: 5416.67 micros


BENCHMARK
Jacobi iteration: 281.25 micros
Residual calculation: 281.25 micros
Restriction: fw: 265.625 micros
Interpolation + add: 177.083 micros
VCycle: 5364.58 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 1195.31 micros
Residual calculation: 1093.75 micros
Restriction: fw: 1210.94 micros
Interpolation + add: 765.625 micros
VCycle: 22968.8 micros


BENCHMARK
Jacobi iteration: 1195.31 micros
Residual calculation: 1078.13 micros
Restriction: fw: 1218.75 micros
Interpolation + add: 765.625 micros
VCycle: 22890.6 micros


BENCHMARK
Jacobi iteration: 1195.31 micros
Residual calculation: 1093.75 micros
Restriction: fw: 1195.31 micros
Interpolation + add: 773.438 micros
VCycle: 22890.6 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 3796.88 micros
Restriction: fw: 5656.25 micros
Interpolation + add: 4640.63 micros
VCycle: 139844 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 3828.13 micros
Restriction: fw: 5625 micros
Interpolation + add: 4562.5 micros
VCycle: 140469 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 3843.75 micros
Restriction: fw: 5640.63 micros
Interpolation + add: 4625 micros
VCycle: 139375 micros

PeterT
02-Jun-2007, 14:38
I'm curious if it's a matter of processor timestamps being off from one another on multi-core systems. It shouldn't be using something like the RDTSC instruction which can and typically does vary from core to core. Under WinXP MS/AMD released a fix that keeps both cores' timestamp counter in lock-step.I also thought about this, that's why I switched from clock() to GetSystemTime(). I believe the latter should be synchronized. Also, I personally test on a dual-core system and don't experience the strange numbers.

What exactly are you measuring for the first 4 tests under each benchmark? The VCycle numbers I'm guessing are about right and the first 4 are the ones that are off?Basically the 4 main operations of the multigrid solver are Smoothing, Residual Calculation, Restriction and Interpolation. Each of those is used in the V-cycle. The first 4 tests are designed to test them individually -- this could be helpful in determining why one system is faster than another in V-cycle times (the true measurement of performance). I still don't understand why the others are off on some systems.

was that adressed to me ?Yes, your results are exactly as they "should be".

pegisys
03-Jun-2007, 01:09
I ran the test and I got a BSOD :sad: I did get some numbers though

I'm using winxp pro on a 1900gt and A64 3500


GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 4.6875 micros
Residual calculation: 4.6875 micros
Restriction: fw: 6.25 micros
Interpolation + add: 4.6875 micros
VCycle: 515.625 micros


BENCHMARK
Jacobi iteration: 4.6875 micros
Residual calculation: 4.6875 micros
Restriction: fw: 4.6875 micros
Interpolation + add: 4.6875 micros
VCycle: 515.625 micros


BENCHMARK
Jacobi iteration: 4.6875 micros
Residual calculation: 4.6875 micros
Restriction: fw: 6.25 micros
Interpolation + add: 4.6875 micros
VCycle: 515.625 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 5.20833 micros
VCycle: 850.694 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 5.20833 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 5.20833 micros
VCycle: 850.694 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 3.47222 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 5.20833 micros
VCycle: 850.694 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 5.85938 micros
Residual calculation: 3.90625 micros
Restriction: fw: 5.85938 micros
Interpolation + add: 3.90625 micros
VCycle: 1191.41 micros


BENCHMARK
Jacobi iteration: 5.85938 micros
Residual calculation: 3.90625 micros
Restriction: fw: 5.85938 micros
Interpolation + add: 3.90625 micros
VCycle: 1191.41 micros


BENCHMARK
Jacobi iteration: 5.85938 micros
Residual calculation: 5.85938 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 5.85938 micros
VCycle: 1191.41 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 8.92857 micros
Residual calculation: 6.69643 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 6.69643 micros
VCycle: 1562.5 micros


BENCHMARK
Jacobi iteration: 8.92857 micros
Residual calculation: 6.69643 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 6.69643 micros
VCycle: 1562.5 micros


BENCHMARK
Jacobi iteration: 6.69643 micros
Residual calculation: 6.69643 micros
Restriction: fw: 8.92857 micros
Interpolation + add: 8.92857 micros
VCycle: 1540.18 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 18.2292 micros
Restriction: fw: 20.8333 micros
Interpolation + add: 18.2292 micros
VCycle: 1979.17 micros


BENCHMARK
Jacobi iteration: 20.8333 micros
Residual calculation: 18.2292 micros
Restriction: fw: 23.4375 micros
Interpolation + add: 15.625 micros
VCycle: 1979.17 micros


BENCHMARK
Jacobi iteration: 18.2292 micros
Residual calculation: 20.8333 micros
Restriction: fw: 20.8333 micros
Interpolation + add: 15.625 micros
VCycle: 2005.21 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 65.625 micros
Residual calculation: 65.625 micros
Restriction: fw: 81.25 micros
Interpolation + add: 46.875 micros
VCycle: 2781.25 micros


BENCHMARK
Jacobi iteration: 65.625 micros
Residual calculation: 65.625 micros
Restriction: fw: 78.125 micros
Interpolation + add: 46.875 micros
VCycle: 2781.25 micros


BENCHMARK
Jacobi iteration: 65.625 micros
Residual calculation: 68.75 micros
Restriction: fw: 78.125 micros
Interpolation + add: 46.875 micros
VCycle: 2781.25 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 214.844 micros
Residual calculation: 265.625 micros
Restriction: fw: 308.594 micros
Interpolation + add: 195.313 micros
VCycle: 5859.38 micros


BENCHMARK
Jacobi iteration: 386.719 micros
Residual calculation: 89.8438 micros
Restriction: fw: 308.594 micros
Interpolation + add: 199.219 micros
VCycle: 5898.44 micros


BENCHMARK
Jacobi iteration: 214.844 micros
Residual calculation: 261.719 micros
Restriction: fw: 308.594 micros
Interpolation + add: 199.219 micros
VCycle: 5859.38 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 671.875 micros
Residual calculation: 1026.04 micros
Restriction: fw: 1213.54 micros
Interpolation + add: 968.75 micros
VCycle: 23697.9 micros


BENCHMARK
Jacobi iteration: 677.083 micros
Residual calculation: 1026.04 micros
Restriction: fw: 1250 micros
Interpolation + add: 937.5 micros
VCycle: 23854.2 micros


BENCHMARK
Jacobi iteration: 671.875 micros
Residual calculation: 1026.04 micros
Restriction: fw: 1234.38 micros
Interpolation + add: 968.75 micros
VCycle: 23697.9 micros



------------------
-- size: 1023

Davros
03-Jun-2007, 10:40
"Yes, your results are exactly as they "should be"."

nice to know having my gfx card in a 4 lane pci-e slot isnt messing anything up...

Sxotty
03-Jun-2007, 15:53
The problem is not stability, afaik, but the driver model itself: if you let a graphics command run for too long, Windows will assume it hanged and it will ask the driver to reset itself. And yes, it's an horribly annoying and stupid problem! :(

Not on mine. I am pretty sure it over heated.

It started writing out the BSOD about a fault (this was on the last set btw 2047) but never finished before the computer rebooted. The display was corrupted on reboot as well, flecks of color and random stuff until after the POST...No permanent damage or anything :)

BlizzardOne
03-Jun-2007, 17:11
Re-ran lastest version of the program with Cat 7.5's:

C2D X6800 @ 2.93ghz
2GB PC6400 @ 800mhz
Win XP Pro
Crossfire'd X1950XT-X's

Ran it with Crossfire enabled and disabled, to see if there's any difference.. there are some differences, but not many, and not huge ones at that either.

Crossfire Disabled

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 359.375 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 359.375 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 359.375 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 590.278 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 607.639 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 590.278 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 820.313 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 839.844 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 1.95313 micros
VCycle: 839.844 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 4.46429 micros
Restriction: fw: 6.69643 micros
Interpolation + add: 6.69643 micros
VCycle: 1071.43 micros


BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 4.46429 micros
Restriction: fw: 6.69643 micros
Interpolation + add: 6.69643 micros
VCycle: 1071.43 micros


BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 4.46429 micros
Restriction: fw: 4.46429 micros
Interpolation + add: 8.92857 micros
VCycle: 1071.43 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 13.0208 micros
Restriction: fw: 15.625 micros
Interpolation + add: 7.8125 micros
VCycle: 1380.21 micros


BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 15.625 micros
Restriction: fw: 13.0208 micros
Interpolation + add: 10.4167 micros
VCycle: 1380.21 micros


BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 15.625 micros
Restriction: fw: 13.0208 micros
Interpolation + add: 10.4167 micros
VCycle: 1380.21 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 50 micros
Restriction: fw: 59.375 micros
Interpolation + add: 34.375 micros
VCycle: 1937.5 micros


BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 50 micros
Restriction: fw: 59.375 micros
Interpolation + add: 34.375 micros
VCycle: 1968.75 micros


BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 50 micros
Restriction: fw: 59.375 micros
Interpolation + add: 34.375 micros
VCycle: 1937.5 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 164.063 micros
Residual calculation: 199.219 micros
Restriction: fw: 234.375 micros
Interpolation + add: 148.438 micros
VCycle: 4609.38 micros


BENCHMARK
Jacobi iteration: 167.969 micros
Residual calculation: 199.219 micros
Restriction: fw: 230.469 micros
Interpolation + add: 152.344 micros
VCycle: 4609.38 micros


BENCHMARK
Jacobi iteration: 164.063 micros
Residual calculation: 195.313 micros
Restriction: fw: 234.375 micros
Interpolation + add: 148.438 micros
VCycle: 4648.44 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 515.625 micros
Residual calculation: 786.458 micros
Restriction: fw: 942.708 micros
Interpolation + add: 723.958 micros
VCycle: 18541.7 micros


BENCHMARK
Jacobi iteration: 515.625 micros
Residual calculation: 781.25 micros
Restriction: fw: 932.292 micros
Interpolation + add: 739.583 micros
VCycle: 18593.8 micros


BENCHMARK
Jacobi iteration: 510.417 micros
Residual calculation: 786.458 micros
Restriction: fw: 927.083 micros
Interpolation + add: 734.375 micros
VCycle: 18697.9 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 609.375 micros
Residual calculation: 2203.13 micros
Restriction: fw: 3757.81 micros
Interpolation + add: 3289.06 micros
VCycle: 96484.4 micros


BENCHMARK
Jacobi iteration: 601.563 micros
Residual calculation: 2132.81 micros
Restriction: fw: 3671.88 micros
Interpolation + add: 3289.06 micros
VCycle: 98125 micros


BENCHMARK
Jacobi iteration: 601.563 micros
Residual calculation: 2054.69 micros
Restriction: fw: 3679.69 micros
Interpolation + add: 3281.25 micros
VCycle: 98906.3 micros



------------------
-- size: 2047


Crossfire Enabled

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 375 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 375 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 3.125 micros
VCycle: 375 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 3.47222 micros
VCycle: 625 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 3.47222 micros
VCycle: 625 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 3.47222 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 3.47222 micros
VCycle: 625 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 3.90625 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 1.95313 micros
VCycle: 859.375 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 5.85938 micros
Interpolation + add: 1.95313 micros
VCycle: 859.375 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 3.90625 micros
VCycle: 859.375 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 4.46429 micros
Restriction: fw: 6.69643 micros
Interpolation + add: 6.69643 micros
VCycle: 1116.07 micros


BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 4.46429 micros
Restriction: fw: 6.69643 micros
Interpolation + add: 6.69643 micros
VCycle: 1116.07 micros


BENCHMARK
Jacobi iteration: 4.46429 micros
Residual calculation: 4.46429 micros
Restriction: fw: 6.69643 micros
Interpolation + add: 6.69643 micros
VCycle: 1116.07 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 13.0208 micros
Restriction: fw: 15.625 micros
Interpolation + add: 7.8125 micros
VCycle: 1432.29 micros


BENCHMARK
Jacobi iteration: 15.625 micros
Residual calculation: 13.0208 micros
Restriction: fw: 15.625 micros
Interpolation + add: 7.8125 micros
VCycle: 1432.29 micros


BENCHMARK
Jacobi iteration: 13.0208 micros
Residual calculation: 15.625 micros
Restriction: fw: 13.0208 micros
Interpolation + add: 10.4167 micros
VCycle: 1432.29 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 50 micros
Restriction: fw: 59.375 micros
Interpolation + add: 34.375 micros
VCycle: 2031.25 micros


BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 50 micros
Restriction: fw: 59.375 micros
Interpolation + add: 34.375 micros
VCycle: 2031.25 micros


BENCHMARK
Jacobi iteration: 46.875 micros
Residual calculation: 50 micros
Restriction: fw: 59.375 micros
Interpolation + add: 34.375 micros
VCycle: 2031.25 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 160.156 micros
Residual calculation: 199.219 micros
Restriction: fw: 238.281 micros
Interpolation + add: 148.438 micros
VCycle: 4648.44 micros


BENCHMARK
Jacobi iteration: 167.969 micros
Residual calculation: 195.313 micros
Restriction: fw: 234.375 micros
Interpolation + add: 148.438 micros
VCycle: 4648.44 micros


BENCHMARK
Jacobi iteration: 164.063 micros
Residual calculation: 195.313 micros
Restriction: fw: 234.375 micros
Interpolation + add: 152.344 micros
VCycle: 4648.44 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 505.208 micros
Residual calculation: 781.25 micros
Restriction: fw: 942.708 micros
Interpolation + add: 729.167 micros
VCycle: 18750 micros


BENCHMARK
Jacobi iteration: 500 micros
Residual calculation: 781.25 micros
Restriction: fw: 942.708 micros
Interpolation + add: 723.958 micros
VCycle: 18802.1 micros


BENCHMARK
Jacobi iteration: 515.625 micros
Residual calculation: 755.208 micros
Restriction: fw: 942.708 micros
Interpolation + add: 750 micros
VCycle: 18645.8 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 648.438 micros
Residual calculation: 2156.25 micros
Restriction: fw: 3523.44 micros
Interpolation + add: 3445.31 micros
VCycle: 97343.8 micros


BENCHMARK
Jacobi iteration: 554.688 micros
Residual calculation: 2023.44 micros
Restriction: fw: 3679.69 micros
Interpolation + add: 3359.38 micros
VCycle: 98906.3 micros


BENCHMARK
Jacobi iteration: 507.813 micros
Residual calculation: 2070.31 micros
Restriction: fw: 3679.69 micros
Interpolation + add: 3289.06 micros
VCycle: 99609.4 micros



------------------
-- size: 2047


No idea what it all means :)

PeterT
03-Jun-2007, 19:37
Here's a collection of most V-cycle results in this thread. Obviously wrong ones or ones that were very similar to any already present were excluded. (Next time I'll write a program for this, I just spent 2 hours copying & pasting numbers :/)

http://peter.metaclassofnil.com/uni/mgbenchresults.png

I don't want to make too many comments, just note that:
- Results below 255^2 are not likely to be GPU limited
- Apperantly NV's 7xxx Vista drivers are better than the 8xxx ones
- Most of those results probably use 2D clocks on cards that have such a thing
- In Vista, 2900XT and 8800GTX results are close, in XP the latter is 6 times faster
- 88 series cards are the only ones that can complete the 2047^2 test correctly

nyt
03-Jun-2007, 20:01
Hi,
I can't get anything but a BSoD (infinite loop) on my X1900XTX (AcceleroX2 fan), whatever the frequency (even 2D). It happens mostly at the end (size 2047). I have a taskmanager running on the side and can see the refresh stalling by periods of 5 to 10s at times or other times slowing down a lot. Eventually it freezes and 10s later, BSoD. This is on XP32/CAT7.5. What other test do you recommend to see if hardware is really bad? I have very few crashes in game even OC'd.

PeterT
03-Jun-2007, 20:18
If you don't get any GPU-related crashes in games I wouldn't worry. As discussed earlier in this thread, the crashes in this application may also be unrelated to stability. Especially if yours are independent of clock speed.

BRiT
03-Jun-2007, 21:03
This time with Nvidia 158.45 Vista Ultimate x64 drivers on the MSI 8800GTX HDOC, X2 4400+ @ 2.5Ghz, 2Gig DDR 250Mhz 3-4-4-8. Results seem inline/similar to previous results with 158.43.

As for why the numbers seem off for people with similar 8800GTX cards, driver, and OS, I can only guess it's perhaps due to the system CPU.

Here's a run but this time on a X3220 Core 2 Quad @ 2.7Ghz, 4Gig DDR2 900Mhz 5-4-4-12, still running the Nvidia 158.45 drivers for the MSI 8800 GTX HDOC on a fresh Vista Ultimate x64 install.

Still plenty of odd numbers for the first four timings, though VCycle seems +/- a few percentage off the previous numbers.

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 1.5625 micros
Restriction: fw: 3.125 micros
Interpolation + add: 1.5625 micros
VCycle: 6453.13 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 1.5625 micros
Restriction: fw: 3.125 micros
Interpolation + add: 1.5625 micros
VCycle: 6296.88 micros


BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 3.125 micros
Restriction: fw: 3.125 micros
Interpolation + add: 1.5625 micros
VCycle: 6375 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 10677.1 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 10642.4 micros


BENCHMARK
Jacobi iteration: 3.47222 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 11684 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 3.90625 micros
VCycle: 14824.2 micros


BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 3.90625 micros
Interpolation + add: 1.95313 micros
VCycle: 14941.4 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 3.90625 micros
VCycle: 14785.2 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 4.46429 micros
Interpolation + add: 2.23214 micros
VCycle: 19129.5 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 4.46429 micros
Interpolation + add: 2.23214 micros
VCycle: 19062.5 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 4.46429 micros
Interpolation + add: 2.23214 micros
VCycle: 18995.5 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 2.60417 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 23489.6 micros


BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 2.60417 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 23515.6 micros


BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 2.60417 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 23333.3 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 9.375 micros
Interpolation + add: 12.5 micros
VCycle: 29031.3 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 34.375 micros
Interpolation + add: 3.125 micros
VCycle: 28218.7 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 3.125 micros
Restriction: fw: 9.375 micros
Interpolation + add: 9.375 micros
VCycle: 27750 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 23.4375 micros
Restriction: fw: 42.9688 micros
Interpolation + add: 42.9688 micros
VCycle: 33476.6 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 19.5313 micros
Restriction: fw: 46.875 micros
Interpolation + add: 42.9688 micros
VCycle: 33554.7 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 23.4375 micros
Restriction: fw: 42.9688 micros
Interpolation + add: 42.9688 micros
VCycle: 33593.8 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 0 micros
Restriction: fw: 145.833 micros
Interpolation + add: 197.917 micros
VCycle: 42239.6 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 0 micros
Restriction: fw: 145.833 micros
Interpolation + add: 197.917 micros
VCycle: 42187.5 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 0 micros
Restriction: fw: 145.833 micros
Interpolation + add: 197.917 micros
VCycle: 42187.5 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 7.8125 micros
Restriction: fw: 0 micros
Interpolation + add: 585.938 micros
VCycle: 79140.6 micros


BENCHMARK
Jacobi iteration: 7.8125 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 593.75 micros
VCycle: 79296.9 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 7.8125 micros
Interpolation + add: 585.938 micros
VCycle: 79296.9 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 15.625 micros
VCycle: 234375 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 15.625 micros
VCycle: 233906 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 0 micros
Restriction: fw: 0 micros
Interpolation + add: 0 micros
VCycle: 234063 micros

Anarchist4000
03-Jun-2007, 22:31
Ok did a run with 3D clocks and Aero disabled.

1900XT512MB
Vista32 w/ Cat 7.5
3D Clocks/No Aero
All driver settings disabled

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 15.0381 micros
Residual calculation: 15.1358 micros
Restriction: fw: 10.8391 micros
Interpolation + add: 15.1358 micros
VCycle: 334.939 micros


BENCHMARK
Jacobi iteration: 15.0381 micros
Residual calculation: 14.9405 micros
Restriction: fw: 10.7415 micros
Interpolation + add: 15.1357 micros
VCycle: 332.987 micros


BENCHMARK
Jacobi iteration: 15.2334 micros
Residual calculation: 15.1358 micros
Restriction: fw: 10.9368 micros
Interpolation + add: 15.0381 micros
VCycle: 333.963 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 15.5154 micros
Residual calculation: 15.5156 micros
Restriction: fw: 11.1754 micros
Interpolation + add: 15.841 micros
VCycle: 550.096 micros


BENCHMARK
Jacobi iteration: 15.407 micros
Residual calculation: 15.2986 micros
Restriction: fw: 10.9584 micros
Interpolation + add: 15.19 micros
VCycle: 546.84 micros


BENCHMARK
Jacobi iteration: 15.19 micros
Residual calculation: 15.407 micros
Restriction: fw: 11.067 micros
Interpolation + add: 15.0814 micros
VCycle: 550.096 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 15.1358 micros
Residual calculation: 15.3799 micros
Restriction: fw: 11.1077 micros
Interpolation + add: 15.3799 micros
VCycle: 762.89 micros


BENCHMARK
Jacobi iteration: 15.502 micros
Residual calculation: 15.624 micros
Restriction: fw: 10.9856 micros
Interpolation + add: 15.3799 micros
VCycle: 766.553 micros


BENCHMARK
Jacobi iteration: 15.2578 micros
Residual calculation: 15.2579 micros
Restriction: fw: 11.1076 micros
Interpolation + add: 15.7461 micros
VCycle: 762.89 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 14.9266 micros
Residual calculation: 15.2054 micros
Restriction: fw: 11.0206 micros
Interpolation + add: 15.066 micros
VCycle: 969.524 micros


BENCHMARK
Jacobi iteration: 15.066 micros
Residual calculation: 15.2056 micros
Restriction: fw: 11.16 micros
Interpolation + add: 14.9264 micros
VCycle: 963.946 micros


BENCHMARK
Jacobi iteration: 15.2054 micros
Residual calculation: 14.9266 micros
Restriction: fw: 11.0204 micros
Interpolation + add: 15.066 micros
VCycle: 970.92 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 15.4613 micros
Residual calculation: 15.2985 micros
Restriction: fw: 11.067 micros
Interpolation + add: 15.4612 micros
VCycle: 1179.94 micros


BENCHMARK
Jacobi iteration: 15.2985 micros
Residual calculation: 15.2985 micros
Restriction: fw: 10.9042 micros
Interpolation + add: 15.2985 micros
VCycle: 1179.94 micros


BENCHMARK
Jacobi iteration: 15.1357 micros
Residual calculation: 15.2985 micros
Restriction: fw: 11.3925 micros
Interpolation + add: 15.4613 micros
VCycle: 1179.94 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 15.038 micros
Residual calculation: 15.2334 micros
Restriction: fw: 11.5228 micros
Interpolation + add: 15.4286 micros
VCycle: 2191.27 micros


BENCHMARK
Jacobi iteration: 15.2334 micros
Residual calculation: 15.2334 micros
Restriction: fw: 11.132 micros
Interpolation + add: 15.2334 micros
VCycle: 2206.89 micros


BENCHMARK
Jacobi iteration: 15.038 micros
Residual calculation: 15.2334 micros
Restriction: fw: 10.9368 micros
Interpolation + add: 15.2334 micros
VCycle: 2206.89 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 15.3798 micros
Residual calculation: 15.1358 micros
Restriction: fw: 10.9857 micros
Interpolation + add: 15.3798 micros
VCycle: 8825.12 micros


BENCHMARK
Jacobi iteration: 15.3798 micros
Residual calculation: 15.1358 micros
Restriction: fw: 10.9857 micros
Interpolation + add: 15.1358 micros
VCycle: 9616.08 micros


BENCHMARK
Jacobi iteration: 15.1358 micros
Residual calculation: 15.1358 micros
Restriction: fw: 10.9855 micros
Interpolation + add: 15.1358 micros
VCycle: 9613.64 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 15.2987 micros
Residual calculation: 14.973 micros
Restriction: fw: 11.067 micros
Interpolation + add: 14.973 micros
VCycle: 36488.6 micros


BENCHMARK
Jacobi iteration: 24.738 micros
Residual calculation: 25.0637 micros
Restriction: fw: 25.0633 micros
Interpolation + add: 25.389 micros
VCycle: 38773.6 micros


BENCHMARK
Jacobi iteration: 25.0637 micros
Residual calculation: 24.738 micros
Restriction: fw: 25.389 micros
Interpolation + add: 25.0633 micros
VCycle: 38776.8 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 15.136 micros
Residual calculation: 15.1355 micros
Restriction: fw: 11.23 micros
Interpolation + add: 15.1355 micros
VCycle: 144019 micros


BENCHMARK
Jacobi iteration: 83.491 micros
Residual calculation: 92.7675 micros
Restriction: fw: 83.4905 micros
Interpolation + add: 83.491 micros
VCycle: 154424 micros


BENCHMARK
Jacobi iteration: 83.0025 micros
Residual calculation: 83.0025 micros
Restriction: fw: 93.744 micros
Interpolation + add: 83.0025 micros
VCycle: 154429 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 15.624 micros
Residual calculation: 14.647 micros
Restriction: fw: 11.718 micros
Interpolation + add: 14.648 micros
VCycle: 6737.85 micros


BENCHMARK
Jacobi iteration: 8309.04 micros
Residual calculation: 1270.43 micros
Restriction: fw: 341.775 micros
Interpolation + add: 298.809 micros
VCycle: 91917.9 micros


BENCHMARK
Jacobi iteration: 8235.8 micros
Residual calculation: 1793.83 micros
Restriction: fw: 360.329 micros
Interpolation + add: 280.255 micros
VCycle: 93441.3 micros



The residual calculation looks closer to being correct but the others still seem off. Times improved a bit with the 3D clocks and Aero disabled but that doesn't seem to be causing the problem.

Any chance of converting things to DirectX and seeing if that makes a difference?

PeterT
03-Jun-2007, 23:58
Here's a run but this time on a X3220 Core 2 Quad @ 2.7Ghz, 4Gig DDR2 900Mhz 5-4-4-12, still running the Nvidia 158.45 drivers for the MSI 8800 GTX HDOC on a fresh Vista Ultimate x64 install.Your numbers are by far the strangest I've yet encountered (these and the earlier ones). The component times are obviously completely unusable, and the V-cycle times are very bad for a 8800 GTX. In fact, you'd probably be better off running the multigrid method with an SSE-optimized solver on your monstrous CPU :P

Then again your numbers are also the only 8800 Vista results reported, so maybe the NV Vista 8-series drivers just really suck for what I'm doing...

Any chance of converting things to DirectX and seeing if that makes a difference?Not at this point, sorry, my thesis should have been done "end of May" ... by the way, I have to get back to writing :D

Davros
04-Jun-2007, 00:18
- In Vista, 2900XT and 8800GTX results are close, in XP the latter is 6 times faster


Ok maybe not to someone like yourself who understands the hardware, but to an adverage joe (i.e: me)
it appears that the 8800 Really is 6 times faster than the x2900 and the reason they are close in vista is because nvidia's vista drivers are truely awful

but then again im just a gamer ;)

BRiT
04-Jun-2007, 01:27
In fact, you'd probably be better off running the multigrid method with an SSE-optimized solver on your monstrous CPU :P

Then again your numbers are also the only 8800 Vista results reported, so maybe the NV Vista 8-series drivers just really suck for what I'm doing...


Rys also posted some 8800 GTX (or) GTS Vista 64 (http://forum.beyond3d.com/showpost.php?p=1014009&postcount=68) numbers also using 158.42. His Vcycle times come in mostly around what mine are, with his being a bit higher in the larger sets. I would hope, for my sake, his numbers were for the GTX in his system and not his GTS. Is there any way of telling which OpenGL adapter the application bound itself to? I suspect it's just a matter of Vista drivers not being as optimized for this code path as the XP ones are.

Albuquerque
04-Jun-2007, 01:48
Machine:

Dell Inspiron e1505
Intel 945PM chipset
T2300 (Core Duo) 1.66Ghz
2Gb 667Mhz DDR2 5-5-5-15
7300Go 128mb PCI-E
Vista Home Premium x32 w/Dell-provided drivers

On "battery mode":
GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 277.68 micros
Residual calculation: 282.36 micros
Restriction: fw: 254.28 micros
Interpolation + add: 287.04 micros
VCycle: 18922.8 micros


BENCHMARK
Jacobi iteration: 290.16 micros
Residual calculation: 255.84 micros
Restriction: fw: 274.56 micros
Interpolation + add: 257.4 micros
VCycle: 18876 micros


BENCHMARK
Jacobi iteration: 255.84 micros
Residual calculation: 257.4 micros
Restriction: fw: 248.04 micros
Interpolation + add: 254.28 micros
VCycle: 18876 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 256.533 micros
Residual calculation: 251.333 micros
Restriction: fw: 280.8 micros
Interpolation + add: 266.933 micros
VCycle: 30957.3 micros


BENCHMARK
Jacobi iteration: 280.8 micros
Residual calculation: 287.733 micros
Restriction: fw: 280.8 micros
Interpolation + add: 284.267 micros
VCycle: 30870.7 micros


BENCHMARK
Jacobi iteration: 282.533 micros
Residual calculation: 284.267 micros
Restriction: fw: 249.6 micros
Interpolation + add: 256.533 micros
VCycle: 30922.7 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 276.9 micros
Residual calculation: 278.85 micros
Restriction: fw: 284.7 micros
Interpolation + add: 267.15 micros
VCycle: 42939 micros


BENCHMARK
Jacobi iteration: 278.85 micros
Residual calculation: 284.7 micros
Restriction: fw: 274.95 micros
Interpolation + add: 259.35 micros
VCycle: 42997.5 micros


BENCHMARK
Jacobi iteration: 269.1 micros
Residual calculation: 282.75 micros
Restriction: fw: 282.75 micros
Interpolation + add: 286.65 micros
VCycle: 42939 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 274.114 micros
Residual calculation: 271.886 micros
Restriction: fw: 265.2 micros
Interpolation + add: 258.514 micros
VCycle: 54756 micros


BENCHMARK
Jacobi iteration: 287.486 micros
Residual calculation: 276.343 micros
Restriction: fw: 260.743 micros
Interpolation + add: 254.057 micros
VCycle: 54466.3 micros


BENCHMARK
Jacobi iteration: 285.257 micros
Residual calculation: 276.343 micros
Restriction: fw: 267.429 micros
Interpolation + add: 287.486 micros
VCycle: 54510.9 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 280.8 micros
Residual calculation: 291.2 micros
Restriction: fw: 278.2 micros
Interpolation + add: 260 micros
VCycle: 65156 micros


BENCHMARK
Jacobi iteration: 286 micros
Residual calculation: 280.8 micros
Restriction: fw: 296.4 micros
Interpolation + add: 273 micros
VCycle: 65130 micros


BENCHMARK
Jacobi iteration: 275.6 micros
Residual calculation: 299 micros
Restriction: fw: 280.8 micros
Interpolation + add: 265.2 micros
VCycle: 65338 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 636.48 micros
Residual calculation: 733.2 micros
Restriction: fw: 1070.16 micros
Interpolation + add: 393.12 micros
VCycle: 76440 micros


BENCHMARK
Jacobi iteration: 645.84 micros
Residual calculation: 726.96 micros
Restriction: fw: 1067.04 micros
Interpolation + add: 396.24 micros
VCycle: 76408.8 micros


BENCHMARK
Jacobi iteration: 639.6 micros
Residual calculation: 726.96 micros
Restriction: fw: 1063.92 micros
Interpolation + add: 390 micros
VCycle: 76252.8 micros



------------------
-- size: 255
Error using previous program Object.


On "AC mode"
GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 68.64 micros
Residual calculation: 68.64 micros
Restriction: fw: 68.64 micros
Interpolation + add: 68.64 micros
VCycle: 4446 micros


BENCHMARK
Jacobi iteration: 68.64 micros
Residual calculation: 68.64 micros
Restriction: fw: 68.64 micros
Interpolation + add: 68.64 micros
VCycle: 4446 micros


BENCHMARK
Jacobi iteration: 68.64 micros
Residual calculation: 68.64 micros
Restriction: fw: 40.56 micros
Interpolation + add: 59.28 micros
VCycle: 4274.4 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 69.3333 micros
Residual calculation: 67.6 micros
Restriction: fw: 71.0667 micros
Interpolation + add: 67.6 micros
VCycle: 7366.67 micros


BENCHMARK
Jacobi iteration: 69.3333 micros
Residual calculation: 69.3333 micros
Restriction: fw: 69.3333 micros
Interpolation + add: 69.3333 micros
VCycle: 7366.67 micros


BENCHMARK
Jacobi iteration: 69.3333 micros
Residual calculation: 67.6 micros
Restriction: fw: 69.3333 micros
Interpolation + add: 69.3333 micros
VCycle: 7384 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 72.15 micros
Residual calculation: 72.15 micros
Restriction: fw: 70.2 micros
Interpolation + add: 70.2 micros
VCycle: 10354.5 micros


BENCHMARK
Jacobi iteration: 72.15 micros
Residual calculation: 70.2 micros
Restriction: fw: 72.15 micros
Interpolation + add: 68.25 micros
VCycle: 10354.5 micros


BENCHMARK
Jacobi iteration: 72.15 micros
Residual calculation: 70.2 micros
Restriction: fw: 72.15 micros
Interpolation + add: 68.25 micros
VCycle: 10354.5 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 140.4 micros
Residual calculation: 138.171 micros
Restriction: fw: 100.286 micros
Interpolation + add: 80.2286 micros
VCycle: 13304.6 micros


BENCHMARK
Jacobi iteration: 138.171 micros
Residual calculation: 140.4 micros
Restriction: fw: 102.514 micros
Interpolation + add: 84.6857 micros
VCycle: 13326.9 micros


BENCHMARK
Jacobi iteration: 140.4 micros
Residual calculation: 142.629 micros
Restriction: fw: 100.286 micros
Interpolation + add: 82.4571 micros
VCycle: 13282.3 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 512.2 micros
Residual calculation: 543.4 micros
Restriction: fw: 374.4 micros
Interpolation + add: 301.6 micros
VCycle: 17550 micros


BENCHMARK
Jacobi iteration: 512.2 micros
Residual calculation: 540.8 micros
Restriction: fw: 377 micros
Interpolation + add: 301.6 micros
VCycle: 17472 micros


BENCHMARK
Jacobi iteration: 512.2 micros
Residual calculation: 540.8 micros
Restriction: fw: 377 micros
Interpolation + add: 301.6 micros
VCycle: 17524 micros



------------------
-- size: 127
GLFilterStep -- Error doing makeCurrent on target.

Unknown Soldier
05-Jun-2007, 06:25
Hi Peter,

I ran this on my PC and also found that it used my CPU, funnily enough, the 2047 test utilizes all CPU's to a degree. At the beginning, at least one CPU runs at 30%(memory at 277Mb). This is pretty constant until 1023 where the CPU or two hits around 34% with memory 330Mb+-

The last test - 2047 - utilises all CPU's to a degree with CPU usage maxxing at 34%. Memory climbed to 720Mb+

I ran a few tests after each other.

WinXP QX6600, 2Gig 800 DDR2 memory, Nvidia GTS - Default speeds all - and driver 158.22

I didn't use test 3 because I was trying to capture screenshots but that didn't work. CPU pics at the end were captured at the end of the test.

Last, my GTS 320Mb ran all tests .. including the 2047. ;)

Test 1

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 1.5625 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 1.5625 micros
VCycle: 500 micros


BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 1.5625 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 3.125 micros
VCycle: 484.375 micros


BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 1.5625 micros
Restriction: fw: 3.125 micros
Interpolation + add: 1.5625 micros
VCycle: 484.375 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 3.47222 micros
VCycle: 833.333 micros


BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 815.972 micros


BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 815.972 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1152.34 micros


BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1152.34 micros


BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1152.34 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1517.86 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1473.21 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1473.21 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1822.92 micros


BENCHMARK
Jacobi iteration: 2.60417 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 5.20833 micros
VCycle: 1822.92 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1875 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 21.875 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2218.75 micros


BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2218.75 micros


BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2218.75 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 74.2188 micros
Interpolation + add: 50.7813 micros
VCycle: 2851.56 micros


BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 74.2188 micros
Interpolation + add: 50.7813 micros
VCycle: 2851.56 micros


BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 74.2188 micros
Interpolation + add: 50.7813 micros
VCycle: 2851.56 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 328.125 micros
Residual calculation: 296.875 micros
Restriction: fw: 312.5 micros
Interpolation + add: 192.708 micros
VCycle: 5729.17 micros


BENCHMARK
Jacobi iteration: 328.125 micros
Residual calculation: 302.083 micros
Restriction: fw: 317.708 micros
Interpolation + add: 187.5 micros
VCycle: 5729.17 micros


BENCHMARK
Jacobi iteration: 322.917 micros
Residual calculation: 302.083 micros
Restriction: fw: 317.708 micros
Interpolation + add: 187.5 micros
VCycle: 5729.17 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 1304.69 micros
Residual calculation: 1210.94 micros
Restriction: fw: 1312.5 micros
Interpolation + add: 804.688 micros
VCycle: 24375 micros


BENCHMARK
Jacobi iteration: 1304.69 micros
Residual calculation: 1210.94 micros
Restriction: fw: 1320.31 micros
Interpolation + add: 812.5 micros
VCycle: 24296.9 micros


BENCHMARK
Jacobi iteration: 1296.88 micros
Residual calculation: 1218.75 micros
Restriction: fw: 1328.13 micros
Interpolation + add: 804.688 micros
VCycle: 24375 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 687.5 micros
Residual calculation: 4687.5 micros
Restriction: fw: 6656.25 micros
Interpolation + add: 4671.88 micros
VCycle: 138750 micros


BENCHMARK
Jacobi iteration: 734.375 micros
Residual calculation: 4687.5 micros
Restriction: fw: 6640.63 micros
Interpolation + add: 4687.5 micros
VCycle: 138594 micros


BENCHMARK
Jacobi iteration: 718.75 micros
Residual calculation: 4656.25 micros
Restriction: fw: 6656.25 micros
Interpolation + add: 4718.75 micros
VCycle: 138438 micros



Test 2

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 3.125 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 1.5625 micros
VCycle: 500 micros


BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 1.5625 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 3.125 micros
VCycle: 484.375 micros


BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 3.125 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 1.5625 micros
VCycle: 500 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 815.972 micros


BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 1.73611 micros
VCycle: 833.333 micros


BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 1.73611 micros
VCycle: 850.694 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1171.88 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1152.34 micros


BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1171.88 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 0 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1495.54 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 0 micros
Interpolation + add: 2.23214 micros
VCycle: 1495.54 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1495.54 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1848.96 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 2.60417 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 2.60417 micros
VCycle: 1848.96 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 2.60417 micros
Restriction: fw: 5.20833 micros
Interpolation + add: 2.60417 micros
VCycle: 1848.96 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2218.75 micros


BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2250 micros


BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2250 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 70.3125 micros
Interpolation + add: 50.7813 micros
VCycle: 2929.69 micros


BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 70.3125 micros
Interpolation + add: 50.7813 micros
VCycle: 2890.63 micros


BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 74.2188 micros
Interpolation + add: 46.875 micros
VCycle: 2929.69 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 328.125 micros
Residual calculation: 296.875 micros
Restriction: fw: 317.708 micros
Interpolation + add: 187.5 micros
VCycle: 5781.25 micros


BENCHMARK
Jacobi iteration: 322.917 micros
Residual calculation: 302.083 micros
Restriction: fw: 317.708 micros
Interpolation + add: 187.5 micros
VCycle: 5781.25 micros


BENCHMARK
Jacobi iteration: 322.917 micros
Residual calculation: 302.083 micros
Restriction: fw: 317.708 micros
Interpolation + add: 187.5 micros
VCycle: 5781.25 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 1312.5 micros
Residual calculation: 1203.13 micros
Restriction: fw: 1335.94 micros
Interpolation + add: 828.125 micros
VCycle: 24531.3 micros


BENCHMARK
Jacobi iteration: 1304.69 micros
Residual calculation: 1187.5 micros
Restriction: fw: 1343.75 micros
Interpolation + add: 812.5 micros
VCycle: 24609.4 micros


BENCHMARK
Jacobi iteration: 1312.5 micros
Residual calculation: 1203.13 micros
Restriction: fw: 1320.31 micros
Interpolation + add: 835.938 micros
VCycle: 24531.3 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 671.875 micros
Residual calculation: 4453.13 micros
Restriction: fw: 6359.38 micros
Interpolation + add: 4609.38 micros
VCycle: 142031 micros


BENCHMARK
Jacobi iteration: 718.75 micros
Residual calculation: 4453.13 micros
Restriction: fw: 6359.38 micros
Interpolation + add: 4578.13 micros
VCycle: 141875 micros


BENCHMARK
Jacobi iteration: 687.5 micros
Residual calculation: 4531.25 micros
Restriction: fw: 6328.13 micros
Interpolation + add: 4593.75 micros
VCycle: 141563 micros


Test 4

GL multigrid benchmark 0.5 by Peter Thoman 2007

DevIL initialized successfully.
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
GLEW version 1.3.5 initialized successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.



------------------
-- size: 3

BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 3.125 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 1.5625 micros
VCycle: 484.375 micros


BENCHMARK
Jacobi iteration: 3.125 micros
Residual calculation: 1.5625 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 1.5625 micros
VCycle: 500 micros


BENCHMARK
Jacobi iteration: 1.5625 micros
Residual calculation: 1.5625 micros
Restriction: fw: 1.5625 micros
Interpolation + add: 3.125 micros
VCycle: 484.375 micros



------------------
-- size: 7

BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 3.47222 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 1.73611 micros
VCycle: 833.333 micros


BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 1.73611 micros
Interpolation + add: 1.73611 micros
VCycle: 815.972 micros


BENCHMARK
Jacobi iteration: 1.73611 micros
Residual calculation: 1.73611 micros
Restriction: fw: 3.47222 micros
Interpolation + add: 1.73611 micros
VCycle: 815.972 micros



------------------
-- size: 15

BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1152.34 micros


BENCHMARK
Jacobi iteration: 1.95313 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1152.34 micros


BENCHMARK
Jacobi iteration: 3.90625 micros
Residual calculation: 1.95313 micros
Restriction: fw: 1.95313 micros
Interpolation + add: 1.95313 micros
VCycle: 1132.81 micros



------------------
-- size: 31

BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1473.21 micros


BENCHMARK
Jacobi iteration: 2.23214 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1473.21 micros


BENCHMARK
Jacobi iteration: 0 micros
Residual calculation: 2.23214 micros
Restriction: fw: 2.23214 micros
Interpolation + add: 2.23214 micros
VCycle: 1473.21 micros



------------------
-- size: 63

BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1822.92 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1822.92 micros


BENCHMARK
Jacobi iteration: 5.20833 micros
Residual calculation: 5.20833 micros
Restriction: fw: 2.60417 micros
Interpolation + add: 2.60417 micros
VCycle: 1848.96 micros



------------------
-- size: 127

BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2218.75 micros


BENCHMARK
Jacobi iteration: 21.875 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2187.5 micros


BENCHMARK
Jacobi iteration: 18.75 micros
Residual calculation: 18.75 micros
Restriction: fw: 15.625 micros
Interpolation + add: 12.5 micros
VCycle: 2218.75 micros



------------------
-- size: 255

BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 74.2188 micros
Interpolation + add: 46.875 micros
VCycle: 2890.63 micros


BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 70.3125 micros
Interpolation + add: 50.7813 micros
VCycle: 2890.63 micros


BENCHMARK
Jacobi iteration: 82.0313 micros
Residual calculation: 78.125 micros
Restriction: fw: 70.3125 micros
Interpolation + add: 50.7813 micros
VCycle: 2890.63 micros



------------------
-- size: 511

BENCHMARK
Jacobi iteration: 322.917 micros
Residual calculation: 302.083 micros
Restriction: fw: 317.708 micros
Interpolation + add: 187.5 micros
VCycle: 5729.17 micros


BENCHMARK
Jacobi iteration: 328.125 micros
Residual calculation: 302.083 micros
Restriction: fw: 312.5 micros
Interpolation + add: 187.5 micros
VCycle: 5781.25 micros


BENCHMARK
Jacobi iteration: 328.125 micros
Residual calculation: 302.083 micros
Restriction: fw: 312.5 micros
Interpolation + add: 187.5 micros
VCycle: 5781.25 micros



------------------
-- size: 1023

BENCHMARK
Jacobi iteration: 1304.69 micros
Residual calculation: 1210.94 micros
Restriction: fw: 1320.31 micros
Interpolation + add: 804.688 micros
VCycle: 24453.1 micros


BENCHMARK
Jacobi iteration: 1312.5 micros
Residual calculation: 1203.13 micros
Restriction: fw: 1312.5 micros
Interpolation + add: 812.5 micros
VCycle: 24375 micros


BENCHMARK
Jacobi iteration: 1304.69 micros
Residual calculation: 1203.13 micros
Restriction: fw: 1328.13 micros
Interpolation + add: 804.688 micros
VCycle: 24453.1 micros



------------------
-- size: 2047

BENCHMARK
Jacobi iteration: 734.375 micros
Residual calculation: 4656.25 micros
Restriction: fw: 6718.75 micros
Interpolation + add: 4640.63 micros
VCycle: 138594 micros


BENCHMARK
Jacobi iteration: 718.75 micros
Residual calculation: 4703.13 micros
Restriction: fw: 6656.25 micros
Interpolation + add: 4671.88 micros
VCycle: 138438 micros


BENCHMARK
Jacobi iteration: 734.375 micros
Residual calculation: 4671.88 micros
Restriction: fw: 6687.5 micros
Interpolation + add: 4640.63 micros
VCycle: 138438 micros


http://img530.imageshack.us/img530/5052/test2yq7.th.jpg (http://img530.imageshack.us/my.php?image=test2yq7.jpg)

http://img186.imageshack.us/img186/118/test3afterrp1.th.jpg (http://img186.imageshack.us/my.php?image=test3afterrp1.jpg)

http://img443.imageshack.us/img443/4551/test5hs6.th.jpg (http://img443.imageshack.us/my.php?image=test5hs6.jpg)

US