New GLSL / Pbuffer benchmark [Update: version 1.4 / ORCv0.4]

I would like to help out as well, but I do not have the .NET Framework installed. I will try to hunt it down. The benchmark is a great piece of software IMO, and kudos on a job well done; it has produced some quite interesting results.

There could be a correlation between the poor performance of the Cat 5.4 drivers here and their poorer Doom 3 benchmark results, which were posted in a thread at Beyond3D, I believe.

Just wanted to say some of the systems you have are quite :oops: and droolworthy. FX 55 with a Quadro?! WOW! I am on a laptop with a 3.4 GHz CPU, 1 GB of DDR400 and a 256 MB ATI Mobility.
 
Here is my result: P4 3.2C, GF6800 256MB, WinXP

Keep up the good work, guys. Your work has been a tremendous help for our project as well. :) Good luck!

Windows XP SP2, P4 3.2C, 1 GB DDR400 RAM
GeForce 6800 AGP 8x, 256 MB
Driver version 77.72

Code:
GL filter framework 1.4 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
Results for BufferCreateINT: msecs: 140 || ms/i: 23.3333 || i/s: 42.8571
No suitable INT format found. Trying FP... (Flaky 6x00 workaround)

Results for BufferCreateINT16: msecs: 219 || ms/i: 36.5 || i/s: 27.3973
Results for BufferCreateFP16: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateFP32: msecs: 141 || ms/i: 23.5 || i/s: 42.5532
Results for JustCopy: msecs: 203 || ms/i: 0.1015 || i/s: 9852.22
Results for SimpleSmooth: msecs: 266 || ms/i: 0.133 || i/s: 7518.8
Results for TexNoise: msecs: 250 || ms/i: 0.125 || i/s: 8000
Results for 3x3Conv: msecs: 172 || ms/i: 0.172 || i/s: 5813.95
Results for TEncode: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
Results for TDecode: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
Results for LinDiffINT: msecs: 250 || ms/i: 0.125 || i/s: 8000
Results for LinDiffINT16: msecs: 250 || ms/i: 0.125 || i/s: 8000
Results for LinDiffFP16: msecs: 250 || ms/i: 0.125 || i/s: 8000
Results for LinDiffFP32: msecs: 250 || ms/i: 0.125 || i/s: 8000
Results for LD_INT->FP16: msecs: 125 || ms/i: 0.125 || i/s: 8000
Results for LD_INT->FP32: msecs: 125 || ms/i: 0.125 || i/s: 8000
Results for LD_FP16->INT: msecs: 125 || ms/i: 0.125 || i/s: 8000
Results for LD_FP32->INT: msecs: 125 || ms/i: 0.125 || i/s: 8000
Results for PMTEncoded: msecs: 735 || ms/i: 0.735 || i/s: 1360.54
Results for PMStandard: msecs: 453 || ms/i: 0.453 || i/s: 2207.51
Results for PMBuffered: msecs: 47 || ms/i: 0.094 || i/s: 10638.3

Testing 64x64 image:
Results for BufferCreateINT: msecs: 156 || ms/i: 26 || i/s: 38.4615
Results for BufferCreateINT16: msecs: 266 || ms/i: 44.3333 || i/s: 22.5564
Results for BufferCreateFP16: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateFP32: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for JustCopy: msecs: 265 || ms/i: 0.1325 || i/s: 7547.17
Results for SimpleSmooth: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
Results for TexNoise: msecs: 297 || ms/i: 0.1485 || i/s: 6734.01
Results for 3x3Conv: msecs: 109 || ms/i: 0.109 || i/s: 9174.31
Results for TEncode: msecs: 94 || ms/i: 0.094 || i/s: 10638.3
Results for TDecode: msecs: 141 || ms/i: 0.141 || i/s: 7092.2
Results for LinDiffINT: msecs: 265 || ms/i: 0.1325 || i/s: 7547.17
Results for LinDiffINT16: msecs: 281 || ms/i: 0.1405 || i/s: 7117.44
Results for LinDiffFP16: msecs: 250 || ms/i: 0.125 || i/s: 8000
Results for LinDiffFP32: msecs: 250 || ms/i: 0.125 || i/s: 8000
Results for LD_INT->FP16: msecs: 125 || ms/i: 0.125 || i/s: 8000
Results for LD_INT->FP32: msecs: 110 || ms/i: 0.11 || i/s: 9090.91
Results for LD_FP16->INT: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
Results for LD_FP32->INT: msecs: 141 || ms/i: 0.141 || i/s: 7092.2
Results for PMTEncoded: msecs: 454 || ms/i: 0.454 || i/s: 2202.64
Results for PMStandard: msecs: 438 || ms/i: 0.438 || i/s: 2283.11
Results for PMBuffered: msecs: 78 || ms/i: 0.156 || i/s: 6410.26

Testing 128x128 image:
Results for BufferCreateINT: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateINT16: msecs: 219 || ms/i: 36.5 || i/s: 27.3973
Results for BufferCreateFP16: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateFP32: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for JustCopy: msecs: 187 || ms/i: 0.0935 || i/s: 10695.2
Results for SimpleSmooth: msecs: 203 || ms/i: 0.1015 || i/s: 9852.22
Results for TexNoise: msecs: 203 || ms/i: 0.1015 || i/s: 9852.22
Results for 3x3Conv: msecs: 109 || ms/i: 0.109 || i/s: 9174.31
Results for TEncode: msecs: 109 || ms/i: 0.109 || i/s: 9174.31
Results for TDecode: msecs: 109 || ms/i: 0.109 || i/s: 9174.31
Results for LinDiffINT: msecs: 281 || ms/i: 0.1405 || i/s: 7117.44
Results for LinDiffINT16: msecs: 266 || ms/i: 0.133 || i/s: 7518.8
Results for LinDiffFP16: msecs: 282 || ms/i: 0.141 || i/s: 7092.2
Results for LinDiffFP32: msecs: 704 || ms/i: 0.352 || i/s: 2840.91
Results for LD_INT->FP16: msecs: 141 || ms/i: 0.141 || i/s: 7092.2
Results for LD_INT->FP32: msecs: 141 || ms/i: 0.141 || i/s: 7092.2
Results for LD_FP16->INT: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
Results for LD_FP32->INT: msecs: 328 || ms/i: 0.328 || i/s: 3048.78
Results for PMTEncoded: msecs: 437 || ms/i: 0.437 || i/s: 2288.33
Results for PMStandard: msecs: 1125 || ms/i: 1.125 || i/s: 888.889
Results for PMBuffered: msecs: 265 || ms/i: 0.53 || i/s: 1886.79

Testing 256x256 image:
Results for BufferCreateINT: msecs: 141 || ms/i: 23.5 || i/s: 42.5532
Results for BufferCreateINT16: msecs: 219 || ms/i: 36.5 || i/s: 27.3973
Results for BufferCreateFP16: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateFP32: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for JustCopy: msecs: 234 || ms/i: 0.117 || i/s: 8547.01
Results for SimpleSmooth: msecs: 312 || ms/i: 0.156 || i/s: 6410.26
Results for TexNoise: msecs: 375 || ms/i: 0.1875 || i/s: 5333.33
Results for 3x3Conv: msecs: 344 || ms/i: 0.344 || i/s: 2906.98
Results for TEncode: msecs: 94 || ms/i: 0.094 || i/s: 10638.3
Results for TDecode: msecs: 219 || ms/i: 0.219 || i/s: 4566.21
Results for LinDiffINT: msecs: 328 || ms/i: 0.164 || i/s: 6097.56
Results for LinDiffINT16: msecs: 828 || ms/i: 0.414 || i/s: 2415.46
Results for LinDiffFP16: msecs: 922 || ms/i: 0.461 || i/s: 2169.2
Results for LinDiffFP32: msecs: 2734 || ms/i: 1.367 || i/s: 731.529
Results for LD_INT->FP16: msecs: 172 || ms/i: 0.172 || i/s: 5813.95
Results for LD_INT->FP32: msecs: 172 || ms/i: 0.172 || i/s: 5813.95
Results for LD_FP16->INT: msecs: 360 || ms/i: 0.36 || i/s: 2777.78
Results for LD_FP32->INT: msecs: 1343 || ms/i: 1.343 || i/s: 744.602
Results for PMTEncoded: msecs: 1250 || ms/i: 1.25 || i/s: 800
Results for PMStandard: msecs: 4203 || ms/i: 4.203 || i/s: 237.925
Results for PMBuffered: msecs: 1359 || ms/i: 2.718 || i/s: 367.918

Testing 512x512 image:
Results for BufferCreateINT: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateINT16: msecs: 234 || ms/i: 39 || i/s: 25.641
Results for BufferCreateFP16: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateFP32: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for JustCopy: msecs: 500 || ms/i: 0.5 || i/s: 2000
Results for SimpleSmooth: msecs: 469 || ms/i: 0.469 || i/s: 2132.2
Results for TexNoise: msecs: 484 || ms/i: 0.484 || i/s: 2066.12
Results for 3x3Conv: msecs: 578 || ms/i: 1.156 || i/s: 865.052
Results for TEncode: msecs: 140 || ms/i: 0.28 || i/s: 3571.43
Results for TDecode: msecs: 375 || ms/i: 0.75 || i/s: 1333.33
Results for LinDiffINT: msecs: 531 || ms/i: 0.531 || i/s: 1883.24
Results for LinDiffINT16: msecs: 1578 || ms/i: 1.578 || i/s: 633.714
Results for LinDiffFP16: msecs: 1390 || ms/i: 1.39 || i/s: 719.424
Results for LinDiffFP32: msecs: 4907 || ms/i: 4.907 || i/s: 203.791
Results for LD_INT->FP16: msecs: 281 || ms/i: 0.562 || i/s: 1779.36
Results for LD_INT->FP32: msecs: 313 || ms/i: 0.626 || i/s: 1597.44
Results for LD_FP16->INT: msecs: 625 || ms/i: 1.25 || i/s: 800
Results for LD_FP32->INT: msecs: 2266 || ms/i: 4.532 || i/s: 220.653
Results for PMTEncoded: msecs: 2328 || ms/i: 4.656 || i/s: 214.777
Results for PMStandard: msecs: 7735 || ms/i: 15.47 || i/s: 64.6412
Results for PMBuffered: msecs: 1703 || ms/i: 6.812 || i/s: 146.8

Testing 1024x1024 image:
Results for BufferCreateINT: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateINT16: msecs: 219 || ms/i: 36.5 || i/s: 27.3973
Results for BufferCreateFP16: msecs: 125 || ms/i: 20.8333 || i/s: 48
Results for BufferCreateFP32: msecs: 140 || ms/i: 23.3333 || i/s: 42.8571
Results for JustCopy: msecs: 1547 || ms/i: 1.547 || i/s: 646.412
Results for SimpleSmooth: msecs: 1813 || ms/i: 1.813 || i/s: 551.572
Results for TexNoise: msecs: 1781 || ms/i: 1.781 || i/s: 561.482
Results for 3x3Conv: msecs: 2234 || ms/i: 4.468 || i/s: 223.814
Results for TEncode: msecs: 516 || ms/i: 1.032 || i/s: 968.992
Results for TDecode: msecs: 1469 || ms/i: 2.938 || i/s: 340.368
Results for LinDiffINT: msecs: 2063 || ms/i: 2.063 || i/s: 484.731
Results for LinDiffINT16: msecs: 5344 || ms/i: 5.344 || i/s: 187.126
Results for LinDiffFP16: msecs: 5360 || ms/i: 5.36 || i/s: 186.567
Results for LinDiffFP32: msecs: 19547 || ms/i: 19.547 || i/s: 51.1587
Results for LD_INT->FP16: msecs: 1047 || ms/i: 2.094 || i/s: 477.555
Results for LD_INT->FP32: msecs: 1375 || ms/i: 2.75 || i/s: 363.636
Results for LD_FP16->INT: msecs: 2531 || ms/i: 5.062 || i/s: 197.55
Results for LD_FP32->INT: msecs: 9203 || ms/i: 18.406 || i/s: 54.3301
Results for PMTEncoded: msecs: 9141 || ms/i: 18.282 || i/s: 54.6986
Results for PMStandard: msecs: 31109 || ms/i: 62.218 || i/s: 16.0725
Results for PMBuffered: msecs: 13657 || ms/i: 54.628 || i/s: 18.3056

Finished. Press return key to close...
                Don't forget to copy the results!
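
For anyone who wants to compare logs like the one above across systems, here is a minimal Python sketch that pulls the three numbers out of each "Results for ..." line. It is not part of the benchmark. The function name `parse_result` and the field names are my own, and I am assuming from the arithmetic in the log that "ms/i" means milliseconds per iteration and "i/s" means iterations per second, so the iteration count can be estimated as msecs divided by ms/i.

```python
import re

# Matches lines such as:
#   "Results for JustCopy: msecs: 203 || ms/i: 0.1015 || i/s: 9852.22"
RESULT_RE = re.compile(
    r"Results for (?P<name>.+?): msecs: (?P<msecs>[\d.]+) \|\| "
    r"ms/i: (?P<ms_per_iter>[\d.]+) \|\| i/s: (?P<iters_per_sec>[\d.]+)"
)


def parse_result(line):
    """Return a dict for one result line, or None if the line is not a result."""
    m = RESULT_RE.search(line)
    if m is None:
        return None
    msecs = float(m.group("msecs"))
    ms_per_iter = float(m.group("ms_per_iter"))
    return {
        "name": m.group("name"),
        "msecs": msecs,
        "ms_per_iter": ms_per_iter,
        "iters_per_sec": float(m.group("iters_per_sec")),
        # Assumption: total time / time per iteration = number of iterations.
        "iterations": round(msecs / ms_per_iter),
    }


if __name__ == "__main__":
    sample = "Results for JustCopy: msecs: 203 || ms/i: 0.1015 || i/s: 9852.22"
    print(parse_result(sample))
```

Feeding every line of a pasted log through `parse_result` and skipping the `None` results gives one record per test, which can then be grouped by the preceding "Testing NxN image:" header.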
 