New GLSL / Pbuffer benchmark [Update: version 1.4 / ORCv0.4]

PeterT

I just finished a small benchmarking app based on my framework for hardware-accelerated image processing. Now I'd like to get some results from other systems, especially R350 / NV4x-equipped ones. So, if you can spare a few minutes (and have the required hardware), please run this program.

Requirements
  • A Windows 2000 / XP system
  • GLSL support, WGL_TYPE_RGBA_FLOAT_ATI, and the rendertexture extension
  • That means: Radeon 9500+ or GeForce 6xxx
  • Some other cards may also work (Wildcat Realizm!), so please try it if you have such a beast
A disclaimer
  • This is not an eyecandy benchmark, sorry :(
  • This is not a "my gfx is bigger than yours" benchmark, so it doesn't offer any "overall performance numbers" or similar fluff
  • This is research software, so it may crash, please tell me what it says if it does ;)
Now, the download: Get it! (New version with 6x00 fix)
(just 1.4 MB, so it won't strain your modem :D)

How to Use
  • Start the app
  • Click on the console window
  • Hit the return key
  • Wait a bit (please don't run any intensive stuff as that may affect the results)
  • Copy the results from the console into a 'code' box in a reply to this thread
  • Hit return again with the console window active to quit
  • Add your graphics card and CPU type to your reply
  • Post it
  • Profit!!

The output should look something like this:
(Radeon 9700NP // Athlon XP 1800+)
Code:
GL filter framework 1.2999 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
BufferCreateINT: msecs: 140 || ms/i: 23.3333 || i/s: 42.8571
BufferCreateINT16: msecs: 141 || ms/i: 23.5 || i/s: 42.5532
BufferCreateFP16: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateFP32: msecs: 156 || ms/i: 26 || i/s: 38.4615
JustCopy: msecs: 797 || ms/i: 0.3985 || i/s: 2509.41
SimpleSmooth: msecs: 844 || ms/i: 0.422 || i/s: 2369.67
TexNoise: msecs: 860 || ms/i: 0.43 || i/s: 2325.58
3x3Conv: msecs: 453 || ms/i: 0.453 || i/s: 2207.51
TEncode: msecs: 406 || ms/i: 0.406 || i/s: 2463.05
TDecode: msecs: 875 || ms/i: 0.875 || i/s: 1142.86
LinDiffINT: msecs: 1015 || ms/i: 0.5075 || i/s: 1970.44
LinDiffINT16: msecs: 1016 || ms/i: 0.508 || i/s: 1968.5
LinDiffFP16: msecs: 1031 || ms/i: 0.5155 || i/s: 1939.86
LinDiffFP32: msecs: 1031 || ms/i: 0.5155 || i/s: 1939.86
PMTEncoded: msecs: 1500 || ms/i: 1.5 || i/s: 666.667
PMStandard: msecs: 1640 || ms/i: 1.64 || i/s: 609.756
PMBuffered: msecs: 172 || ms/i: 0.344 || i/s: 2906.98

Testing 64x64 image:
BufferCreateINT: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateINT16: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateFP16: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateFP32: msecs: 156 || ms/i: 26 || i/s: 38.4615
JustCopy: msecs: 937 || ms/i: 0.4685 || i/s: 2134.47
SimpleSmooth: msecs: 1000 || ms/i: 0.5 || i/s: 2000
TexNoise: msecs: 1031 || ms/i: 0.5155 || i/s: 1939.86
3x3Conv: msecs: 547 || ms/i: 0.547 || i/s: 1828.15
TEncode: msecs: 438 || ms/i: 0.438 || i/s: 2283.11
TDecode: msecs: 985 || ms/i: 0.985 || i/s: 1015.23
LinDiffINT: msecs: 1031 || ms/i: 0.5155 || i/s: 1939.86
LinDiffINT16: msecs: 938 || ms/i: 0.469 || i/s: 2132.2
LinDiffFP16: msecs: 906 || ms/i: 0.453 || i/s: 2207.51
LinDiffFP32: msecs: 922 || ms/i: 0.461 || i/s: 2169.2
PMTEncoded: msecs: 1468 || ms/i: 1.468 || i/s: 681.199
PMStandard: msecs: 1468 || ms/i: 1.468 || i/s: 681.199
PMBuffered: msecs: 172 || ms/i: 0.344 || i/s: 2906.98

Testing 128x128 image:
BufferCreateINT: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateINT16: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateFP16: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateFP32: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
JustCopy: msecs: 797 || ms/i: 0.3985 || i/s: 2509.41
SimpleSmooth: msecs: 828 || ms/i: 0.414 || i/s: 2415.46
TexNoise: msecs: 875 || ms/i: 0.4375 || i/s: 2285.71
3x3Conv: msecs: 453 || ms/i: 0.453 || i/s: 2207.51
TEncode: msecs: 406 || ms/i: 0.406 || i/s: 2463.05
TDecode: msecs: 891 || ms/i: 0.891 || i/s: 1122.33
LinDiffINT: msecs: 937 || ms/i: 0.4685 || i/s: 2134.47
LinDiffINT16: msecs: 937 || ms/i: 0.4685 || i/s: 2134.47
LinDiffFP16: msecs: 1078 || ms/i: 0.539 || i/s: 1855.29
LinDiffFP32: msecs: 937 || ms/i: 0.4685 || i/s: 2134.47
PMTEncoded: msecs: 1735 || ms/i: 1.735 || i/s: 576.369
PMStandard: msecs: 1734 || ms/i: 1.734 || i/s: 576.701
PMBuffered: msecs: 266 || ms/i: 0.532 || i/s: 1879.7

Testing 256x256 image:
BufferCreateINT: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateINT16: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateFP16: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateFP32: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
JustCopy: msecs: 844 || ms/i: 0.422 || i/s: 2369.67
SimpleSmooth: msecs: 859 || ms/i: 0.4295 || i/s: 2328.29
TexNoise: msecs: 891 || ms/i: 0.4455 || i/s: 2244.67
3x3Conv: msecs: 1266 || ms/i: 1.266 || i/s: 789.889
TEncode: msecs: 422 || ms/i: 0.422 || i/s: 2369.67
TDecode: msecs: 906 || ms/i: 0.906 || i/s: 1103.75
LinDiffINT: msecs: 1093 || ms/i: 0.5465 || i/s: 1829.83
LinDiffINT16: msecs: 1109 || ms/i: 0.5545 || i/s: 1803.43
LinDiffFP16: msecs: 1109 || ms/i: 0.5545 || i/s: 1803.43
LinDiffFP32: msecs: 1218 || ms/i: 0.609 || i/s: 1642.04
PMTEncoded: msecs: 1516 || ms/i: 1.516 || i/s: 659.631
PMStandard: msecs: 2000 || ms/i: 2 || i/s: 500
PMBuffered: msecs: 953 || ms/i: 1.906 || i/s: 524.659




Testing 512x512 image:
BufferCreateINT: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateINT16: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateFP16: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateFP32: msecs: 203 || ms/i: 33.8333 || i/s: 29.5567
JustCopy: msecs: 484 || ms/i: 0.484 || i/s: 2066.12
SimpleSmooth: msecs: 1219 || ms/i: 1.219 || i/s: 820.345
TexNoise: msecs: 984 || ms/i: 0.984 || i/s: 1016.26
3x3Conv: msecs: 1766 || ms/i: 3.532 || i/s: 283.126
TEncode: msecs: 219 || ms/i: 0.438 || i/s: 2283.11
TDecode: msecs: 1141 || ms/i: 2.282 || i/s: 438.212
LinDiffINT: msecs: 1781 || ms/i: 1.781 || i/s: 561.482
LinDiffINT16: msecs: 1531 || ms/i: 1.531 || i/s: 653.168
LinDiffFP16: msecs: 1484 || ms/i: 1.484 || i/s: 673.854
LinDiffFP32: msecs: 2516 || ms/i: 2.516 || i/s: 397.456
PMTEncoded: msecs: 1484 || ms/i: 2.968 || i/s: 336.927
PMStandard: msecs: 4031 || ms/i: 8.062 || i/s: 124.039
PMBuffered: msecs: 750 || ms/i: 3 || i/s: 333.333

Testing 1024x1024 image:
BufferCreateINT: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateINT16: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
BufferCreateFP16: msecs: 156 || ms/i: 26 || i/s: 38.4615
BufferCreateFP32: msecs: 172 || ms/i: 28.6667 || i/s: 34.8837
JustCopy: msecs: 2438 || ms/i: 2.438 || i/s: 410.172
SimpleSmooth: msecs: 4921 || ms/i: 4.921 || i/s: 203.211
TexNoise: msecs: 2938 || ms/i: 2.938 || i/s: 340.368
3x3Conv: msecs: 5297 || ms/i: 10.594 || i/s: 94.3931
TEncode: msecs: 469 || ms/i: 0.938 || i/s: 1066.1
TDecode: msecs: 3250 || ms/i: 6.5 || i/s: 153.846
LinDiffINT: msecs: 8063 || ms/i: 8.063 || i/s: 124.023
LinDiffINT16: msecs: 6032 || ms/i: 6.032 || i/s: 165.782
LinDiffFP16: msecs: 6016 || ms/i: 6.016 || i/s: 166.223
LinDiffFP32: msecs: 8578 || ms/i: 8.578 || i/s: 116.577
PMTEncoded: msecs: 5875 || ms/i: 11.75 || i/s: 85.1064
PMStandard: msecs: 16235 || ms/i: 32.47 || i/s: 30.7977
PMBuffered: msecs: 10610 || ms/i: 42.44 || i/s: 23.5627

Thanks for your time!

[edit]
How to copy:
Right click on the title bar of the console window -> Edit -> Select all
Then Edit -> Copy

[edit2]
What's good?
msecs and ms/i: lower is better
i/s: higher is better
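
The three columns are related. Judging from the numbers posted in this thread, ms/i looks like msecs divided by the number of iterations, and i/s looks like 1000 / ms/i. Here is a minimal Python sketch that parses one result line and checks this. The function names and the regular expression are my own assumptions based on the console output above; they are not part of the benchmark itself.

```python
import re

# Pattern for one result line as printed in the console logs in this thread.
# The layout is inferred from the posted output, not from the program's source.
LINE_RE = re.compile(
    r"^(?P<test>\w+): msecs: (?P<msecs>[\d.]+)"
    r" \|\| ms/i: (?P<ms_per_iter>[\d.]+)"
    r" \|\| i/s: (?P<iter_per_sec>[\d.]+)"
)


def parse_result_line(line):
    """Return a dict for a result line, or None if the line is something else."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    return {
        "test": match.group("test"),
        "msecs": float(match.group("msecs")),
        "ms_per_iter": float(match.group("ms_per_iter")),
        "iter_per_sec": float(match.group("iter_per_sec")),
    }


def derived(result):
    """Recompute the iteration count and i/s from msecs and ms/i."""
    iterations = round(result["msecs"] / result["ms_per_iter"])
    iter_per_sec = 1000.0 / result["ms_per_iter"]
    return iterations, iter_per_sec


row = parse_result_line(
    "BufferCreateINT: msecs: 140 || ms/i: 23.3333 || i/s: 42.8571"
)
iterations, ips = derived(row)
print(row["test"], iterations, round(ips, 2))
```

Because the iteration count differs between tests, ms/i and i/s are the columns to compare across tests; raw msecs is only comparable for the same test.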

[edit3]
WTF is this all about?
Erm, you could read http://infmath.uibk.ac.at/teaching/...;table_id=tasks&men_task=fin_bak&sem= ;)
(direct pdf link: http://landesjugendtheater.at/misc/bakk1.pdf )



[-----------------------------------------------------------------------]



[edit4]
[copied from my post on page 4]
ORC (Offline Result Comparator) 0.1 is now available:
[editX] No, 0.4:
http://landesjugendtheater.at/misc/orc04_release.zip
(109kB - Now even more 56k friendly!)

Requires the .Net 2.0 beta framework.

How to Use:
  • Click "Load" to load results from the db file.
  • Right click on results in list to view details / edit / remove
  • Select (a) result(s), (a) test(s), and resolutions that interest you
  • Click on the chart window to draw a new chart
That's it. One more shot:
orc02.png


Have fun! Currently the db contains only results from the first page. If you add more, please save and send me the results.db file.
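
If you would rather compare pasted logs by hand, here is a rough Python sketch that groups a console log by image size. To be clear, this is not ORC's code and has nothing to do with the results.db format; the function name `parse_log` and the regular expressions are assumptions based only on the console output shown in this thread.

```python
import re

# Section header, e.g. "Testing 32x32 image:"
SECTION_RE = re.compile(r"^Testing (\d+)x(\d+) image:")
# Result line, e.g. "JustCopy: msecs: 797 || ms/i: 0.3985 || i/s: 2509.41"
RESULT_RE = re.compile(
    r"^(\w+): msecs: ([\d.]+) \|\| ms/i: ([\d.]+) \|\| i/s: ([\d.]+)"
)


def parse_log(text):
    """Map image width -> {test name -> ms per iteration}."""
    results = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        section = SECTION_RE.match(line)
        if section:
            current = int(section.group(1))
            results[current] = {}
            continue
        result = RESULT_RE.match(line)
        # Ignore anything before the first "Testing ..." header (init messages).
        if result and current is not None:
            results[current][result.group(1)] = float(result.group(3))
    return results


sample = """\
Initialization complete.

Testing 32x32 image:
JustCopy: msecs: 797 || ms/i: 0.3985 || i/s: 2509.41
3x3Conv: msecs: 453 || ms/i: 0.453 || i/s: 2207.51

Testing 64x64 image:
JustCopy: msecs: 937 || ms/i: 0.4685 || i/s: 2134.47
"""

parsed = parse_log(sample)
for size in sorted(parsed):
    print(size, parsed[size])
```

The sketch keys sections by width only, which is enough here because all test images in the posted logs are square.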
 
W2k, P4 2.4GHz, R9800Pro CAT4.12

Code:
GL filter framework 1.2999 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
BufferCreateINT: msecs: 711 || ms/i: 118.5 || i/s: 8.43882
BufferCreateINT16: msecs: 681 || ms/i: 113.5 || i/s: 8.81057
BufferCreateFP16: msecs: 681 || ms/i: 113.5 || i/s: 8.81057
BufferCreateFP32: msecs: 691 || ms/i: 115.167 || i/s: 8.68307
JustCopy: msecs: 581 || ms/i: 0.2905 || i/s: 3442.34
SimpleSmooth: msecs: 591 || ms/i: 0.2955 || i/s: 3384.09
TexNoise: msecs: 691 || ms/i: 0.3455 || i/s: 2894.36
3x3Conv: msecs: 360 || ms/i: 0.36 || i/s: 2777.78
TEncode: msecs: 281 || ms/i: 0.281 || i/s: 3558.72
TDecode: msecs: 601 || ms/i: 0.601 || i/s: 1663.89
LinDiffINT: msecs: 611 || ms/i: 0.3055 || i/s: 3273.32
LinDiffINT16: msecs: 601 || ms/i: 0.3005 || i/s: 3327.79
LinDiffFP16: msecs: 631 || ms/i: 0.3155 || i/s: 3169.57
LinDiffFP32: msecs: 621 || ms/i: 0.3105 || i/s: 3220.61
PMTEncoded: msecs: 1012 || ms/i: 1.012 || i/s: 988.142
PMStandard: msecs: 971 || ms/i: 0.971 || i/s: 1029.87
PMBuffered: msecs: 110 || ms/i: 0.22 || i/s: 4545.45

Testing 64x64 image:
BufferCreateINT: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateINT16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP32: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
JustCopy: msecs: 581 || ms/i: 0.2905 || i/s: 3442.34
SimpleSmooth: msecs: 601 || ms/i: 0.3005 || i/s: 3327.79
TexNoise: msecs: 671 || ms/i: 0.3355 || i/s: 2980.63
3x3Conv: msecs: 340 || ms/i: 0.34 || i/s: 2941.18
TEncode: msecs: 291 || ms/i: 0.291 || i/s: 3436.43
TDecode: msecs: 591 || ms/i: 0.591 || i/s: 1692.05
LinDiffINT: msecs: 611 || ms/i: 0.3055 || i/s: 3273.32
LinDiffINT16: msecs: 611 || ms/i: 0.3055 || i/s: 3273.32
LinDiffFP16: msecs: 631 || ms/i: 0.3155 || i/s: 3169.57
LinDiffFP32: msecs: 641 || ms/i: 0.3205 || i/s: 3120.12
PMTEncoded: msecs: 992 || ms/i: 0.992 || i/s: 1008.06
PMStandard: msecs: 992 || ms/i: 0.992 || i/s: 1008.06
PMBuffered: msecs: 100 || ms/i: 0.2 || i/s: 5000

Testing 128x128 image:
BufferCreateINT: msecs: 731 || ms/i: 121.833 || i/s: 8.20793
BufferCreateINT16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP32: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
JustCopy: msecs: 581 || ms/i: 0.2905 || i/s: 3442.34
SimpleSmooth: msecs: 591 || ms/i: 0.2955 || i/s: 3384.09
TexNoise: msecs: 661 || ms/i: 0.3305 || i/s: 3025.72
3x3Conv: msecs: 350 || ms/i: 0.35 || i/s: 2857.14
TEncode: msecs: 301 || ms/i: 0.301 || i/s: 3322.26
TDecode: msecs: 601 || ms/i: 0.601 || i/s: 1663.89
LinDiffINT: msecs: 641 || ms/i: 0.3205 || i/s: 3120.12
LinDiffINT16: msecs: 651 || ms/i: 0.3255 || i/s: 3072.2
LinDiffFP16: msecs: 651 || ms/i: 0.3255 || i/s: 3072.2
LinDiffFP32: msecs: 661 || ms/i: 0.3305 || i/s: 3025.72
PMTEncoded: msecs: 1021 || ms/i: 1.021 || i/s: 979.432
PMStandard: msecs: 1012 || ms/i: 1.012 || i/s: 988.142
PMBuffered: msecs: 270 || ms/i: 0.54 || i/s: 1851.85

Testing 256x256 image:
BufferCreateINT: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateINT16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP32: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
JustCopy: msecs: 601 || ms/i: 0.3005 || i/s: 3327.79
SimpleSmooth: msecs: 621 || ms/i: 0.3105 || i/s: 3220.61
TexNoise: msecs: 691 || ms/i: 0.3455 || i/s: 2894.36
3x3Conv: msecs: 1041 || ms/i: 1.041 || i/s: 960.615
TEncode: msecs: 301 || ms/i: 0.301 || i/s: 3322.26
TDecode: msecs: 600 || ms/i: 0.6 || i/s: 1666.67
LinDiffINT: msecs: 781 || ms/i: 0.3905 || i/s: 2560.82
LinDiffINT16: msecs: 781 || ms/i: 0.3905 || i/s: 2560.82
LinDiffFP16: msecs: 782 || ms/i: 0.391 || i/s: 2557.54
LinDiffFP32: msecs: 921 || ms/i: 0.4605 || i/s: 2171.55
PMTEncoded: msecs: 1032 || ms/i: 1.032 || i/s: 968.992
PMStandard: msecs: 1612 || ms/i: 1.612 || i/s: 620.347
PMBuffered: msecs: 1021 || ms/i: 2.042 || i/s: 489.716

Testing 512x512 image:
BufferCreateINT: msecs: 731 || ms/i: 121.833 || i/s: 8.20793
BufferCreateINT16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP16: msecs: 722 || ms/i: 120.333 || i/s: 8.31025
BufferCreateFP32: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
JustCopy: msecs: 450 || ms/i: 0.45 || i/s: 2222.22
SimpleSmooth: msecs: 1062 || ms/i: 1.062 || i/s: 941.62
TexNoise: msecs: 480 || ms/i: 0.48 || i/s: 2083.33
3x3Conv: msecs: 1993 || ms/i: 3.986 || i/s: 250.878
TEncode: msecs: 151 || ms/i: 0.302 || i/s: 3311.26
TDecode: msecs: 901 || ms/i: 1.802 || i/s: 554.939
LinDiffINT: msecs: 1482 || ms/i: 1.482 || i/s: 674.764
LinDiffINT16: msecs: 1482 || ms/i: 1.482 || i/s: 674.764
LinDiffFP16: msecs: 1482 || ms/i: 1.482 || i/s: 674.764
LinDiffFP32: msecs: 1753 || ms/i: 1.753 || i/s: 570.451
PMTEncoded: msecs: 1232 || ms/i: 2.464 || i/s: 405.844
PMStandard: msecs: 3706 || ms/i: 7.412 || i/s: 134.916
PMBuffered: msecs: 1722 || ms/i: 6.888 || i/s: 145.18

Testing 1024x1024 image:
BufferCreateINT: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateINT16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP16: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
BufferCreateFP32: msecs: 721 || ms/i: 120.167 || i/s: 8.32178
JustCopy: msecs: 1713 || ms/i: 1.713 || i/s: 583.771
SimpleSmooth: msecs: 4146 || ms/i: 4.146 || i/s: 241.196
TexNoise: msecs: 1802 || ms/i: 1.802 || i/s: 554.939
3x3Conv: msecs: 7912 || ms/i: 15.824 || i/s: 63.1951
TEncode: msecs: 431 || ms/i: 0.862 || i/s: 1160.09
TDecode: msecs: 3545 || ms/i: 7.09 || i/s: 141.044
LinDiffINT: msecs: 5829 || ms/i: 5.829 || i/s: 171.556
LinDiffINT16: msecs: 5838 || ms/i: 5.838 || i/s: 171.292
LinDiffFP16: msecs: 5829 || ms/i: 5.829 || i/s: 171.556
LinDiffFP32: msecs: 6940 || ms/i: 6.94 || i/s: 144.092
PMTEncoded: msecs: 4717 || ms/i: 9.434 || i/s: 106
PMStandard: msecs: 12008 || ms/i: 24.016 || i/s: 41.6389
PMBuffered: msecs: 8532 || ms/i: 34.128 || i/s: 29.3015

Finished. Press return key to close...
                Don't forget to copy the results!
 
GLRenderTexture: Could not find an acceptable pixel format.

:cry:

This on a BFG 6800GT OC. Some prereq in software I don't have installed?

Oh, WinXP Home and Forceware 71.84
 
Most probably my fault, not yours.
When does it bug out? Thanks for your help!

Thanks for the results Tweaker.
 
x800pro (cat5.3) / amd64 3400+ (2.4GHz)

Code:
GL filter framework 1.2999 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
BufferCreateINT: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateINT16: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
JustCopy: msecs: 297 || ms/i: 0.1485 || i/s: 6734.01
SimpleSmooth: msecs: 297 || ms/i: 0.1485 || i/s: 6734.01
TexNoise: msecs: 312 || ms/i: 0.156 || i/s: 6410.26
3x3Conv: msecs: 172 || ms/i: 0.172 || i/s: 5813.95
TEncode: msecs: 140 || ms/i: 0.14 || i/s: 7142.86
TDecode: msecs: 235 || ms/i: 0.235 || i/s: 4255.32
LinDiffINT: msecs: 344 || ms/i: 0.172 || i/s: 5813.95
LinDiffINT16: msecs: 360 || ms/i: 0.18 || i/s: 5555.56
LinDiffFP16: msecs: 313 || ms/i: 0.1565 || i/s: 6389.78
LinDiffFP32: msecs: 344 || ms/i: 0.172 || i/s: 5813.95
PMTEncoded: msecs: 547 || ms/i: 0.547 || i/s: 1828.15
PMStandard: msecs: 563 || ms/i: 0.563 || i/s: 1776.2
PMBuffered: msecs: 78 || ms/i: 0.156 || i/s: 6410.26

Testing 64x64 image:
BufferCreateINT: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateINT16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
JustCopy: msecs: 344 || ms/i: 0.172 || i/s: 5813.95
SimpleSmooth: msecs: 343 || ms/i: 0.1715 || i/s: 5830.9
TexNoise: msecs: 360 || ms/i: 0.18 || i/s: 5555.56
3x3Conv: msecs: 187 || ms/i: 0.187 || i/s: 5347.59
TEncode: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
TDecode: msecs: 250 || ms/i: 0.25 || i/s: 4000
LinDiffINT: msecs: 360 || ms/i: 0.18 || i/s: 5555.56
LinDiffINT16: msecs: 360 || ms/i: 0.18 || i/s: 5555.56
LinDiffFP16: msecs: 359 || ms/i: 0.1795 || i/s: 5571.03
LinDiffFP32: msecs: 359 || ms/i: 0.1795 || i/s: 5571.03
PMTEncoded: msecs: 484 || ms/i: 0.484 || i/s: 2066.12
PMStandard: msecs: 485 || ms/i: 0.485 || i/s: 2061.86
PMBuffered: msecs: 78 || ms/i: 0.156 || i/s: 6410.26

Testing 128x128 image:
BufferCreateINT: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateINT16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
JustCopy: msecs: 296 || ms/i: 0.148 || i/s: 6756.76
SimpleSmooth: msecs: 297 || ms/i: 0.1485 || i/s: 6734.01
TexNoise: msecs: 313 || ms/i: 0.1565 || i/s: 6389.78
3x3Conv: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
TEncode: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
TDecode: msecs: 203 || ms/i: 0.203 || i/s: 4926.11
LinDiffINT: msecs: 313 || ms/i: 0.1565 || i/s: 6389.78
LinDiffINT16: msecs: 313 || ms/i: 0.1565 || i/s: 6389.78
LinDiffFP16: msecs: 359 || ms/i: 0.1795 || i/s: 5571.03
LinDiffFP32: msecs: 375 || ms/i: 0.1875 || i/s: 5333.33
PMTEncoded: msecs: 578 || ms/i: 0.578 || i/s: 1730.1
PMStandard: msecs: 578 || ms/i: 0.578 || i/s: 1730.1
PMBuffered: msecs: 125 || ms/i: 0.25 || i/s: 4000

Testing 256x256 image:
BufferCreateINT: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
BufferCreateINT16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
JustCopy: msecs: 297 || ms/i: 0.1485 || i/s: 6734.01
SimpleSmooth: msecs: 297 || ms/i: 0.1485 || i/s: 6734.01
TexNoise: msecs: 312 || ms/i: 0.156 || i/s: 6410.26
3x3Conv: msecs: 438 || ms/i: 0.438 || i/s: 2283.11
TEncode: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
TDecode: msecs: 219 || ms/i: 0.219 || i/s: 4566.21
LinDiffINT: msecs: 375 || ms/i: 0.1875 || i/s: 5333.33
LinDiffINT16: msecs: 391 || ms/i: 0.1955 || i/s: 5115.09
LinDiffFP16: msecs: 391 || ms/i: 0.1955 || i/s: 5115.09
LinDiffFP32: msecs: 468 || ms/i: 0.234 || i/s: 4273.5
PMTEncoded: msecs: 485 || ms/i: 0.485 || i/s: 2061.86
PMStandard: msecs: 812 || ms/i: 0.812 || i/s: 1231.53
PMBuffered: msecs: 407 || ms/i: 0.814 || i/s: 1228.5

Testing 512x512 image:
BufferCreateINT: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateINT16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
JustCopy: msecs: 250 || ms/i: 0.25 || i/s: 4000
SimpleSmooth: msecs: 515 || ms/i: 0.515 || i/s: 1941.75
TexNoise: msecs: 500 || ms/i: 0.5 || i/s: 2000
3x3Conv: msecs: 688 || ms/i: 1.376 || i/s: 726.744
TEncode: msecs: 78 || ms/i: 0.156 || i/s: 6410.26
TDecode: msecs: 203 || ms/i: 0.406 || i/s: 2463.05
LinDiffINT: msecs: 594 || ms/i: 0.594 || i/s: 1683.5
LinDiffINT16: msecs: 594 || ms/i: 0.594 || i/s: 1683.5
LinDiffFP16: msecs: 578 || ms/i: 0.578 || i/s: 1730.1
LinDiffFP32: msecs: 718 || ms/i: 0.718 || i/s: 1392.76
PMTEncoded: msecs: 563 || ms/i: 1.126 || i/s: 888.099
PMStandard: msecs: 1328 || ms/i: 2.656 || i/s: 376.506
PMBuffered: msecs: 328 || ms/i: 1.312 || i/s: 762.195

Testing 1024x1024 image:
BufferCreateINT: msecs: 46 || ms/i: 7.66667 || i/s: 130.435
BufferCreateINT16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 79 || ms/i: 13.1667 || i/s: 75.9494
JustCopy: msecs: 1000 || ms/i: 1 || i/s: 1000
SimpleSmooth: msecs: 2000 || ms/i: 2 || i/s: 500
TexNoise: msecs: 1656 || ms/i: 1.656 || i/s: 603.865
3x3Conv: msecs: 2656 || ms/i: 5.312 || i/s: 188.253
TEncode: msecs: 110 || ms/i: 0.22 || i/s: 4545.45
TDecode: msecs: 922 || ms/i: 1.844 || i/s: 542.299
LinDiffINT: msecs: 2281 || ms/i: 2.281 || i/s: 438.404
LinDiffINT16: msecs: 2282 || ms/i: 2.282 || i/s: 438.212
LinDiffFP16: msecs: 2282 || ms/i: 2.282 || i/s: 438.212
LinDiffFP32: msecs: 3031 || ms/i: 3.031 || i/s: 329.924
PMTEncoded: msecs: 2094 || ms/i: 4.188 || i/s: 238.777
PMStandard: msecs: 5547 || ms/i: 11.094 || i/s: 90.1388
PMBuffered: msecs: 2422 || ms/i: 9.688 || i/s: 103.22

Finished. Press return key to close...
                Don't forget to copy the results!
 
PeterT said:
Most probably my fault, not yours.
When does it bug out? Thanks for your help!

Thanks for the results Tweaker.

Well, it comes up with a DOS box, which then gets overlaid by the outline of a window that says "Output" in the frame, but the window isn't really quite there, if you know what I mean. I click on the DOS box and it says at the bottom "Press return key to start benchmark". I do so and it errors out immediately under "Testing 32x32 image", on the second line: "BufferCreateInt: msecs 171 msi 28.5 i/s 35.0877"

Then comes the error line I mentioned previously, and below that "press return to quit".
 
Directly after BufferCreateInt? Ok, I'll look into it.

Damn that x800 is fast. Thanks tEd!

[edit]
I just had another look at the x800 results and it's very interesting that PMBuffered is actually faster than the standard version even at 1024x1024!
I'd never have expected that. Your CPU is too slow ;)! Or rather, context switches REALLY suck on ultra-high-end gfx systems.
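
To put numbers on that, here is a small Python sketch comparing PMStandard and PMBuffered at 1024x1024. The ms/i values are copied from the results posted so far in this thread; the dictionary layout and the helper name `buffered_speedup` are just assumptions for illustration. It uses ms/i rather than msecs because the two tests appear to run a different number of iterations.

```python
# ms/i at 1024x1024, copied from the result logs posted in this thread.
results_1024 = {
    "Radeon 9700NP / Athlon XP 1800+": {"PMStandard": 32.47, "PMBuffered": 42.44},
    "R9800Pro / P4 2.4GHz": {"PMStandard": 24.016, "PMBuffered": 34.128},
    "x800pro / amd64 3400+": {"PMStandard": 11.094, "PMBuffered": 9.688},
}


def buffered_speedup(entry):
    """Ratio > 1 means PMBuffered needs less time per iteration than PMStandard."""
    return entry["PMStandard"] / entry["PMBuffered"]


for system, entry in results_1024.items():
    ratio = buffered_speedup(entry)
    verdict = "buffered faster" if ratio > 1.0 else "standard faster"
    print(f"{system}: {ratio:.2f}x ({verdict})")
```

On these figures only the x800 comes out with a ratio above 1; both R3xx systems are slower with the buffered version at this size.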
 
Just confirming what Geo said. Same problem on WinXP Pro, ForceWare 71.84, and Geforce 6800 Ultra (so, very similar setup unfortunately, maybe other ForceWare revisions work better). Think it's safe to assume that this is a general Geforce/ForceWare problem.
 
XP 2600+
R9800P

Code:
GL filter framework 1.2999 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
BufferCreateINT: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
BufferCreateINT16: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 781 || ms/i: 130.167 || i/s: 7.68246
JustCopy: msecs: 484 || ms/i: 0.242 || i/s: 4132.23
SimpleSmooth: msecs: 500 || ms/i: 0.25 || i/s: 4000
TexNoise: msecs: 578 || ms/i: 0.289 || i/s: 3460.21
3x3Conv: msecs: 281 || ms/i: 0.281 || i/s: 3558.72
TEncode: msecs: 250 || ms/i: 0.25 || i/s: 4000
TDecode: msecs: 375 || ms/i: 0.375 || i/s: 2666.67
LinDiffINT: msecs: 516 || ms/i: 0.258 || i/s: 3875.97
LinDiffINT16: msecs: 578 || ms/i: 0.289 || i/s: 3460.21
LinDiffFP16: msecs: 563 || ms/i: 0.2815 || i/s: 3552.4
LinDiffFP32: msecs: 578 || ms/i: 0.289 || i/s: 3460.21
PMTEncoded: msecs: 922 || ms/i: 0.922 || i/s: 1084.6
PMStandard: msecs: 812 || ms/i: 0.812 || i/s: 1231.53
PMBuffered: msecs: 125 || ms/i: 0.25 || i/s: 4000

Testing 64x64 image:
BufferCreateINT: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateINT16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 750 || ms/i: 125 || i/s: 8
JustCopy: msecs: 532 || ms/i: 0.266 || i/s: 3759.4
SimpleSmooth: msecs: 531 || ms/i: 0.2655 || i/s: 3766.48
TexNoise: msecs: 594 || ms/i: 0.297 || i/s: 3367
3x3Conv: msecs: 296 || ms/i: 0.296 || i/s: 3378.38
TEncode: msecs: 265 || ms/i: 0.265 || i/s: 3773.58
TDecode: msecs: 422 || ms/i: 0.422 || i/s: 2369.67
LinDiffINT: msecs: 562 || ms/i: 0.281 || i/s: 3558.72
LinDiffINT16: msecs: 562 || ms/i: 0.281 || i/s: 3558.72
LinDiffFP16: msecs: 531 || ms/i: 0.2655 || i/s: 3766.48
LinDiffFP32: msecs: 531 || ms/i: 0.2655 || i/s: 3766.48
PMTEncoded: msecs: 938 || ms/i: 0.938 || i/s: 1066.1
PMStandard: msecs: 906 || ms/i: 0.906 || i/s: 1103.75
PMBuffered: msecs: 140 || ms/i: 0.28 || i/s: 3571.43

Testing 128x128 image:
BufferCreateINT: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateINT16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 750 || ms/i: 125 || i/s: 8
JustCopy: msecs: 531 || ms/i: 0.2655 || i/s: 3766.48
SimpleSmooth: msecs: 547 || ms/i: 0.2735 || i/s: 3656.31
TexNoise: msecs: 609 || ms/i: 0.3045 || i/s: 3284.07
3x3Conv: msecs: 313 || ms/i: 0.313 || i/s: 3194.89
TEncode: msecs: 266 || ms/i: 0.266 || i/s: 3759.4
TDecode: msecs: 437 || ms/i: 0.437 || i/s: 2288.33
LinDiffINT: msecs: 562 || ms/i: 0.281 || i/s: 3558.72
LinDiffINT16: msecs: 531 || ms/i: 0.2655 || i/s: 3766.48
LinDiffFP16: msecs: 563 || ms/i: 0.2815 || i/s: 3552.4
LinDiffFP32: msecs: 547 || ms/i: 0.2735 || i/s: 3656.31
PMTEncoded: msecs: 859 || ms/i: 0.859 || i/s: 1164.14
PMStandard: msecs: 828 || ms/i: 0.828 || i/s: 1207.73
PMBuffered: msecs: 281 || ms/i: 0.562 || i/s: 1779.36

Testing 256x256 image:
BufferCreateINT: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateINT16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 750 || ms/i: 125 || i/s: 8
JustCopy: msecs: 500 || ms/i: 0.25 || i/s: 4000
SimpleSmooth: msecs: 562 || ms/i: 0.281 || i/s: 3558.72
TexNoise: msecs: 547 || ms/i: 0.2735 || i/s: 3656.31
3x3Conv: msecs: 1031 || ms/i: 1.031 || i/s: 969.932
TEncode: msecs: 250 || ms/i: 0.25 || i/s: 4000
TDecode: msecs: 485 || ms/i: 0.485 || i/s: 2061.86
LinDiffINT: msecs: 797 || ms/i: 0.3985 || i/s: 2509.41
LinDiffINT16: msecs: 797 || ms/i: 0.3985 || i/s: 2509.41
LinDiffFP16: msecs: 797 || ms/i: 0.3985 || i/s: 2509.41
LinDiffFP32: msecs: 937 || ms/i: 0.4685 || i/s: 2134.47
PMTEncoded: msecs: 875 || ms/i: 0.875 || i/s: 1142.86
PMStandard: msecs: 1515 || ms/i: 1.515 || i/s: 660.066
PMBuffered: msecs: 1031 || ms/i: 2.062 || i/s: 484.966

Testing 512x512 image:
BufferCreateINT: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateINT16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 750 || ms/i: 125 || i/s: 8
JustCopy: msecs: 453 || ms/i: 0.453 || i/s: 2207.51
SimpleSmooth: msecs: 1094 || ms/i: 1.094 || i/s: 914.077
TexNoise: msecs: 656 || ms/i: 0.656 || i/s: 1524.39
3x3Conv: msecs: 2000 || ms/i: 4 || i/s: 250
TEncode: msecs: 125 || ms/i: 0.25 || i/s: 4000
TDecode: msecs: 906 || ms/i: 1.812 || i/s: 551.876
LinDiffINT: msecs: 1484 || ms/i: 1.484 || i/s: 673.854
LinDiffINT16: msecs: 1484 || ms/i: 1.484 || i/s: 673.854
LinDiffFP16: msecs: 1515 || ms/i: 1.515 || i/s: 660.066
LinDiffFP32: msecs: 1765 || ms/i: 1.765 || i/s: 566.572
PMTEncoded: msecs: 1219 || ms/i: 2.438 || i/s: 410.172
PMStandard: msecs: 2843 || ms/i: 5.686 || i/s: 175.871
PMBuffered: msecs: 1985 || ms/i: 7.94 || i/s: 125.945

Testing 1024x1024 image:
BufferCreateINT: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateINT16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 750 || ms/i: 125 || i/s: 8
JustCopy: msecs: 1735 || ms/i: 1.735 || i/s: 576.369
SimpleSmooth: msecs: 4172 || ms/i: 4.172 || i/s: 239.693
TexNoise: msecs: 1859 || ms/i: 1.859 || i/s: 537.924
3x3Conv: msecs: 7937 || ms/i: 15.874 || i/s: 62.9961
TEncode: msecs: 437 || ms/i: 0.874 || i/s: 1144.16
TDecode: msecs: 3531 || ms/i: 7.062 || i/s: 141.603
LinDiffINT: msecs: 5859 || ms/i: 5.859 || i/s: 170.678
LinDiffINT16: msecs: 5829 || ms/i: 5.829 || i/s: 171.556
LinDiffFP16: msecs: 5828 || ms/i: 5.828 || i/s: 171.585
LinDiffFP32: msecs: 6985 || ms/i: 6.985 || i/s: 143.164
PMTEncoded: msecs: 4703 || ms/i: 9.406 || i/s: 106.315
PMStandard: msecs: 11453 || ms/i: 22.906 || i/s: 43.6567
PMBuffered: msecs: 10438 || ms/i: 41.752 || i/s: 23.9509

Finished. Press return key to close...
                Don't forget to copy the results!
 
wireframe said:
Just confirming what Geo said. Same problem on WinXP Pro, ForceWare 71.84, and Geforce 6800 Ultra (so, very similar setup unfortunately, maybe other ForceWare revisions work better). Think it's safe to assume that this is a general Geforce/ForceWare problem.

Thanks for the confirmation, I already expected this to be a problem on all NV40 boards. Could you try the modified .exe I posted above? If that doesn't fix the problem I'll really have to dive into it some more. *yawn*
 
There's something weird about this test on my system.

First of all, it won't allow me to copy the results when it's finished: the output part of the program stops responding and I can't highlight anything in the command window to copy it.

But the main problem is, I seem to be getting worse results than the 9700 pro and the x800 pro, which shouldn't be happening, since I'm running an X850XT with an A64 3500+
 
Ok, it is not hanging, it is just acting "weird". The last test takes ages, and while exploring I gauged it with the internal thermometer of the GPU. All the tests pound on the GPU, and you can see the temp increase as well as fall briefly between the test sets. The same is true for the last test, except the GPU is showing temperatures that suggest it is near idling. Not very scientific, I know, but it definitely seems that the hardware is not being exploited in the last test. I think the results back that up.

Results:

Geforce 6800 Ultra AGP x8 (425/1200), ForceWare 71.84, Athlon 64 3500+, Windows XP Pro SP2

Code:
GL filter framework 1.2999 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
BufferCreateINT: msecs: 78 || ms/i: 13 || i/s: 76.9231
No suitable INT format found. Trying FP... (Flaky 6600 workaround)

BufferCreateINT16: msecs: 94 || ms/i: 15.6667 || i/s: 63.8298
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
JustCopy: msecs: 156 || ms/i: 0.078 || i/s: 12820.5
SimpleSmooth: msecs: 204 || ms/i: 0.102 || i/s: 9803.92
TexNoise: msecs: 187 || ms/i: 0.0935 || i/s: 10695.2
3x3Conv: msecs: 203 || ms/i: 0.203 || i/s: 4926.11
TEncode: msecs: 125 || ms/i: 0.125 || i/s: 8000
TDecode: msecs: 125 || ms/i: 0.125 || i/s: 8000
LinDiffINT: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
LinDiffINT16: msecs: 156 || ms/i: 0.078 || i/s: 12820.5
LinDiffFP16: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
LinDiffFP32: msecs: 156 || ms/i: 0.078 || i/s: 12820.5
PMTEncoded: msecs: 343 || ms/i: 0.343 || i/s: 2915.45
PMStandard: msecs: 265 || ms/i: 0.265 || i/s: 3773.58
PMBuffered: msecs: 15 || ms/i: 0.03 || i/s: 33333.3

Testing 64x64 image:
BufferCreateINT: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateINT16: msecs: 93 || ms/i: 15.5 || i/s: 64.5161
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
JustCopy: msecs: 141 || ms/i: 0.0705 || i/s: 14184.4
SimpleSmooth: msecs: 140 || ms/i: 0.07 || i/s: 14285.7
TexNoise: msecs: 141 || ms/i: 0.0705 || i/s: 14184.4
3x3Conv: msecs: 78 || ms/i: 0.078 || i/s: 12820.5
TEncode: msecs: 62 || ms/i: 0.062 || i/s: 16129
TDecode: msecs: 78 || ms/i: 0.078 || i/s: 12820.5
LinDiffINT: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
LinDiffINT16: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
LinDiffFP16: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
LinDiffFP32: msecs: 157 || ms/i: 0.0785 || i/s: 12738.9
PMTEncoded: msecs: 266 || ms/i: 0.266 || i/s: 3759.4
PMStandard: msecs: 281 || ms/i: 0.281 || i/s: 3558.72
PMBuffered: msecs: 31 || ms/i: 0.062 || i/s: 16129

Testing 128x128 image:
BufferCreateINT: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateINT16: msecs: 94 || ms/i: 15.6667 || i/s: 63.8298
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 78 || ms/i: 13 || i/s: 76.9231
JustCopy: msecs: 125 || ms/i: 0.0625 || i/s: 16000
SimpleSmooth: msecs: 125 || ms/i: 0.0625 || i/s: 16000
TexNoise: msecs: 156 || ms/i: 0.078 || i/s: 12820.5
3x3Conv: msecs: 78 || ms/i: 0.078 || i/s: 12820.5
TEncode: msecs: 79 || ms/i: 0.079 || i/s: 12658.2
TDecode: msecs: 78 || ms/i: 0.078 || i/s: 12820.5
LinDiffINT: msecs: 156 || ms/i: 0.078 || i/s: 12820.5
LinDiffINT16: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
LinDiffFP16: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
LinDiffFP32: msecs: 391 || ms/i: 0.1955 || i/s: 5115.09
PMTEncoded: msecs: 266 || ms/i: 0.266 || i/s: 3759.4
PMStandard: msecs: 609 || ms/i: 0.609 || i/s: 1642.04
PMBuffered: msecs: 110 || ms/i: 0.22 || i/s: 4545.45

Testing 256x256 image:
BufferCreateINT: msecs: 78 || ms/i: 13 || i/s: 76.9231
BufferCreateINT16: msecs: 94 || ms/i: 15.6667 || i/s: 63.8298
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
JustCopy: msecs: 141 || ms/i: 0.0705 || i/s: 14184.4
SimpleSmooth: msecs: 187 || ms/i: 0.0935 || i/s: 10695.2
TexNoise: msecs: 187 || ms/i: 0.0935 || i/s: 10695.2
3x3Conv: msecs: 188 || ms/i: 0.188 || i/s: 5319.15
TEncode: msecs: 78 || ms/i: 0.078 || i/s: 12820.5
TDecode: msecs: 125 || ms/i: 0.125 || i/s: 8000
LinDiffINT: msecs: 204 || ms/i: 0.102 || i/s: 9803.92
LinDiffINT16: msecs: 469 || ms/i: 0.2345 || i/s: 4264.39
LinDiffFP16: msecs: 500 || ms/i: 0.25 || i/s: 4000
LinDiffFP32: msecs: 1484 || ms/i: 0.742 || i/s: 1347.71
PMTEncoded: msecs: 719 || ms/i: 0.719 || i/s: 1390.82
PMStandard: msecs: 2344 || ms/i: 2.344 || i/s: 426.621
PMBuffered: msecs: 719 || ms/i: 1.438 || i/s: 695.41

Testing 512x512 image:
BufferCreateINT: msecs: 78 || ms/i: 13 || i/s: 76.9231
BufferCreateINT16: msecs: 94 || ms/i: 15.6667 || i/s: 63.8298
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
JustCopy: msecs: 188 || ms/i: 0.188 || i/s: 5319.15
SimpleSmooth: msecs: 328 || ms/i: 0.328 || i/s: 3048.78
TexNoise: msecs: 250 || ms/i: 0.25 || i/s: 4000
3x3Conv: msecs: 328 || ms/i: 0.656 || i/s: 1524.39
TEncode: msecs: 78 || ms/i: 0.156 || i/s: 6410.26
TDecode: msecs: 219 || ms/i: 0.438 || i/s: 2283.11
LinDiffINT: msecs: 328 || ms/i: 0.328 || i/s: 3048.78
LinDiffINT16: msecs: 813 || ms/i: 0.813 || i/s: 1230.01
LinDiffFP16: msecs: 812 || ms/i: 0.812 || i/s: 1231.53
LinDiffFP32: msecs: 2703 || ms/i: 2.703 || i/s: 369.959
PMTEncoded: msecs: 1281 || ms/i: 2.562 || i/s: 390.32
PMStandard: msecs: 4266 || ms/i: 8.532 || i/s: 117.206
PMBuffered: msecs: 797 || ms/i: 3.188 || i/s: 313.676

Testing 1024x1024 image:
BufferCreateINT: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateINT16: msecs: 109 || ms/i: 18.1667 || i/s: 55.0459
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
JustCopy: msecs: 687 || ms/i: 0.687 || i/s: 1455.6
SimpleSmooth: msecs: 1250 || ms/i: 1.25 || i/s: 800
TexNoise: msecs: 875 || ms/i: 0.875 || i/s: 1142.86
3x3Conv: msecs: 1265 || ms/i: 2.53 || i/s: 395.257
TEncode: msecs: 282 || ms/i: 0.564 || i/s: 1773.05
TDecode: msecs: 843 || ms/i: 1.686 || i/s: 593.12
LinDiffINT: msecs: 1187 || ms/i: 1.187 || i/s: 842.46
LinDiffINT16: msecs: 3125 || ms/i: 3.125 || i/s: 320
LinDiffFP16: msecs: 3125 || ms/i: 3.125 || i/s: 320
LinDiffFP32: msecs: 10765 || ms/i: 10.765 || i/s: 92.8936
PMTEncoded: msecs: 4844 || ms/i: 9.688 || i/s: 103.22
PMStandard: msecs: 16875 || ms/i: 33.75 || i/s: 29.6296
PMBuffered: msecs: 278422 || ms/i: 1113.69 || i/s: 0.897918
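
The PMBuffered slowdown described at the top of this post shows up directly in the ms/i column. Here is a minimal Python sketch of one way to spot it; the helper name `growth_factors` is made up for illustration, and the numbers are copied from the 6800 Ultra log above. Each step up in image size quadruples the pixel count, so ms/i growing far beyond 4x hints that something other than shader throughput is the limit. The 10x cutoff is an arbitrary choice for this sketch.

```python
# ms/i values for PMBuffered, copied from the 6800 Ultra log above,
# keyed by image edge length in pixels.
pm_buffered_ms_per_iter = {
    32: 0.03,
    64: 0.062,
    128: 0.22,
    256: 1.438,
    512: 3.188,
    1024: 1113.69,
}


def growth_factors(ms_by_size):
    """Return {size: ms/i ratio versus the next smaller image size}."""
    sizes = sorted(ms_by_size)
    return {
        big: ms_by_size[big] / ms_by_size[small]
        for small, big in zip(sizes, sizes[1:])
    }


factors = growth_factors(pm_buffered_ms_per_iter)
for size, factor in factors.items():
    # Each step has 4x the pixels; flag anything growing more than 10x
    # (arbitrary threshold, chosen only for this illustration).
    flag = "  <-- suspicious" if factor > 10 else ""
    print(f"{size}x{size}: {factor:.1f}x the ms/i of the previous size{flag}")
```

With these numbers only the 1024x1024 step gets flagged, which matches the "near idling" temperature observation.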
 
PeterT said:
geo (or anyone else with the same problem):
Can you try http://www.landesjugendtheater.at/misc/FBenchWIP.exe ?
(put it into the folder with the old .exe)

If I guessed the cause correctly that should fix the problem. If not I'll be back in ~6 hours, it's 4 AM here.

Thanks again for testing this.

Just started burning a DVD for my wife. Gonna take a couple hours, and I don't know how this app will coexist, so I'm thinking "not right now". So it'll be a bit. Have a nice nap! :)
 
9500 pro + amd 2200
Code:
Testing 32x32 image:
BufferCreateINT: msecs: 781 || ms/i: 130.167 || i/s: 7.68246
BufferCreateINT16: msecs: 797 || ms/i: 132.833 || i/s: 7.52823
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 750 || ms/i: 125 || i/s: 8
JustCopy: msecs: 812 || ms/i: 0.406 || i/s: 2463.05
SimpleSmooth: msecs: 828 || ms/i: 0.414 || i/s: 2415.46
TexNoise: msecs: 890 || ms/i: 0.445 || i/s: 2247.19
3x3Conv: msecs: 485 || ms/i: 0.485 || i/s: 2061.86
TEncode: msecs: 422 || ms/i: 0.422 || i/s: 2369.67
TDecode: msecs: 969 || ms/i: 0.969 || i/s: 1031.99
LinDiffINT: msecs: 1078 || ms/i: 0.539 || i/s: 1855.29
LinDiffINT16: msecs: 1047 || ms/i: 0.5235 || i/s: 1910.22
LinDiffFP16: msecs: 1062 || ms/i: 0.531 || i/s: 1883.24
LinDiffFP32: msecs: 937 || ms/i: 0.4685 || i/s: 2134.47
PMTEncoded: msecs: 1719 || ms/i: 1.719 || i/s: 581.734
PMStandard: msecs: 1469 || ms/i: 1.469 || i/s: 680.735
PMBuffered: msecs: 172 || ms/i: 0.344 || i/s: 2906.98

Testing 64x64 image:
BufferCreateINT: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
BufferCreateINT16: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
JustCopy: msecs: 953 || ms/i: 0.4765 || i/s: 2098.64
SimpleSmooth: msecs: 1015 || ms/i: 0.5075 || i/s: 1970.44
TexNoise: msecs: 1063 || ms/i: 0.5315 || i/s: 1881.47
3x3Conv: msecs: 562 || ms/i: 0.562 || i/s: 1779.36
TEncode: msecs: 469 || ms/i: 0.469 || i/s: 2132.2
TDecode: msecs: 1015 || ms/i: 1.015 || i/s: 985.222
LinDiffINT: msecs: 1078 || ms/i: 0.539 || i/s: 1855.29
LinDiffINT16: msecs: 1078 || ms/i: 0.539 || i/s: 1855.29
LinDiffFP16: msecs: 1078 || ms/i: 0.539 || i/s: 1855.29
LinDiffFP32: msecs: 1094 || ms/i: 0.547 || i/s: 1828.15
PMTEncoded: msecs: 1531 || ms/i: 1.531 || i/s: 653.168
PMStandard: msecs: 1484 || ms/i: 1.484 || i/s: 673.854
PMBuffered: msecs: 171 || ms/i: 0.342 || i/s: 2923.98

Testing 128x128 image:
BufferCreateINT: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
BufferCreateINT16: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
BufferCreateFP16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP32: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
JustCopy: msecs: 844 || ms/i: 0.422 || i/s: 2369.67
SimpleSmooth: msecs: 844 || ms/i: 0.422 || i/s: 2369.67
TexNoise: msecs: 890 || ms/i: 0.445 || i/s: 2247.19
3x3Conv: msecs: 469 || ms/i: 0.469 || i/s: 2132.2
TEncode: msecs: 422 || ms/i: 0.422 || i/s: 2369.67
TDecode: msecs: 938 || ms/i: 0.938 || i/s: 1066.1
LinDiffINT: msecs: 1093 || ms/i: 0.5465 || i/s: 1829.83
LinDiffINT16: msecs: 969 || ms/i: 0.4845 || i/s: 2063.98
LinDiffFP16: msecs: 969 || ms/i: 0.4845 || i/s: 2063.98
LinDiffFP32: msecs: 1125 || ms/i: 0.5625 || i/s: 1777.78
PMTEncoded: msecs: 1750 || ms/i: 1.75 || i/s: 571.429
PMStandard: msecs: 1531 || ms/i: 1.531 || i/s: 653.168
PMBuffered: msecs: 297 || ms/i: 0.594 || i/s: 1683.5

Testing 256x256 image:
BufferCreateINT: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
BufferCreateINT16: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
BufferCreateFP16: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
BufferCreateFP32: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
JustCopy: msecs: 1000 || ms/i: 0.5 || i/s: 2000
SimpleSmooth: msecs: 1047 || ms/i: 0.5235 || i/s: 1910.22
TexNoise: msecs: 1109 || ms/i: 0.5545 || i/s: 1803.43
3x3Conv: msecs: 1235 || ms/i: 1.235 || i/s: 809.717
TEncode: msecs: 500 || ms/i: 0.5 || i/s: 2000
TDecode: msecs: 1031 || ms/i: 1.031 || i/s: 969.932
LinDiffINT: msecs: 1125 || ms/i: 0.5625 || i/s: 1777.78
LinDiffINT16: msecs: 1125 || ms/i: 0.5625 || i/s: 1777.78
LinDiffFP16: msecs: 1125 || ms/i: 0.5625 || i/s: 1777.78
LinDiffFP32: msecs: 1282 || ms/i: 0.641 || i/s: 1560.06
PMTEncoded: msecs: 1563 || ms/i: 1.563 || i/s: 639.795
PMStandard: msecs: 1937 || ms/i: 1.937 || i/s: 516.262
PMBuffered: msecs: 1438 || ms/i: 2.876 || i/s: 347.705

Testing 512x512 image:
BufferCreateINT: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
BufferCreateINT16: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
BufferCreateFP16: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
BufferCreateFP32: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
JustCopy: msecs: 890 || ms/i: 0.89 || i/s: 1123.6
SimpleSmooth: msecs: 1203 || ms/i: 1.203 || i/s: 831.255
TexNoise: msecs: 1235 || ms/i: 1.235 || i/s: 809.717
3x3Conv: msecs: 2500 || ms/i: 5 || i/s: 200
TEncode: msecs: 219 || ms/i: 0.438 || i/s: 2283.11
TDecode: msecs: 500 || ms/i: 1 || i/s: 1000
LinDiffINT: msecs: 2000 || ms/i: 2 || i/s: 500
LinDiffINT16: msecs: 1922 || ms/i: 1.922 || i/s: 520.291
LinDiffFP16: msecs: 1656 || ms/i: 1.656 || i/s: 603.865
LinDiffFP32: msecs: 2468 || ms/i: 2.468 || i/s: 405.186
PMTEncoded: msecs: 1391 || ms/i: 2.782 || i/s: 359.454
PMStandard: msecs: 3500 || ms/i: 7 || i/s: 142.857
PMBuffered: msecs: 3219 || ms/i: 12.876 || i/s: 77.6639

Testing 1024x1024 image:
BufferCreateINT: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateINT16: msecs: 750 || ms/i: 125 || i/s: 8
BufferCreateFP16: msecs: 765 || ms/i: 127.5 || i/s: 7.84314
BufferCreateFP32: msecs: 766 || ms/i: 127.667 || i/s: 7.8329
JustCopy: msecs: 3296 || ms/i: 3.296 || i/s: 303.398
SimpleSmooth: msecs: 4938 || ms/i: 4.938 || i/s: 202.511
TexNoise: msecs: 4953 || ms/i: 4.953 || i/s: 201.898
3x3Conv: msecs: 7188 || ms/i: 14.376 || i/s: 69.5604
TEncode: msecs: 265 || ms/i: 0.53 || i/s: 1886.79
TDecode: msecs: 4047 || ms/i: 8.094 || i/s: 123.548
LinDiffINT: msecs: 8125 || ms/i: 8.125 || i/s: 123.077
LinDiffINT16: msecs: 7703 || ms/i: 7.703 || i/s: 129.82
LinDiffFP16: msecs: 8016 || ms/i: 8.016 || i/s: 124.75
LinDiffFP32: msecs: 7406 || ms/i: 7.406 || i/s: 135.026
PMTEncoded: msecs: 6093 || ms/i: 12.186 || i/s: 82.0614
PMStandard: msecs: 14922 || ms/i: 29.844 || i/s: 33.5076
PMBuffered: msecs: 13875 || ms/i: 55.5 || i/s: 18.018
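
For anyone reading these logs: the three columns in each result line appear to be related. Judging by the numbers, ms/i is msecs divided by an iteration count, and i/s is 1000 divided by ms/i. The iteration count is not printed, so the Python sketch below infers it; treat that as a reading of the logs, not documented behaviour of the benchmark. `parse_result_line` is a hypothetical helper name.

```python
import re

# Matches lines such as "JustCopy: msecs: 812 || ms/i: 0.406 || i/s: 2463.05".
LINE_RE = re.compile(
    r"^(?P<name>\w+): msecs: (?P<msecs>[\d.]+) \|\| "
    r"ms/i: (?P<ms_per_iter>[\d.]+) \|\| i/s: (?P<iter_per_sec>[\d.]+)"
)


def parse_result_line(line):
    """Parse one benchmark line into a dict, or return None if it doesn't match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    row = {"name": m["name"]}
    for key in ("msecs", "ms_per_iter", "iter_per_sec"):
        row[key] = float(m[key])
    # Inferred, not printed by the benchmark: iterations = msecs / (ms/i).
    row["iterations"] = round(row["msecs"] / row["ms_per_iter"])
    return row


row = parse_result_line("JustCopy: msecs: 812 || ms/i: 0.406 || i/s: 2463.05")
print(row["iterations"])          # inferred iteration count
print(1000 / row["ms_per_iter"])  # should land close to the i/s column
```

Header lines such as "Testing 32x32 image:" do not match the pattern and come back as `None`, so a whole pasted log can be fed through line by line.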
 
Oh yeah, btw, that Profit!! part, what's up with that? Are we speaking about karma or something more concrete? :D

You said your image processing framework, but frankly I don't know what that means and where you're going with this. Not that this is a prerequisite; just curious where my ticks are being donated. ;)
 
Athlon XP-M 2.5Ghz / 6800GT @ Ultra / Forceware 71.81

Code:
GL filter framework 1.2999 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
BufferCreateINT: msecs: 94 || ms/i: 15.6667 || i/s: 63.8298
GLRenderTexture: Could not find an acceptable pixel format.
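
The failure above, together with the "Trying FP..." message that the alternative exe prints, suggests a try-one-format-then-fall-back pattern. A rough Python sketch of that idea follows. It is not the benchmark's actual code: `choose_buffer_format`, the format names, and the `is_supported` callback are all hypothetical stand-ins for a real pixel format query.

```python
def choose_buffer_format(preferred, fallbacks, is_supported):
    """Return the first supported format, trying `preferred` before `fallbacks`.

    `is_supported` is a hypothetical callback standing in for a driver query.
    Raises RuntimeError if no candidate is accepted.
    """
    for fmt in [preferred, *fallbacks]:
        if is_supported(fmt):
            return fmt
    raise RuntimeError("Could not find an acceptable pixel format.")


# Simulated driver that rejects the integer format but accepts float formats.
supported = {"FP16", "FP32"}
chosen = choose_buffer_format("INT8", ["FP16", "FP32"], lambda f: f in supported)
print(chosen)  # FP16
```

Taking the support check as a callback keeps the selection logic testable without a GL context.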
 
With alternative exe - Athlon XP-M 2.5Ghz / 6800GT @ Ultra / 71.84

Code:
GL filter framework 1.2999 test application by Peter Thoman 2004-2005

Gui initialized successfully.
DevIL initialized successfully.
 - DevIL Version: 167
OpenGL initialized successfully.
ILUT OpenGL mode set successfully.
Loaded required OpenGL extensions for GLPixelShader.
Loaded required OpenGL extensions for GLRenderTexture.
Loaded required OpenGL extensions for GLFilterStep.
Initialization complete.

Press return key to start benchmark...



Testing 32x32 image:
BufferCreateINT: msecs: 78 || ms/i: 13 || i/s: 76.9231
No suitable INT format found. Trying FP... (Flaky 6600 workaround)

BufferCreateINT16: msecs: 78 || ms/i: 13 || i/s: 76.9231
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
JustCopy: msecs: 187 || ms/i: 0.0935 || i/s: 10695.2
SimpleSmooth: msecs: 266 || ms/i: 0.133 || i/s: 7518.8
TexNoise: msecs: 250 || ms/i: 0.125 || i/s: 8000
3x3Conv: msecs: 172 || ms/i: 0.172 || i/s: 5813.95
TEncode: msecs: 157 || ms/i: 0.157 || i/s: 6369.43
TDecode: msecs: 156 || ms/i: 0.156 || i/s: 6410.26
LinDiffINT: msecs: 218 || ms/i: 0.109 || i/s: 9174.31
LinDiffINT16: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
LinDiffFP16: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
LinDiffFP32: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
PMTEncoded: msecs: 485 || ms/i: 0.485 || i/s: 2061.86
PMStandard: msecs: 344 || ms/i: 0.344 || i/s: 2906.98
PMBuffered: msecs: 47 || ms/i: 0.094 || i/s: 10638.3

Testing 64x64 image:
BufferCreateINT: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateINT16: msecs: 78 || ms/i: 13 || i/s: 76.9231
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 46 || ms/i: 7.66667 || i/s: 130.435
JustCopy: msecs: 188 || ms/i: 0.094 || i/s: 10638.3
SimpleSmooth: msecs: 171 || ms/i: 0.0855 || i/s: 11695.9
TexNoise: msecs: 188 || ms/i: 0.094 || i/s: 10638.3
3x3Conv: msecs: 94 || ms/i: 0.094 || i/s: 10638.3
TEncode: msecs: 93 || ms/i: 0.093 || i/s: 10752.7
TDecode: msecs: 110 || ms/i: 0.11 || i/s: 9090.91
LinDiffINT: msecs: 218 || ms/i: 0.109 || i/s: 9174.31
LinDiffINT16: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
LinDiffFP16: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
LinDiffFP32: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
PMTEncoded: msecs: 359 || ms/i: 0.359 || i/s: 2785.52
PMStandard: msecs: 344 || ms/i: 0.344 || i/s: 2906.98
PMBuffered: msecs: 47 || ms/i: 0.094 || i/s: 10638.3

Testing 128x128 image:
BufferCreateINT: msecs: 46 || ms/i: 7.66667 || i/s: 130.435
BufferCreateINT16: msecs: 94 || ms/i: 15.6667 || i/s: 63.8298
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
JustCopy: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
SimpleSmooth: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
TexNoise: msecs: 187 || ms/i: 0.0935 || i/s: 10695.2
3x3Conv: msecs: 94 || ms/i: 0.094 || i/s: 10638.3
TEncode: msecs: 94 || ms/i: 0.094 || i/s: 10638.3
TDecode: msecs: 109 || ms/i: 0.109 || i/s: 9174.31
LinDiffINT: msecs: 218 || ms/i: 0.109 || i/s: 9174.31
LinDiffINT16: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
LinDiffFP16: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
LinDiffFP32: msecs: 469 || ms/i: 0.2345 || i/s: 4264.39
PMTEncoded: msecs: 360 || ms/i: 0.36 || i/s: 2777.78
PMStandard: msecs: 750 || ms/i: 0.75 || i/s: 1333.33
PMBuffered: msecs: 156 || ms/i: 0.312 || i/s: 3205.13

Testing 256x256 image:
BufferCreateINT: msecs: 46 || ms/i: 7.66667 || i/s: 130.435
BufferCreateINT16: msecs: 78 || ms/i: 13 || i/s: 76.9231
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
JustCopy: msecs: 172 || ms/i: 0.086 || i/s: 11627.9
SimpleSmooth: msecs: 219 || ms/i: 0.1095 || i/s: 9132.42
TexNoise: msecs: 203 || ms/i: 0.1015 || i/s: 9852.22
3x3Conv: msecs: 266 || ms/i: 0.266 || i/s: 3759.4
TEncode: msecs: 94 || ms/i: 0.094 || i/s: 10638.3
TDecode: msecs: 110 || ms/i: 0.11 || i/s: 9090.91
LinDiffINT: msecs: 250 || ms/i: 0.125 || i/s: 8000
LinDiffINT16: msecs: 516 || ms/i: 0.258 || i/s: 3875.97
LinDiffFP16: msecs: 516 || ms/i: 0.258 || i/s: 3875.97
LinDiffFP32: msecs: 1797 || ms/i: 0.8985 || i/s: 1112.97
PMTEncoded: msecs: 594 || ms/i: 0.594 || i/s: 1683.5
PMStandard: msecs: 2765 || ms/i: 2.765 || i/s: 361.664
PMBuffered: msecs: 968 || ms/i: 1.936 || i/s: 516.529

Testing 512x512 image:
BufferCreateINT: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateINT16: msecs: 78 || ms/i: 13 || i/s: 76.9231
BufferCreateFP16: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
BufferCreateFP32: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
JustCopy: msecs: 188 || ms/i: 0.188 || i/s: 5319.15
SimpleSmooth: msecs: 390 || ms/i: 0.39 || i/s: 2564.1
TexNoise: msecs: 266 || ms/i: 0.266 || i/s: 3759.4
3x3Conv: msecs: 438 || ms/i: 0.876 || i/s: 1141.55
TEncode: msecs: 94 || ms/i: 0.188 || i/s: 5319.15
TDecode: msecs: 203 || ms/i: 0.406 || i/s: 2463.05
LinDiffINT: msecs: 407 || ms/i: 0.407 || i/s: 2457
LinDiffINT16: msecs: 891 || ms/i: 0.891 || i/s: 1122.33
LinDiffFP16: msecs: 875 || ms/i: 0.875 || i/s: 1142.86
LinDiffFP32: msecs: 3250 || ms/i: 3.25 || i/s: 307.692
PMTEncoded: msecs: 1031 || ms/i: 2.062 || i/s: 484.966
PMStandard: msecs: 5109 || ms/i: 10.218 || i/s: 97.8665
PMBuffered: msecs: 1719 || ms/i: 6.876 || i/s: 145.433

Testing 1024x1024 image:
BufferCreateINT: msecs: 47 || ms/i: 7.83333 || i/s: 127.66
BufferCreateINT16: msecs: 94 || ms/i: 15.6667 || i/s: 63.8298
BufferCreateFP16: msecs: 62 || ms/i: 10.3333 || i/s: 96.7742
BufferCreateFP32: msecs: 63 || ms/i: 10.5 || i/s: 95.2381
JustCopy: msecs: 687 || ms/i: 0.687 || i/s: 1455.6
SimpleSmooth: msecs: 1453 || ms/i: 1.453 || i/s: 688.231
TexNoise: msecs: 906 || ms/i: 0.906 || i/s: 1103.75
3x3Conv: msecs: 1688 || ms/i: 3.376 || i/s: 296.209
TEncode: msecs: 297 || ms/i: 0.594 || i/s: 1683.5
TDecode: msecs: 797 || ms/i: 1.594 || i/s: 627.353
LinDiffINT: msecs: 1531 || ms/i: 1.531 || i/s: 653.168
LinDiffINT16: msecs: 3437 || ms/i: 3.437 || i/s: 290.951
LinDiffFP16: msecs: 3453 || ms/i: 3.453 || i/s: 289.603
LinDiffFP32: msecs: 12813 || ms/i: 12.813 || i/s: 78.0457
PMTEncoded: msecs: 3860 || ms/i: 7.72 || i/s: 129.534
PMStandard: msecs: 20187 || ms/i: 40.374 || i/s: 24.7684
PMBuffered: msecs: 54594 || ms/i: 218.376 || i/s: 4.57926
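
To compare two runs test by test, dividing the ms/i values is enough. The Python sketch below uses a handful of 1024x1024 figures copied from the 9500 Pro log earlier in the thread and from this run; `speedup` is a hypothetical helper name, and the choice of tests is just a sample.

```python
# ms/i at 1024x1024, copied from the logs in this thread.
radeon_9500_pro = {
    "JustCopy": 3.296,
    "3x3Conv": 14.376,
    "LinDiffFP32": 7.406,
    "PMStandard": 29.844,
    "PMBuffered": 55.5,
}
geforce_6800gt = {
    "JustCopy": 0.687,
    "3x3Conv": 3.376,
    "LinDiffFP32": 12.813,
    "PMStandard": 40.374,
    "PMBuffered": 218.376,
}


def speedup(baseline, candidate):
    """Per-test ratio baseline/candidate; above 1 means the candidate is faster."""
    return {
        name: baseline[name] / candidate[name]
        for name in baseline
        if name in candidate
    }


ratios = speedup(radeon_9500_pro, geforce_6800gt)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
```

Ratios above 1 mean the 6800GT run was faster on that test; ratios below 1 mean the 9500 Pro run was.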
 