Own an ATI X1, HD or NV 7/8 series GPU and have 5-10 minutes? Please run this.

Discussion in 'GPGPU Technology & Programming' started by PeterT, May 29, 2007.

  1. Aerows

    Regular Newcomer

    Joined:
    Nov 19, 2002
    Messages:
    317
    Likes Received:
    6
    Wonderful! :grin: Thank you so much. I will edit my original, and then post the other results.
     
  2. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    It crashed at 2047 with me too, display driver recovered but application seemed to hang.

    HD2900 Cat 8.38RC7
    Opteron 165
    1GB DDR-3200
    Vista32

    Code:
    GL multigrid benchmark 0.5 by Peter Thoman 2007
    
    DevIL initialized successfully.
    OpenGL initialized successfully.
    ILUT OpenGL mode set successfully.
    GLEW version 1.3.5 initialized successfully.
    Loaded required OpenGL extensions for GLPixelShader.
    Loaded required OpenGL extensions for GLRenderTexture.
    Loaded required OpenGL extensions for GLFilterStep.
    Initialization complete.
    
    
    
    ------------------
    -- size: 3
    
    BENCHMARK
    Jacobi iteration: 20.1 micros
    Residual calculation: 19.9 micros
    Restriction: fw: 14.2 micros
    Interpolation + add: 19.7 micros
    VCycle: 445 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.5 micros
    Residual calculation: 19.3 micros
    Restriction: fw: 13.9 micros
    Interpolation + add: 19.7 micros
    VCycle: 457 micros
    
    
    BENCHMARK
    Jacobi iteration: 20.2 micros
    Residual calculation: 19.3 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.1 micros
    VCycle: 446 micros
    
    
    
    ------------------
    -- size: 7
    
    BENCHMARK
    Jacobi iteration: 19.6667 micros
    Residual calculation: 19.6667 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.5556 micros
    VCycle: 724.444 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.6667 micros
    Residual calculation: 19.7778 micros
    Restriction: fw: 13.8889 micros
    Interpolation + add: 19.6667 micros
    VCycle: 725.556 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.6667 micros
    Residual calculation: 19.7778 micros
    Restriction: fw: 13.8889 micros
    Interpolation + add: 19.4444 micros
    VCycle: 725.556 micros
    
    
    
    ------------------
    -- size: 15
    
    BENCHMARK
    Jacobi iteration: 19.375 micros
    Residual calculation: 19.375 micros
    Restriction: fw: 13.875 micros
    Interpolation + add: 19.625 micros
    VCycle: 1005 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.375 micros
    Residual calculation: 19.5 micros
    Restriction: fw: 13.75 micros
    Interpolation + add: 19.5 micros
    VCycle: 1020 micros
    
    
    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 19.75 micros
    Restriction: fw: 13.75 micros
    Interpolation + add: 19.25 micros
    VCycle: 1005 micros
    
    
    
    ------------------
    -- size: 31
    
    BENCHMARK
    Jacobi iteration: 19.5714 micros
    Residual calculation: 19.7143 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.2857 micros
    VCycle: 1294.29 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.7143 micros
    Residual calculation: 19.7143 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.7143 micros
    VCycle: 1290 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.7143 micros
    Residual calculation: 19.7143 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.2857 micros
    VCycle: 1290 micros
    
    
    
    ------------------
    -- size: 63
    
    BENCHMARK
    Jacobi iteration: 19.6667 micros
    Residual calculation: 19.6667 micros
    Restriction: fw: 13.8333 micros
    Interpolation + add: 19.6667 micros
    VCycle: 1606.67 micros
    
    
    BENCHMARK
    Jacobi iteration: 20.3333 micros
    Residual calculation: 19.6667 micros
    Restriction: fw: 13.8333 micros
    Interpolation + add: 19.6667 micros
    VCycle: 1576.67 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.8333 micros
    Residual calculation: 19.5 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.5 micros
    VCycle: 1598.33 micros
    
    
    
    ------------------
    -- size: 127
    
    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 19.8 micros
    Restriction: fw: 14.4 micros
    Interpolation + add: 19.8 micros
    VCycle: 1894 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.6 micros
    Residual calculation: 19.6 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.4 micros
    VCycle: 1874 micros
    
    
    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 20 micros
    Restriction: fw: 14.4 micros
    Interpolation + add: 19.8 micros
    VCycle: 1910 micros
    
    
    
    ------------------
    -- size: 255
    
    BENCHMARK
    Jacobi iteration: 20.75 micros
    Residual calculation: 20.5 micros
    Restriction: fw: 14.75 micros
    Interpolation + add: 20.75 micros
    VCycle: 4360 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.75 micros
    Residual calculation: 19 micros
    Restriction: fw: 13.75 micros
    Interpolation + add: 19.5 micros
    VCycle: 4380 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.5 micros
    Residual calculation: 19.5 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19 micros
    VCycle: 4427.5 micros
    
    
    
    ------------------
    -- size: 511
    
    BENCHMARK
    Jacobi iteration: 19.3333 micros
    Residual calculation: 19.3333 micros
    Restriction: fw: 13.6667 micros
    Interpolation + add: 20 micros
    VCycle: 20176.7 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.3333 micros
    Residual calculation: 19.6667 micros
    Restriction: fw: 14 micros
    Interpolation + add: 19.6667 micros
    VCycle: 22086.7 micros
    
    
    BENCHMARK
    Jacobi iteration: 19.6667 micros
    Residual calculation: 19.6667 micros
    Restriction: fw: 13.6667 micros
    Interpolation + add: 19.3333 micros
    VCycle: 22066.7 micros
    
    
    
    ------------------
    -- size: 1023
    
    BENCHMARK
    Jacobi iteration: 19 micros
    Residual calculation: 19 micros
    Restriction: fw: 13.5 micros
    Interpolation + add: 19.5 micros
    VCycle: 81530 micros
    
    
    BENCHMARK
    Jacobi iteration: 39.5 micros
    Residual calculation: 38 micros
    Restriction: fw: 41.5 micros
    Interpolation + add: 41.5 micros
    VCycle: 89600 micros
    
    
    BENCHMARK
    Jacobi iteration: 41.5 micros
    Residual calculation: 38 micros
    Restriction: fw: 41.5 micros
    Interpolation + add: 42.5 micros
    VCycle: 89620 micros
    
    
    
    ------------------
    -- size: 2047
    
     
  3. Aerows

    Regular Newcomer

    Joined:
    Nov 19, 2002
    Messages:
    317
    Likes Received:
    6
    Vista 32-bit / Forceware 158.18
    AMD X2 4200+ Windsor @ 2.2 Ghz (stock)
    Corsair XMS PC2-5400 @ 667
    Geforce 8600 GTS @ 720/2100

    ***************************************
    Code:
    GL multigrid benchmark 0.5 by Peter Thoman 2007
    
    DevIL initialized successfully.
    OpenGL initialized successfully.
    ILUT OpenGL mode set successfully.
    GLEW version 1.3.5 initialized successfully.
    Loaded required OpenGL extensions for GLPixelShader.
    Loaded required OpenGL extensions for GLRenderTexture.
    Loaded required OpenGL extensions for GLFilterStep.
    Initialization complete.
    
    
    
    ------------------
    -- size: 3
    
    BENCHMARK
    Jacobi iteration: 3.1 micros
    Residual calculation: 3.2 micros
    Restriction: fw: 3.1 micros
    Interpolation + add: 3.1 micros
    VCycle: 6443 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.1 micros
    Residual calculation: 4.7 micros
    Restriction: fw: 3.1 micros
    Interpolation + add: 3.1 micros
    VCycle: 6505 micros
    
    
    BENCHMARK
    Jacobi iteration: 4.7 micros
    Residual calculation: 3.1 micros
    Restriction: fw: 3.1 micros
    Interpolation + add: 3.1 micros
    VCycle: 6428 micros
    
    
    
    ------------------
    -- size: 7
    
    BENCHMARK
    Jacobi iteration: 3.44444 micros
    Residual calculation: 3.44444 micros
    Restriction: fw: 3.55556 micros
    Interpolation + add: 3.44444 micros
    VCycle: 10850 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.44444 micros
    Residual calculation: 3.44444 micros
    Restriction: fw: 3.55556 micros
    Interpolation + add: 3.44444 micros
    VCycle: 10850 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.44444 micros
    Residual calculation: 3.44444 micros
    Restriction: fw: 3.55556 micros
    Interpolation + add: 3.44444 micros
    VCycle: 10920 micros
    
    
    
    ------------------
    -- size: 15
    
    BENCHMARK
    Jacobi iteration: 3.875 micros
    Residual calculation: 3.875 micros
    Restriction: fw: 2 micros
    Interpolation + add: 3.875 micros
    VCycle: 15521.3 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.875 micros
    Residual calculation: 4 micros
    Restriction: fw: 1.875 micros
    Interpolation + add: 3.875 micros
    VCycle: 15093.8 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.875 micros
    Residual calculation: 2 micros
    Restriction: fw: 3.875 micros
    Interpolation + add: 3.875 micros
    VCycle: 15073.8 micros
    
    
    
    ------------------
    -- size: 31
    
    BENCHMARK
    Jacobi iteration: 2.14286 micros
    Residual calculation: 4.57143 micros
    Restriction: fw: 4.42857 micros
    Interpolation + add: 2.14286 micros
    VCycle: 19545.7 micros
    
    
    BENCHMARK
    Jacobi iteration: 2.14286 micros
    Residual calculation: 4.57143 micros
    Restriction: fw: 4.42857 micros
    Interpolation + add: 2.14286 micros
    VCycle: 19657.1 micros
    
    
    BENCHMARK
    Jacobi iteration: 4.42857 micros
    Residual calculation: 2.28571 micros
    Restriction: fw: 4.42857 micros
    Interpolation + add: 2.28571 micros
    VCycle: 19611.4 micros
    
    
    
    ------------------
    -- size: 63
    
    BENCHMARK
    Jacobi iteration: 2.5 micros
    Residual calculation: 5.33333 micros
    Restriction: fw: 2.5 micros
    Interpolation + add: 7.83333 micros
    VCycle: 24258.3 micros
    
    
    BENCHMARK
    Jacobi iteration: 2.5 micros
    Residual calculation: 2.66667 micros
    Restriction: fw: 5.16667 micros
    Interpolation + add: 5.16667 micros
    VCycle: 24336.7 micros
    
    
    BENCHMARK
    Jacobi iteration: 5.16667 micros
    Residual calculation: 2.5 micros
    Restriction: fw: 2.66667 micros
    Interpolation + add: 7.83333 micros
    VCycle: 24595 micros
    
    
    
    ------------------
    -- size: 127
    
    BENCHMARK
    Jacobi iteration: 3.2 micros
    Residual calculation: 6.2 micros
    Restriction: fw: 37.4 micros
    Interpolation + add: 37.6 micros
    VCycle: 29952 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.2 micros
    Residual calculation: 6.2 micros
    Restriction: fw: 37.4 micros
    Interpolation + add: 34.4 micros
    VCycle: 29764 micros
    
    
    BENCHMARK
    Jacobi iteration: 3 micros
    Residual calculation: 9.4 micros
    Restriction: fw: 34.4 micros
    Interpolation + add: 34.2 micros
    VCycle: 30296 micros
    
    
    
    ------------------
    -- size: 255
    
    BENCHMARK
    Jacobi iteration: 4 micros
    Residual calculation: 3.75 micros
    Restriction: fw: 101.5 micros
    Interpolation + add: 152 micros
    VCycle: 40170 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.75 micros
    Residual calculation: 0 micros
    Restriction: fw: 97.5 micros
    Interpolation + add: 152.25 micros
    VCycle: 40092.5 micros
    
    
    BENCHMARK
    Jacobi iteration: 0 micros
    Residual calculation: 4 micros
    Restriction: fw: 97.5 micros
    Interpolation + add: 152 micros
    VCycle: 40325 micros
    
    
    
    ------------------
    -- size: 511
    
    BENCHMARK
    Jacobi iteration: 0 micros
    Residual calculation: 5 micros
    Restriction: fw: 151 micros
    Interpolation + add: 353.333 micros
    VCycle: 71086.7 micros
    
    
    BENCHMARK
    Jacobi iteration: 5.33333 micros
    Residual calculation: 5 micros
    Restriction: fw: 151 micros
    Interpolation + add: 358.667 micros
    VCycle: 70200 micros
    
    
    BENCHMARK
    Jacobi iteration: 0 micros
    Residual calculation: 5.33333 micros
    Restriction: fw: 150.667 micros
    Interpolation + add: 359 micros
    VCycle: 70096.7 micros
    
    
    
    ------------------
    -- size: 1023
    
    BENCHMARK
    Jacobi iteration: 0 micros
    Residual calculation: 0 micros
    Restriction: fw: 7.5 micros
    Interpolation + add: 304.5 micros
    VCycle: 191490 micros
    
    
    BENCHMARK
    Jacobi iteration: 0 micros
    Residual calculation: 7.5 micros
    Restriction: fw: 0 micros
    Interpolation + add: 304.5 micros
    VCycle: 191565 micros
    
    
    BENCHMARK
    Jacobi iteration: 0 micros
    Residual calculation: 8 micros
    Restriction: fw: 0 micros
    Interpolation + add: 304 micros
    VCycle: 191410 micros
    
    
    
    ------------------
    -- size: 2047
    
    *************************

    It did not complete the 2047 test; I tried it twice and both times the display driver reset itself in the middle of the test (it likely ran out of VRAM since it only has 256MB).
     
  4. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    Peter, do you have an older version of the App (the January one perhaps?) that we could test with? don't know if much has changed though but I'm curious why all the Radeon's bug out at 2047

    P.S. now I know why I bought a HD2900, to be a master at theoretical benchmarking! w00t!
     
  5. PeterT

    Regular

    Joined:
    May 14, 2002
    Messages:
    702
    Likes Received:
    14
    Location:
    Austria
    Interesting. It showing signs of instability (as well as your V-cycle numbers) suggests that it's actually doing something, even though you're running Catalyst on Vista. But the other numbers just don't work. Now I'm quite confused.
     
  6. AlexV

    AlexV Heteroscedasticitate
    Moderator Veteran

    Joined:
    Mar 15, 2005
    Messages:
    2,528
    Likes Received:
    107
    But at least it seems to work right for you. And may I add that I hate you for being able to use the 8.38s(Nah, just kidding:) ). I`d like to second what Dave said:it certainly is NOT running at 3D speeds-the fan doesn`t spin-up at all.
     
  7. PeterT

    Regular

    Joined:
    May 14, 2002
    Messages:
    702
    Likes Received:
    14
    Location:
    Austria
    A lot has changed, basically the January version took completely wrong measurements. There have also been lots of optimizations. I'll try to whip together a functionality testing app that will produce (or fail to) a result showing whether the computation actually was carried out.
     
  8. Tim Murray

    Tim Murray the Windom Earle of mobile SOCs
    Veteran

    Joined:
    May 25, 2003
    Messages:
    3,278
    Likes Received:
    66
    Location:
    Mountain View, CA
    So no windowed app will trigger 3D clocks?
     
  9. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    I ran your old fbench too, don't know if it helps anything...

    Code:
    GL filter framework 1.4 test application by Peter Thoman 2004-2005
    
    Gui initialized successfully.
    DevIL initialized successfully.
     - DevIL Version: 167
    OpenGL initialized successfully.
    ILUT OpenGL mode set successfully.
    Loaded required OpenGL extensions for GLPixelShader.
    Loaded required OpenGL extensions for GLRenderTexture.
    Loaded required OpenGL extensions for GLFilterStep.
    Initialization complete.
    
    Press return key to start benchmark...
    
    
    
    Testing 32x32 image:
    Results for BufferCreateINT: msecs: 457 || ms/i: 76.1667 || i/s: 13.1291
    No suitable INT format found. Trying FP... (Flaky 6x00 workaround)
    
    Results for BufferCreateINT16: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
    Results for BufferCreateFP16: msecs: 388 || ms/i: 64.6667 || i/s: 15.4639
    Results for BufferCreateFP32: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
    Results for JustCopy: msecs: 1511 || ms/i: 0.7555 || i/s: 1323.63
    Results for SimpleSmooth: msecs: 1500 || ms/i: 0.75 || i/s: 1333.33
    Results for TexNoise: msecs: 1537 || ms/i: 0.7685 || i/s: 1301.24
    Results for 3x3Conv: msecs: 773 || ms/i: 0.773 || i/s: 1293.66
    Results for TEncode: msecs: 754 || ms/i: 0.754 || i/s: 1326.26
    Results for TDecode: msecs: 774 || ms/i: 0.774 || i/s: 1291.99
    Results for LinDiffINT: msecs: 1508 || ms/i: 0.754 || i/s: 1326.26
    Results for LinDiffINT16: msecs: 1510 || ms/i: 0.755 || i/s: 1324.5
    Results for LinDiffFP16: msecs: 1507 || ms/i: 0.7535 || i/s: 1327.14
    Results for LinDiffFP32: msecs: 1534 || ms/i: 0.767 || i/s: 1303.78
    Results for LD_INT->FP16: msecs: 772 || ms/i: 0.772 || i/s: 1295.34
    Results for LD_INT->FP32: msecs: 747 || ms/i: 0.747 || i/s: 1338.69
    Results for LD_FP16->INT: msecs: 749 || ms/i: 0.749 || i/s: 1335.11
    Results for LD_FP32->INT: msecs: 769 || ms/i: 0.769 || i/s: 1300.39
    Results for PMTEncoded: msecs: 2352 || ms/i: 2.352 || i/s: 425.17
    Results for PMStandard: msecs: 2350 || ms/i: 2.35 || i/s: 425.532
    Results for PMBuffered: msecs: 96 || ms/i: 0.192 || i/s: 5208.33
    
    Testing 64x64 image:
    Results for BufferCreateINT: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
    Results for BufferCreateINT16: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
    Results for BufferCreateFP16: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
    Results for BufferCreateFP32: msecs: 380 || ms/i: 63.3333 || i/s: 15.7895
    Results for JustCopy: msecs: 1487 || ms/i: 0.7435 || i/s: 1344.99
    Results for SimpleSmooth: msecs: 1525 || ms/i: 0.7625 || i/s: 1311.48
    Results for TexNoise: msecs: 1524 || ms/i: 0.762 || i/s: 1312.34
    Results for 3x3Conv: msecs: 768 || ms/i: 0.768 || i/s: 1302.08
    Results for TEncode: msecs: 757 || ms/i: 0.757 || i/s: 1321
    Results for TDecode: msecs: 759 || ms/i: 0.759 || i/s: 1317.52
    Results for LinDiffINT: msecs: 1527 || ms/i: 0.7635 || i/s: 1309.76
    Results for LinDiffINT16: msecs: 1559 || ms/i: 0.7795 || i/s: 1282.87
    Results for LinDiffFP16: msecs: 1511 || ms/i: 0.7555 || i/s: 1323.63
    Results for LinDiffFP32: msecs: 1513 || ms/i: 0.7565 || i/s: 1321.88
    Results for LD_INT->FP16: msecs: 763 || ms/i: 0.763 || i/s: 1310.62
    Results for LD_INT->FP32: msecs: 762 || ms/i: 0.762 || i/s: 1312.34
    Results for LD_FP16->INT: msecs: 747 || ms/i: 0.747 || i/s: 1338.69
    Results for LD_FP32->INT: msecs: 748 || ms/i: 0.748 || i/s: 1336.9
    Results for PMTEncoded: msecs: 2341 || ms/i: 2.341 || i/s: 427.168
    Results for PMStandard: msecs: 2349 || ms/i: 2.349 || i/s: 425.713
    Results for PMBuffered: msecs: 98 || ms/i: 0.196 || i/s: 5102.04
    
    Testing 128x128 image:
    Results for BufferCreateINT: msecs: 384 || ms/i: 64 || i/s: 15.625
    Results for BufferCreateINT16: msecs: 383 || ms/i: 63.8333 || i/s: 15.6658
    Results for BufferCreateFP16: msecs: 381 || ms/i: 63.5 || i/s: 15.748
    Results for BufferCreateFP32: msecs: 380 || ms/i: 63.3333 || i/s: 15.7895
    Results for JustCopy: msecs: 1503 || ms/i: 0.7515 || i/s: 1330.67
    Results for SimpleSmooth: msecs: 1494 || ms/i: 0.747 || i/s: 1338.69
    Results for TexNoise: msecs: 1540 || ms/i: 0.77 || i/s: 1298.7
    Results for 3x3Conv: msecs: 761 || ms/i: 0.761 || i/s: 1314.06
    Results for TEncode: msecs: 759 || ms/i: 0.759 || i/s: 1317.52
    Results for TDecode: msecs: 775 || ms/i: 0.775 || i/s: 1290.32
    Results for LinDiffINT: msecs: 1558 || ms/i: 0.779 || i/s: 1283.7
    Results for LinDiffINT16: msecs: 1523 || ms/i: 0.7615 || i/s: 1313.2
    Results for LinDiffFP16: msecs: 1553 || ms/i: 0.7765 || i/s: 1287.83
    Results for LinDiffFP32: msecs: 1537 || ms/i: 0.7685 || i/s: 1301.24
    Results for LD_INT->FP16: msecs: 773 || ms/i: 0.773 || i/s: 1293.66
    Results for LD_INT->FP32: msecs: 782 || ms/i: 0.782 || i/s: 1278.77
    Results for LD_FP16->INT: msecs: 766 || ms/i: 0.766 || i/s: 1305.48
    Results for LD_FP32->INT: msecs: 760 || ms/i: 0.76 || i/s: 1315.79
    Results for PMTEncoded: msecs: 2374 || ms/i: 2.374 || i/s: 421.23
    Results for PMStandard: msecs: 2348 || ms/i: 2.348 || i/s: 425.894
    Results for PMBuffered: msecs: 97 || ms/i: 0.194 || i/s: 5154.64
    
    Testing 256x256 image:
    Results for BufferCreateINT: msecs: 387 || ms/i: 64.5 || i/s: 15.5039
    Results for BufferCreateINT16: msecs: 383 || ms/i: 63.8333 || i/s: 15.6658
    Results for BufferCreateFP16: msecs: 382 || ms/i: 63.6667 || i/s: 15.7068
    Results for BufferCreateFP32: msecs: 381 || ms/i: 63.5 || i/s: 15.748
    Results for JustCopy: msecs: 1519 || ms/i: 0.7595 || i/s: 1316.66
    Results for SimpleSmooth: msecs: 1487 || ms/i: 0.7435 || i/s: 1344.99
    Results for TexNoise: msecs: 1507 || ms/i: 0.7535 || i/s: 1327.14
    Results for 3x3Conv: msecs: 758 || ms/i: 0.758 || i/s: 1319.26
    Results for TEncode: msecs: 767 || ms/i: 0.767 || i/s: 1303.78
    Results for TDecode: msecs: 769 || ms/i: 0.769 || i/s: 1300.39
    Results for LinDiffINT: msecs: 1539 || ms/i: 0.7695 || i/s: 1299.55
    Results for LinDiffINT16: msecs: 1526 || ms/i: 0.763 || i/s: 1310.62
    Results for LinDiffFP16: msecs: 1526 || ms/i: 0.763 || i/s: 1310.62
    Results for LinDiffFP32: msecs: 1557 || ms/i: 0.7785 || i/s: 1284.52
    Results for LD_INT->FP16: msecs: 762 || ms/i: 0.762 || i/s: 1312.34
    Results for LD_INT->FP32: msecs: 762 || ms/i: 0.762 || i/s: 1312.34
    Results for LD_FP16->INT: msecs: 761 || ms/i: 0.761 || i/s: 1314.06
    Results for LD_FP32->INT: msecs: 758 || ms/i: 0.758 || i/s: 1319.26
    Results for PMTEncoded: msecs: 2368 || ms/i: 2.368 || i/s: 422.297
    Results for PMStandard: msecs: 2407 || ms/i: 2.407 || i/s: 415.455
    Results for PMBuffered: msecs: 118 || ms/i: 0.236 || i/s: 4237.29
    
    Testing 512x512 image:
    Results for BufferCreateINT: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
    Results for BufferCreateINT16: msecs: 385 || ms/i: 64.1667 || i/s: 15.5844
    Results for BufferCreateFP16: msecs: 379 || ms/i: 63.1667 || i/s: 15.8311
    Results for BufferCreateFP32: msecs: 383 || ms/i: 63.8333 || i/s: 15.6658
    Results for JustCopy: msecs: 752 || ms/i: 0.752 || i/s: 1329.79
    Results for SimpleSmooth: msecs: 776 || ms/i: 0.776 || i/s: 1288.66
    Results for TexNoise: msecs: 782 || ms/i: 0.782 || i/s: 1278.77
    Results for 3x3Conv: msecs: 391 || ms/i: 0.782 || i/s: 1278.77
    Results for TEncode: msecs: 391 || ms/i: 0.782 || i/s: 1278.77
    Results for TDecode: msecs: 394 || ms/i: 0.788 || i/s: 1269.04
    Results for LinDiffINT: msecs: 798 || ms/i: 0.798 || i/s: 1253.13
    Results for LinDiffINT16: msecs: 786 || ms/i: 0.786 || i/s: 1272.26
    Results for LinDiffFP16: msecs: 780 || ms/i: 0.78 || i/s: 1282.05
    Results for LinDiffFP32: msecs: 781 || ms/i: 0.781 || i/s: 1280.41
    Results for LD_INT->FP16: msecs: 401 || ms/i: 0.802 || i/s: 1246.88
    Results for LD_INT->FP32: msecs: 393 || ms/i: 0.786 || i/s: 1272.26
    Results for LD_FP16->INT: msecs: 391 || ms/i: 0.782 || i/s: 1278.77
    Results for LD_FP32->INT: msecs: 390 || ms/i: 0.78 || i/s: 1282.05
    Results for PMTEncoded: msecs: 1245 || ms/i: 2.49 || i/s: 401.606
    Results for PMStandard: msecs: 1248 || ms/i: 2.496 || i/s: 400.641
    Results for PMBuffered: msecs: 77 || ms/i: 0.308 || i/s: 3246.75
    
    Testing 1024x1024 image:
    Results for BufferCreateINT: msecs: 386 || ms/i: 64.3333 || i/s: 15.544
    Results for BufferCreateINT16: msecs: 395 || ms/i: 65.8333 || i/s: 15.1899
    Results for BufferCreateFP16: msecs: 389 || ms/i: 64.8333 || i/s: 15.4242
    Results for BufferCreateFP32: msecs: 392 || ms/i: 65.3333 || i/s: 15.3061
    Results for JustCopy: msecs: 765 || ms/i: 0.765 || i/s: 1307.19
    Results for SimpleSmooth: msecs: 768 || ms/i: 0.768 || i/s: 1302.08
    Results for TexNoise: msecs: 777 || ms/i: 0.777 || i/s: 1287
    Results for 3x3Conv: msecs: 496 || ms/i: 0.992 || i/s: 1008.06
    Results for TEncode: msecs: 398 || ms/i: 0.796 || i/s: 1256.28
    Results for TDecode: msecs: 403 || ms/i: 0.806 || i/s: 1240.69
    Results for LinDiffINT: msecs: 798 || ms/i: 0.798 || i/s: 1253.13
    Results for LinDiffINT16: msecs: 784 || ms/i: 0.784 || i/s: 1275.51
    Results for LinDiffFP16: msecs: 785 || ms/i: 0.785 || i/s: 1273.89
    Results for LinDiffFP32: msecs: 877 || ms/i: 0.877 || i/s: 1140.25
    Results for LD_INT->FP16: msecs: 405 || ms/i: 0.81 || i/s: 1234.57
    Results for LD_INT->FP32: msecs: 405 || ms/i: 0.81 || i/s: 1234.57
    Results for LD_FP16->INT: msecs: 397 || ms/i: 0.794 || i/s: 1259.45
    Results for LD_FP32->INT: msecs: 390 || ms/i: 0.78 || i/s: 1282.05
    Results for PMTEncoded: msecs: 1251 || ms/i: 2.502 || i/s: 399.68
    Results for PMStandard: msecs: 1549 || ms/i: 3.098 || i/s: 322.789
    Results for PMBuffered: msecs: 91 || ms/i: 0.364 || i/s: 2747.25
     
  10. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    Uhm.. my fan going *HNGGGGGGGGRRRRRRRRRRRRRRRHHH* suggests otherwise... (no 3D mode though, but there is an increase in fan speed)
     
  11. FrameBuffer

    Banned

    Joined:
    Aug 7, 2005
    Messages:
    499
    Likes Received:
    3
    System Info:

    CPU: AMD Opteron 165 (2x1MB)
    Mem: 2GB PC3200
    OS: Windows Vista
    Driver: 7.4 Vista
    Graphics: ATI Radeon 1950 XT 512MB (@Default)

    GL multigrid benchmark 0.5 by Peter Thoman 2007

    DevIL initialized successfully.
    OpenGL initialized successfully.
    ILUT OpenGL mode set successfully.
    GLEW version 1.3.5 initialized successfully.
    Loaded required OpenGL extensions for GLPixelShader.
    Loaded required OpenGL extensions for GLRenderTexture.
    Loaded required OpenGL extensions for GLFilterStep.
    Initialization complete.



    ------------------
    -- size: 3

    BENCHMARK
    Jacobi iteration: 19.9 micros
    Residual calculation: 32.8 micros
    Restriction: fw: 28.7 micros
    Interpolation + add: 33.9 micros
    VCycle: 549 micros


    BENCHMARK
    Jacobi iteration: 34.2 micros
    Residual calculation: 46.4 micros
    Restriction: fw: 43.3 micros
    Interpolation + add: 40.8 micros
    VCycle: 476 micros


    BENCHMARK
    Jacobi iteration: 22.7 micros
    Residual calculation: 25.6 micros
    Restriction: fw: 24.1 micros
    Interpolation + add: 40.4 micros
    VCycle: 908 micros



    ------------------
    -- size: 7

    BENCHMARK
    Jacobi iteration: 27.1111 micros
    Residual calculation: 44.4444 micros
    Restriction: fw: 22.7778 micros
    Interpolation + add: 22 micros
    VCycle: 823.333 micros


    BENCHMARK
    Jacobi iteration: 22 micros
    Residual calculation: 24.4444 micros
    Restriction: fw: 16.3333 micros
    Interpolation + add: 38.8889 micros
    VCycle: 893.333 micros


    BENCHMARK
    Jacobi iteration: 20.5556 micros
    Residual calculation: 20.4444 micros
    Restriction: fw: 15.6667 micros
    Interpolation + add: 20.1111 micros
    VCycle: 703.333 micros



    ------------------
    -- size: 15

    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 19.75 micros
    Restriction: fw: 15.125 micros
    Interpolation + add: 19.875 micros
    VCycle: 972.5 micros


    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 20.5 micros
    Restriction: fw: 15 micros
    Interpolation + add: 20.25 micros
    VCycle: 962.5 micros


    BENCHMARK
    Jacobi iteration: 19.875 micros
    Residual calculation: 21.625 micros
    Restriction: fw: 15.375 micros
    Interpolation + add: 20.5 micros
    VCycle: 977.5 micros



    ------------------
    -- size: 31

    BENCHMARK
    Jacobi iteration: 19.8571 micros
    Residual calculation: 19.7143 micros
    Restriction: fw: 15 micros
    Interpolation + add: 21.8571 micros
    VCycle: 1221.43 micros


    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 20.2857 micros
    Restriction: fw: 15 micros
    Interpolation + add: 20.1429 micros
    VCycle: 1497.14 micros


    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 19.8571 micros
    Restriction: fw: 15 micros
    Interpolation + add: 20.1429 micros
    VCycle: 1241.43 micros



    ------------------
    -- size: 63

    BENCHMARK
    Jacobi iteration: 20.6667 micros
    Residual calculation: 21 micros
    Restriction: fw: 15.8333 micros
    Interpolation + add: 20.3333 micros
    VCycle: 1535 micros


    BENCHMARK
    Jacobi iteration: 20.5 micros
    Residual calculation: 20.5 micros
    Restriction: fw: 15.3333 micros
    Interpolation + add: 21 micros
    VCycle: 2961.67 micros


    BENCHMARK
    Jacobi iteration: 22.1667 micros
    Residual calculation: 22.1667 micros
    Restriction: fw: 15.5 micros
    Interpolation + add: 21.1667 micros
    VCycle: 1541.67 micros



    ------------------
    -- size: 127

    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 20.2 micros
    Restriction: fw: 14.6 micros
    Interpolation + add: 20.4 micros
    VCycle: 1762 micros


    BENCHMARK
    Jacobi iteration: 20.2 micros
    Residual calculation: 19.8 micros
    Restriction: fw: 14.6 micros
    Interpolation + add: 19.8 micros
    VCycle: 1800 micros


    BENCHMARK
    Jacobi iteration: 20.2 micros
    Residual calculation: 20.4 micros
    Restriction: fw: 15.2 micros
    Interpolation + add: 20.2 micros
    VCycle: 1806 micros



    ------------------
    -- size: 255

    BENCHMARK
    Jacobi iteration: 20.25 micros
    Residual calculation: 19.75 micros
    Restriction: fw: 15.5 micros
    Interpolation + add: 19.5 micros
    VCycle: 2030 micros


    BENCHMARK
    Jacobi iteration: 19.75 micros
    Residual calculation: 20 micros
    Restriction: fw: 14.5 micros
    Interpolation + add: 20.5 micros
    VCycle: 2040 micros


    BENCHMARK
    Jacobi iteration: 20.25 micros
    Residual calculation: 20 micros
    Restriction: fw: 15 micros
    Interpolation + add: 19.75 micros
    VCycle: 2067.5 micros



    ------------------
    -- size: 511

    BENCHMARK
    Jacobi iteration: 20.3333 micros
    Residual calculation: 20.6667 micros
    Restriction: fw: 15 micros
    Interpolation + add: 20 micros
    VCycle: 2343.33 micros


    BENCHMARK
    Jacobi iteration: 20.3333 micros
    Residual calculation: 20.6667 micros
    Restriction: fw: 15 micros
    Interpolation + add: 20.3333 micros
    VCycle: 2330 micros


    BENCHMARK
    Jacobi iteration: 20.6667 micros
    Residual calculation: 20 micros
    Restriction: fw: 15.3333 micros
    Interpolation + add: 20 micros
    VCycle: 2330 micros



    ------------------
    -- size: 1023

    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 22.5 micros
    Restriction: fw: 14.5 micros
    Interpolation + add: 20 micros
    VCycle: 2575 micros


    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 20 micros
    Restriction: fw: 15 micros
    Interpolation + add: 19.5 micros
    VCycle: 2575 micros


    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 20.5 micros
    Restriction: fw: 15 micros
    Interpolation + add: 20 micros
    VCycle: 2580 micros



    ------------------
    -- size: 2047

    BENCHMARK
    Jacobi iteration: 40 micros
    Residual calculation: 423 micros
    Restriction: fw: 19 micros
    Interpolation + add: 23 micros
    VCycle: 3890 micros


    BENCHMARK
    Jacobi iteration: 21 micros
    Residual calculation: 20 micros
    Restriction: fw: 15 micros
    Interpolation + add: 20 micros
    VCycle: 3090 micros


    BENCHMARK
    Jacobi iteration: 20 micros
    Residual calculation: 20 micros
    Restriction: fw: 14 micros
    Interpolation + add: 20 micros
    VCycle: 2910 micros
     
  12. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    btw, the error code

    Code:
    Faulting application GPUMGbench.exe, 
    version 0.0.0.0,
     time stamp 0x465c4e20,
     faulting module atioglxx.dll, 
    version 6.14.10.6590,
     time stamp 0x464d02ab, 
    exception code 0xc0000005,
    fault offset 0x0050256c, 
    process id 0x590,
     application start time 0x01c7a231789f2b57.
     
  13. AlexV

    AlexV Heteroscedasticitate
    Moderator Veteran

    Joined:
    Mar 15, 2005
    Messages:
    2,528
    Likes Received:
    107
    You`re also the only one getting results that are in line with what they should be(mostly). And I recall there was talk of a bug in the 8.38 making the fan spin-up even in desktop mode so...;)
     
  14. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    It sure is spinning in DT! making it much less comfortable than my first experience with the card
     
  15. Dave Baumann

    Dave Baumann Gamerscore Wh...
    Moderator Legend

    Joined:
    Jan 29, 2002
    Messages:
    14,079
    Likes Received:
    648
    Location:
    O Canada!
    On the standard desktop Radeons this is the case (including things like SPECviewperf).
     
  16. AlexV

    AlexV Heteroscedasticitate
    Moderator Veteran

    Joined:
    Mar 15, 2005
    Messages:
    2,528
    Likes Received:
    107
    Anybody know what`s the deviceID for an 8800GTX?As in one that would be useable in 3DAnalyze?
     
  17. FrameBuffer

    Banned

    Joined:
    Aug 7, 2005
    Messages:
    499
    Likes Received:
    3
    System Info:

    CPU: AMD Opteron 165 (2x1MB)
    Mem: 2GB PC3200
    OS: Windows XP 32bit
    Driver: 7.4 32bit
    Graphics: ATI Radeon 1950 XT 512MB (@Default)

    GL multigrid benchmark 0.5 by Peter Thoman 2007

    DevIL initialized successfully.
    OpenGL initialized successfully.
    ILUT OpenGL mode set successfully.
    GLEW version 1.3.5 initialized successfully.
    Loaded required OpenGL extensions for GLPixelShader.
    Loaded required OpenGL extensions for GLRenderTexture.
    Loaded required OpenGL extensions for GLFilterStep.
    Initialization complete.



    ------------------
    -- size: 3

    BENCHMARK
    Jacobi iteration: 4.6 micros
    Residual calculation: 6.3 micros
    Restriction: fw: 6.2 micros
    Interpolation + add: 4.7 micros
    VCycle: 672 micros


    BENCHMARK
    Jacobi iteration: 6.2 micros
    Residual calculation: 6.3 micros
    Restriction: fw: 4.6 micros
    Interpolation + add: 6.3 micros
    VCycle: 656 micros


    BENCHMARK
    Jacobi iteration: 6.3 micros
    Residual calculation: 4.7 micros
    Restriction: fw: 6.2 micros
    Interpolation + add: 6.3 micros
    VCycle: 656 micros



    ------------------
    -- size: 7

    BENCHMARK
    Jacobi iteration: 5.22222 micros
    Residual calculation: 7 micros
    Restriction: fw: 5.22222 micros
    Interpolation + add: 6.88889 micros
    VCycle: 1076.67 micros


    BENCHMARK
    Jacobi iteration: 7 micros
    Residual calculation: 5.22222 micros
    Restriction: fw: 5.22222 micros
    Interpolation + add: 5.11111 micros
    VCycle: 1094.44 micros


    BENCHMARK
    Jacobi iteration: 7 micros
    Residual calculation: 5.22222 micros
    Restriction: fw: 5.11111 micros
    Interpolation + add: 7 micros
    VCycle: 1093.33 micros



    ------------------
    -- size: 15

    BENCHMARK
    Jacobi iteration: 5.875 micros
    Residual calculation: 5.875 micros
    Restriction: fw: 5.875 micros
    Interpolation + add: 5.75 micros
    VCycle: 1505 micros


    BENCHMARK
    Jacobi iteration: 5.875 micros
    Residual calculation: 5.875 micros
    Restriction: fw: 5.875 micros
    Interpolation + add: 3.875 micros
    VCycle: 1523.75 micros


    BENCHMARK
    Jacobi iteration: 5.875 micros
    Residual calculation: 5.875 micros
    Restriction: fw: 5.875 micros
    Interpolation + add: 3.875 micros
    VCycle: 1523.75 micros



    ------------------
    -- size: 31

    BENCHMARK
    Jacobi iteration: 6.57143 micros
    Residual calculation: 6.71429 micros
    Restriction: fw: 4.57143 micros
    Interpolation + add: 6.57143 micros
    VCycle: 1942.86 micros


    BENCHMARK
    Jacobi iteration: 6.57143 micros
    Residual calculation: 4.57143 micros
    Restriction: fw: 8.85714 micros
    Interpolation + add: 6.71429 micros
    VCycle: 1941.43 micros


    BENCHMARK
    Jacobi iteration: 6.71429 micros
    Residual calculation: 6.71429 micros
    Restriction: fw: 4.42857 micros
    Interpolation + add: 6.71429 micros
    VCycle: 1941.43 micros



    ------------------
    -- size: 63

    BENCHMARK
    Jacobi iteration: 13.1667 micros
    Residual calculation: 13 micros
    Restriction: fw: 15.5 micros
    Interpolation + add: 10.5 micros
    VCycle: 2448.33 micros


    BENCHMARK
    Jacobi iteration: 7.83333 micros
    Residual calculation: 15.6667 micros
    Restriction: fw: 13 micros
    Interpolation + add: 10.3333 micros
    VCycle: 2448.33 micros


    BENCHMARK
    Jacobi iteration: 20.8333 micros
    Residual calculation: 5.16667 micros
    Restriction: fw: 13 micros
    Interpolation + add: 10.5 micros
    VCycle: 2446.67 micros



    ------------------
    -- size: 127

    BENCHMARK
    Jacobi iteration: 71.8 micros
    Residual calculation: 25 micros
    Restriction: fw: 59.4 micros
    Interpolation + add: 34.4 micros
    VCycle: 3250 micros


    BENCHMARK
    Jacobi iteration: 46.8 micros
    Residual calculation: 68.8 micros
    Restriction: fw: 40.6 micros
    Interpolation + add: 34.4 micros
    VCycle: 3220 micros


    BENCHMARK
    Jacobi iteration: 46.8 micros
    Residual calculation: 109.4 micros
    Restriction: fw: 6.2 micros
    Interpolation + add: 28.2 micros
    VCycle: 3218 micros



    ------------------
    -- size: 255

    BENCHMARK
    Jacobi iteration: 449.25 micros
    Residual calculation: 15.5 micros
    Restriction: fw: 140.75 micros
    Interpolation + add: 234.25 micros
    VCycle: 4532.5 micros


    BENCHMARK
    Jacobi iteration: 86 micros
    Residual calculation: 50.75 micros
    Restriction: fw: 390.5 micros
    Interpolation + add: 4 micros
    VCycle: 5272.5 micros


    BENCHMARK
    Jacobi iteration: 421.75 micros
    Residual calculation: 7.75 micros
    Restriction: fw: 363.25 micros
    Interpolation + add: 15.75 micros
    VCycle: 4805 micros



    ------------------
    -- size: 511

    BENCHMARK
    Jacobi iteration: 1682.33 micros
    Residual calculation: 1093.67 micros
    Restriction: fw: 5.33333 micros
    Interpolation + add: 1260.33 micros
    VCycle: 8123.33 micros


    BENCHMARK
    Jacobi iteration: 2005.33 micros
    Residual calculation: 5.33333 micros
    Restriction: fw: 1380 micros
    Interpolation + add: 343.667 micros
    VCycle: 11250 micros


    BENCHMARK
    Jacobi iteration: 1937.67 micros
    Residual calculation: 5 micros
    Restriction: fw: 1380.33 micros
    Interpolation + add: 390.667 micros
    VCycle: 11510 micros



    ------------------
    -- size: 1023

    BENCHMARK
    Jacobi iteration: 8523.5 micros
    Residual calculation: 3156.5 micros
    Restriction: fw: 2390.5 micros
    Interpolation + add: 2594 micros
    VCycle: 30155 micros


    BENCHMARK
    Jacobi iteration: 8352 micros
    Residual calculation: 3070 micros
    Restriction: fw: 2367.5 micros
    Interpolation + add: 2390.5 micros
    VCycle: 33200 micros


    BENCHMARK
    Jacobi iteration: 8453 micros
    Residual calculation: 3312.5 micros
    Restriction: fw: 2258 micros
    Interpolation + add: 2710.5 micros
    VCycle: 29535 micros



    ------------------
     
  18. BlizzardOne

    Newcomer

    Joined:
    Sep 29, 2006
    Messages:
    81
    Likes Received:
    6
    Location:
    North West UK
    I wish there was a toggle for this in the CCC.
     
  19. Geeforcer

    Geeforcer Harmlessly Evil
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,297
    Likes Received:
    464
    Peter, I see that the program is single-threaded, any way you can make it take advantage of multiple cores?

    Sytem: E6600@3200Mhz, 2GB RAM
    8800 GTX, Forceware 160.03
    WinXP Pro

    Code:
    GL multigrid benchmark 0.5 by Peter Thoman 2007
    DevIL initialized successfully.
    OpenGL initialized successfully.
    ILUT OpenGL mode set successfully.
    GLEW version 1.3.5 initialized successfully.
    Loaded required OpenGL extensions for GLPixelShader.
    Loaded required OpenGL extensions for GLRenderTexture.
    Loaded required OpenGL extensions for GLFilterStep.
    Initialization complete.
     
    ------------------
    -- size: 3
    BENCHMARK
    Jacobi iteration: 3.1 micros
    Residual calculation: 1.6 micros
    Restriction: fw: 1.5 micros
    Interpolation + add: 1.6 micros
    VCycle: 391 micros
     
    BENCHMARK
    Jacobi iteration: 1.5 micros
    Residual calculation: 1.6 micros
    Restriction: fw: 1.5 micros
    Interpolation + add: 1.6 micros
    VCycle: 391 micros
     
    BENCHMARK
    Jacobi iteration: 1.5 micros
    Residual calculation: 1.6 micros
    Restriction: fw: 1.5 micros
    Interpolation + add: 1.6 micros
    VCycle: 391 micros
     
    ------------------
    -- size: 7
    BENCHMARK
    Jacobi iteration: 3.44444 micros
    Residual calculation: 0 micros
    Restriction: fw: 1.77778 micros
    Interpolation + add: 3.44444 micros
    VCycle: 660 micros
     
    BENCHMARK
    Jacobi iteration: 1.66667 micros
    Residual calculation: 1.77778 micros
    Restriction: fw: 1.77778 micros
    Interpolation + add: 1.66667 micros
    VCycle: 660 micros
     
    BENCHMARK
    Jacobi iteration: 1.77778 micros
    Residual calculation: 1.66667 micros
    Restriction: fw: 1.77778 micros
    Interpolation + add: 1.66667 micros
    VCycle: 660 micros
     
    ------------------
    -- size: 15
    BENCHMARK
    Jacobi iteration: 0 micros
    Residual calculation: 1.875 micros
    Restriction: fw: 2 micros
    Interpolation + add: 2 micros
    VCycle: 937.5 micros
     
    BENCHMARK
    Jacobi iteration: 2 micros
    Residual calculation: 1.875 micros
    Restriction: fw: 2 micros
    Interpolation + add: 0 micros
    VCycle: 937.5 micros
     
    BENCHMARK
    Jacobi iteration: 2 micros
    Residual calculation: 1.875 micros
    Restriction: fw: 2 micros
    Interpolation + add: 2 micros
    VCycle: 917.5 micros
     
    ------------------
    -- size: 31
    BENCHMARK
    Jacobi iteration: 2.14286 micros
    Residual calculation: 2.28571 micros
    Restriction: fw: 2.14286 micros
    Interpolation + add: 0 micros
    VCycle: 1205.71 micros
     
    BENCHMARK
    Jacobi iteration: 2.28571 micros
    Residual calculation: 2.28571 micros
    Restriction: fw: 2.14286 micros
    Interpolation + add: 0 micros
    VCycle: 1205.71 micros
     
    BENCHMARK
    Jacobi iteration: 2.28571 micros
    Residual calculation: 2.28571 micros
    Restriction: fw: 2.14286 micros
    Interpolation + add: 0 micros
    VCycle: 1205.71 micros
     
    ------------------
    -- size: 63
    BENCHMARK
    Jacobi iteration: 2.66667 micros
    Residual calculation: 5.16667 micros
    Restriction: fw: 2.66667 micros
    Interpolation + add: 2.5 micros
    VCycle: 1458.33 micros
     
    BENCHMARK
    Jacobi iteration: 2.66667 micros
    Residual calculation: 5.16667 micros
    Restriction: fw: 2.66667 micros
    Interpolation + add: 2.5 micros
    VCycle: 1485 micros
     
    BENCHMARK
    Jacobi iteration: 5.16667 micros
    Residual calculation: 2.66667 micros
    Restriction: fw: 2.66667 micros
    Interpolation + add: 2.5 micros
    VCycle: 1485 micros
     
    ------------------
    -- size: 127
    BENCHMARK
    Jacobi iteration: 12.6 micros
    Residual calculation: 12.4 micros
    Restriction: fw: 9.4 micros
    Interpolation + add: 6.2 micros
    VCycle: 1782 micros
     
    BENCHMARK
    Jacobi iteration: 12.4 micros
    Residual calculation: 12.6 micros
    Restriction: fw: 9.4 micros
    Interpolation + add: 6.2 micros
    VCycle: 1782 micros
     
    BENCHMARK
    Jacobi iteration: 12.4 micros
    Residual calculation: 12.6 micros
    Restriction: fw: 9.4 micros
    Interpolation + add: 6.2 micros
    VCycle: 1782 micros
     
    ------------------
    -- size: 255
    BENCHMARK
    Jacobi iteration: 50.75 micros
    Residual calculation: 47 micros
    Restriction: fw: 50.75 micros
    Interpolation + add: 31.25 micros
    VCycle: 2227.5 micros
     
    BENCHMARK
    Jacobi iteration: 50.75 micros
    Residual calculation: 46.75 micros
    Restriction: fw: 47 micros
    Interpolation + add: 35 micros
    VCycle: 2227.5 micros
     
    BENCHMARK
    Jacobi iteration: 50.75 micros
    Residual calculation: 47 micros
    Restriction: fw: 50.75 micros
    Interpolation + add: 31.25 micros
    VCycle: 2225 micros
     
    ------------------
    -- size: 511
    BENCHMARK
    Jacobi iteration: 192.667 micros
    Residual calculation: 197.667 micros
    Restriction: fw: 187.667 micros
    Interpolation + add: 125 micros
    VCycle: 3853.33 micros
     
    BENCHMARK
    Jacobi iteration: 192.667 micros
    Residual calculation: 192.667 micros
    Restriction: fw: 187.333 micros
    Interpolation + add: 130.333 micros
    VCycle: 3906.67 micros
     
    BENCHMARK
    Jacobi iteration: 192.667 micros
    Residual calculation: 192.667 micros
    Restriction: fw: 187.667 micros
    Interpolation + add: 130 micros
    VCycle: 3856.67 micros
     
    ------------------
    -- size: 1023
    BENCHMARK
    Jacobi iteration: 812.5 micros
    Residual calculation: 789 micros
    Restriction: fw: 851.5 micros
    Interpolation + add: 554.5 micros
    VCycle: 15785 micros
     
    BENCHMARK
    Jacobi iteration: 812.5 micros
    Residual calculation: 789 micros
    Restriction: fw: 859.5 micros
    Interpolation + add: 539 micros
    VCycle: 15860 micros
     
    BENCHMARK
    Jacobi iteration: 804.5 micros
    Residual calculation: 789 micros
    Restriction: fw: 859.5 micros
    Interpolation + add: 547 micros
    VCycle: 15855 micros
     
    ------------------
    -- size: 2047
    BENCHMARK
    Jacobi iteration: 468 micros
    Residual calculation: 2625 micros
    Restriction: fw: 3797 micros
    Interpolation + add: 2938 micros
    VCycle: 94840 micros
     
    BENCHMARK
    Jacobi iteration: 469 micros
    Residual calculation: 2703 micros
    Restriction: fw: 3797 micros
    Interpolation + add: 2890 micros
    VCycle: 94850 micros
     
    BENCHMARK
    Jacobi iteration: 469 micros
    Residual calculation: 2672 micros
    Restriction: fw: 3750 micros
    Interpolation + add: 2891 micros
    VCycle: 95150 micros
     
  20. BlizzardOne

    Newcomer

    Joined:
    Sep 29, 2006
    Messages:
    81
    Likes Received:
    6
    Location:
    North West UK
    C2D X6800 @ Stock (2.93ghz)
    2GB RAM @ DDR800
    Crossfired X1950XTX's @ Stock, Cat 7.4's
    Win XP Pro

    Code:
    GL multigrid benchmark 0.5 by Peter Thoman 2007
    
    DevIL initialized successfully.
    OpenGL initialized successfully.
    ILUT OpenGL mode set successfully.
    GLEW version 1.3.5 initialized successfully.
    Loaded required OpenGL extensions for GLPixelShader.
    Loaded required OpenGL extensions for GLRenderTexture.
    Loaded required OpenGL extensions for GLFilterStep.
    Initialization complete.
    
    
    
    ------------------
    -- size: 3
    
    BENCHMARK
    Jacobi iteration: 3.1 micros
    Residual calculation: 3.2 micros
    Restriction: fw: 3.1 micros
    Interpolation + add: 3.1 micros
    VCycle: 375 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.1 micros
    Residual calculation: 3.1 micros
    Restriction: fw: 3.1 micros
    Interpolation + add: 3.2 micros
    VCycle: 375 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.1 micros
    Residual calculation: 3.2 micros
    Restriction: fw: 3.1 micros
    Interpolation + add: 3.1 micros
    VCycle: 375 micros
    
    
    
    ------------------
    -- size: 7
    
    BENCHMARK
    Jacobi iteration: 3.55556 micros
    Residual calculation: 1.66667 micros
    Restriction: fw: 3.44444 micros
    Interpolation + add: 3.55556 micros
    VCycle: 607.778 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.44444 micros
    Residual calculation: 3.55556 micros
    Restriction: fw: 3.44444 micros
    Interpolation + add: 3.44444 micros
    VCycle: 607.778 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.55556 micros
    Residual calculation: 3.44444 micros
    Restriction: fw: 1.77778 micros
    Interpolation + add: 3.44444 micros
    VCycle: 607.778 micros
    
    
    
    ------------------
    -- size: 15
    
    BENCHMARK
    Jacobi iteration: 1.875 micros
    Residual calculation: 4 micros
    Restriction: fw: 3.875 micros
    Interpolation + add: 3.875 micros
    VCycle: 840 micros
    
    
    BENCHMARK
    Jacobi iteration: 3.875 micros
    Residual calculation: 1.875 micros
    Restriction: fw: 5.875 micros
    Interpolation + add: 2 micros
    VCycle: 858.75 micros
    
    
    BENCHMARK
    Jacobi iteration: 4 micros
    Residual calculation: 3.875 micros
    Restriction: fw: 3.875 micros
    Interpolation + add: 2 micros
    VCycle: 858.75 micros
    
    
    
    ------------------
    -- size: 31
    
    BENCHMARK
    Jacobi iteration: 4.42857 micros
    Residual calculation: 2.28571 micros
    Restriction: fw: 6.71429 micros
    Interpolation + add: 6.71429 micros
    VCycle: 1115.71 micros
    
    
    BENCHMARK
    Jacobi iteration: 4.42857 micros
    Residual calculation: 2.28571 micros
    Restriction: fw: 6.71429 micros
    Interpolation + add: 6.71429 micros
    VCycle: 1115.71 micros
    
    
    BENCHMARK
    Jacobi iteration: 4.42857 micros
    Residual calculation: 2.28571 micros
    Restriction: fw: 6.71429 micros
    Interpolation + add: 6.71429 micros
    VCycle: 1115.71 micros
    
    
    
    ------------------
    -- size: 63
    
    BENCHMARK
    Jacobi iteration: 15.6667 micros
    Residual calculation: 13 micros
    Restriction: fw: 15.6667 micros
    Interpolation + add: 7.66667 micros
    VCycle: 1406.67 micros
    
    
    BENCHMARK
    Jacobi iteration: 13 micros
    Residual calculation: 15.6667 micros
    Restriction: fw: 13 micros
    Interpolation + add: 10.5 micros
    VCycle: 1406.67 micros
    
    
    BENCHMARK
    Jacobi iteration: 13 micros
    Residual calculation: 13 micros
    Restriction: fw: 15.6667 micros
    Interpolation + add: 10.3333 micros
    VCycle: 1406.67 micros
    
    
    
    ------------------
    -- size: 127
    
    BENCHMARK
    Jacobi iteration: 46.8 micros
    Residual calculation: 50 micros
    Restriction: fw: 59.4 micros
    Interpolation + add: 34.4 micros
    VCycle: 1970 micros
    
    
    BENCHMARK
    Jacobi iteration: 46.8 micros
    Residual calculation: 50 micros
    Restriction: fw: 59.4 micros
    Interpolation + add: 34.4 micros
    VCycle: 2000 micros
    
    
    BENCHMARK
    Jacobi iteration: 46.8 micros
    Residual calculation: 50 micros
    Restriction: fw: 59.4 micros
    Interpolation + add: 34.4 micros
    VCycle: 2000 micros
    
    
    
    ------------------
    -- size: 255
    
    BENCHMARK
    Jacobi iteration: 164 micros
    Residual calculation: 195.5 micros
    Restriction: fw: 234.25 micros
    Interpolation + add: 152.25 micros
    VCycle: 4650 micros
    
    
    BENCHMARK
    Jacobi iteration: 160.25 micros
    Residual calculation: 199.25 micros
    Restriction: fw: 234.25 micros
    Interpolation + add: 152.25 micros
    VCycle: 4610 micros
    
    
    BENCHMARK
    Jacobi iteration: 160.25 micros
    Residual calculation: 199.25 micros
    Restriction: fw: 234.25 micros
    Interpolation + add: 152.5 micros
    VCycle: 4647.5 micros
    
    
    
    ------------------
    -- size: 511
    
    BENCHMARK
    Jacobi iteration: 505.333 micros
    Residual calculation: 770.667 micros
    Restriction: fw: 942.667 micros
    Interpolation + add: 739.667 micros
    VCycle: 18750 micros
    
    
    BENCHMARK
    Jacobi iteration: 505.333 micros
    Residual calculation: 781.333 micros
    Restriction: fw: 932.333 micros
    Interpolation + add: 729 micros
    VCycle: 18853.3 micros
    
    
    BENCHMARK
    Jacobi iteration: 500 micros
    Residual calculation: 786.333 micros
    Restriction: fw: 927 micros
    Interpolation + add: 750 micros
    VCycle: 18803.3 micros
    
    
    
    ------------------
    -- size: 1023
    
    BENCHMARK
    Jacobi iteration: 609 micros
    Residual calculation: 1969 micros
    Restriction: fw: 3765.5 micros
    Interpolation + add: 3367.5 micros
    VCycle: 98200 micros
    
    
    BENCHMARK
    Jacobi iteration: 601.5 micros
    Residual calculation: 2055 micros
    Restriction: fw: 3679.5 micros
    Interpolation + add: 3289 micros
    VCycle: 98985 micros
    
    
    BENCHMARK
    Jacobi iteration: 554.5 micros
    Residual calculation: 2023.5 micros
    Restriction: fw: 3758 micros
    Interpolation + add: 3367 micros
    VCycle: 98205 micros
    
    
    
    ------------------
    -- size: 2047
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...