Sisoft GPGPU benches

Discussion in 'GPGPU Technology & Programming' started by Florin, Nov 28, 2009.

  1. Andrew Lauritzen

    Andrew Lauritzen Moderator
    Moderator Veteran

    Joined:
    May 21, 2004
    Messages:
    2,629
    Likes Received:
    1,227
    Location:
    British Columbia, Canada
    Not sure this is true... this definitely all goes through the WDDM in DC, and perhaps in CUDA too (depending on whether they tunnel it through there or try to abstract their own little space). Typically the WDDM/OS owns all GPU memory and command buffer submission on Windows though, so normally anything that goes to the GPU goes through there in some manner.

    Now obviously an IHV can go around that (rather than through it) to some extent with things like CUDA, but potentially with some pitfalls.
     
  2. Silent_Buddha

    Legend

    Joined:
    Mar 13, 2007
    Messages:
    19,418
    Likes Received:
    10,311
    How much direct access does MS allow to hardware? Wasn't WDDM a move towards removing direct access in order to reduce the chances of a buggy driver blue-screening the OS?

    Or a problem piece of hardware doing the same?

    Regards,
    SB
     
  3. OpenGL guy

    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,357
    Likes Received:
    28
    Since DirectCompute runs within DirectX (and even uses the D3D driver), buffer sharing is trivial. Sharing buffers between other APIs (such as between OpenCL and OpenGL) may be more problematic.
     
  4. Tim Murray

    Tim Murray the Windom Earle of mobile SOCs
    Veteran

    Joined:
    May 25, 2003
    Messages:
    3,278
    Likes Received:
    66
    Location:
    Mountain View, CA
    Let me rephrase--the additional overhead imposed by WDDM versus other driver models (e.g., the cost of accessing hardware in Linux) should cause no meaningful performance penalties in the basic streaming case. What I was wondering in particular is if DC could be faster because of some hooks into WDDM, for instance (to avoid some additional user/kernel transitions), but I don't know enough about DC to be able to say either way.
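
    A rough way to put numbers on the "no meaningful penalty" argument is sketched below. It assumes a purely bandwidth-bound streaming kernel plus a fixed per-submission driver cost; the function name, the parameters, and any values you plug in are illustrative assumptions, not measurements of WDDM or of any particular driver.

```python
def submission_overhead_fraction(bytes_moved: float,
                                 mem_bandwidth_bytes_per_s: float,
                                 overhead_s: float) -> float:
    """Fraction of total time spent on a fixed per-submission overhead.

    Assumed model (not taken from any driver documentation):
      kernel time = bytes_moved / mem_bandwidth_bytes_per_s
      total time  = overhead_s + kernel time
    """
    if bytes_moved <= 0 or mem_bandwidth_bytes_per_s <= 0 or overhead_s < 0:
        raise ValueError("sizes and bandwidth must be positive, overhead non-negative")
    kernel_s = bytes_moved / mem_bandwidth_bytes_per_s
    return overhead_s / (overhead_s + kernel_s)


if __name__ == "__main__":
    # Hypothetical inputs: 1 GB streamed at 100 GB/s with 50 microseconds
    # of driver overhead per submission.
    print(submission_overhead_fraction(1e9, 100e9, 50e-6))
```

    Under this model the overhead only matters when the kernel itself is very short, i.e. when little data is streamed per submission.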
     
  5. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Do CS kernels count as being evil in terms of the concurrent kernel execution stuff or do they behave nicely in NV-GPUs and do not block the chip for their duration?
     
  6. Andrew Lauritzen

    Andrew Lauritzen Moderator
    Moderator Veteran

    Joined:
    May 21, 2004
    Messages:
    2,629
    Likes Received:
    1,227
    Location:
    British Columbia, Canada
    Agreed, although it all depends on how - for instance - the CUDA and OpenCL runtimes are implemented with respect to WDDM. There's always a possibility of some inefficiencies there, although again I'd assume that with simple streaming kernels we're going to be far from CPU-limited.

    I'm assuming this is a limitation of current NVIDIA hardware (specifically since it is called out as a benefit of Fermi) rather than anything in software. Thus I expect DC on current NVIDIA hardware to behave like the other APIs.
     
  7. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
  8. Tim Murray

    Tim Murray the Windom Earle of mobile SOCs
    Veteran

    Joined:
    May 25, 2003
    Messages:
    3,278
    Likes Received:
    66
    Location:
    Mountain View, CA
    Their PCIe numbers are totally crazy; getting 5+ GB/s on NV hardware (and AMD hardware, for what it's worth) is trivial. Maybe they're using pageable memory?

    edit: yeah, I get 5.8/5.2 GB/s on my i7 system for pinned memory and 3 GB/s bidirectional for pageable memory, but PCIe transfers from pageable memory are more an issue of CPU memcpy bandwidth than of PCIe performance (which is why they're so radically higher on Nehalem than on previous Intel CPUs).
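
    The point about pageable (paged) transfers being bound by CPU memcpy speed can be sketched with a simple model. It assumes the driver first copies pageable data into a pinned staging buffer and then sends it over PCIe by DMA. The function name and the two modes (fully serialized versus perfectly overlapped) are assumptions made for illustration, not a description of any specific driver.

```python
def staged_bandwidth(pcie_gbps: float, memcpy_gbps: float,
                     overlapped: bool = False) -> float:
    """Effective transfer rate (GB/s) for a copy staged through a pinned buffer.

    Assumed model:
      serialized: every byte pays the CPU memcpy and then the PCIe DMA,
                  so the per-byte times add up.
      overlapped: the two stages are perfectly pipelined, so the slower
                  stage sets the rate.
    """
    if pcie_gbps <= 0 or memcpy_gbps <= 0:
        raise ValueError("bandwidths must be positive")
    if overlapped:
        return min(pcie_gbps, memcpy_gbps)
    return 1.0 / (1.0 / pcie_gbps + 1.0 / memcpy_gbps)


if __name__ == "__main__":
    # 5.8 GB/s is the pinned figure quoted above; the memcpy rates are
    # hypothetical values chosen only to show the trend.
    for memcpy_rate in (2.0, 4.0, 8.0):
        print(memcpy_rate, round(staged_bandwidth(5.8, memcpy_rate), 2))
```

    In either mode the effective rate rises with the CPU copy rate while the PCIe rate stays fixed, which is consistent with a faster memory subsystem improving pageable transfers.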
     
  9. Forrest

    Newcomer

    Joined:
    Jul 22, 2008
    Messages:
    39
    Likes Received:
    0
    Totally crazy PCIe numbers? sure! :lol:

    I get 0.4 GB/s in OpenCL on 4850.

    And this one :

    [image not preserved]

    1/8th memory transfer performance. :???:
     
  10. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    I didn't even take the time to check the graphs properly... but what about the CUDA/OpenCL variance between two essentially identical platforms, the 9400M and ION?
     
  11. Florin

    Florin Merrily dodgy
    Veteran Subscriber

    Joined:
    Aug 27, 2003
    Messages:
    1,707
    Likes Received:
    345
    Location:
    The colonies
    Removing and reinstalling the Nvidia driver after installing Sandra fixed this. Not sure what's going on but fwiw.
     
  12. Arnold Beckenbauer

    Veteran Subscriber

    Joined:
    Oct 11, 2006
    Messages:
    1,756
    Likes Received:
    722
    Location:
    Germany
    There is a new version of SiSoft Sandra 2010: 1611.
    How can I get OpenCL working with my HD4850? OpenCL&CPU work.
     
  13. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    I can't get it to work with the HD 5870 either. Freshly installed Windows 7, Cat 9.11 WHQL, Sandra 1611, Stream SDK 2.0 b4 - Stream and DirectCompute work, but I don't get anything for OpenCL.

    Any hints?
     
  14. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    Wait for tomorrow's Cats and pray Terry did his job?
     
  15. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Hopefully - thx for the exceptionally helpful hint btw ;-)

    edit:
    Never mind - seems like a royal FU with Sandra 2010-1611 or with me. Everything else OpenCL works.
     
    #35 CarstenS, Dec 16, 2009
    Last edited by a moderator: Dec 16, 2009
  16. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    A hint would be something about a GTS330, GTS350, GTX360/380 and HD5830 and 5890.. I'm not giving those
     
  17. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    I meant that rather sarcastically, given that everyone in the world and their dog follows Terry on Twitter. ;)
     
  18. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    He never said it would support OpenGL 3.2 though..;) (but that's not the subject at hand)
     
  19. Arnold Beckenbauer

    Veteran Subscriber

    Joined:
    Oct 11, 2006
    Messages:
    1,756
    Likes Received:
    722
    Location:
    Germany
  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.