Cell Sparse Matrix Vector

Discussion in 'CellPerformance@B3D' started by Jawed, Nov 13, 2007.

  1. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    10,873
    Likes Received:
    767
    Location:
    London
  2. "Nerve-Damage"

    Regular

    Joined:
    Nov 24, 2005
    Messages:
    809
    Likes Received:
    14
    Thank's Jawed.
     
  3. Panajev2001a

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,187
    Likes Received:
    8
  4. popper

    Newcomer

    Joined:
    Jul 22, 2006
    Messages:
    69
    Likes Received:
    3
    given i just skiped the PDF so i might be wrong but they seem to have totally ignored the onchip altivec unit of the cell?...

    if so it seems a massive waste not to even try and include vectorised Altivec code in any optimisations they performed for max performance for the tests at hand.

    the freeVec lib for instance shows proven altivec abilitys when used in PPC linux problem solving With lots of profiling and testing etc.

    http://www.powerdeveloper.org/forums/viewforum.php?f=23
    you guys might even want to give markos a hand in this new thread too.
    http://www.powerdeveloper.org/forums/viewtopic.php?t=1410
     
    #4 popper, Dec 1, 2007
    Last edited by a moderator: Dec 1, 2007
  5. Vitaly Vidmirov

    Newcomer

    Joined:
    Jul 9, 2007
    Messages:
    108
    Likes Received:
    10
    Location:
    Russia
    ...of transistors

    SPEs for computation, PPE for service. That is the whole point of CELL.
     
  6. popper

    Newcomer

    Joined:
    Jul 22, 2006
    Messages:
    69
    Likes Received:
    3
    i wouldnt say that, after all, the PPE is really a PPC at heart and the Altivec even today, clock for clock still outdoes SSE vectorised code.

    its a shame as Altivecs already on there and fully functional, and other than the few guys over on powerDev etc no one new seems interested in vectorising key PPC Linux for a free boost in performance all this time, never mind giving to the time to get good current code pushed upstream, or even into your everyday apps.

    its strange, if you are to beleave the new PS3 coders are really interested in makeing their code, or the existing PPC base code that they are using on the PS3 today,as effective and fast as possible for free with existing and new Altivec replacement code etc.

    perhaps theres just not as many interested OSS PPC/PS3 coders as we are led to beleave comeing online, at least i dont see much of anyone new offering to try and really help the PPC/PS3 linux efforts, arr well...

    perhaps their all SSE coders and dont know how good and effective Altivec can be ,even for your use/assumption that the PPE is only for service, Altivec can help offload the CPU core servicing of the SPEs?...
     
    #6 popper, Dec 1, 2007
    Last edited by a moderator: Dec 1, 2007
  7. Panajev2001a

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,187
    Likes Received:
    8
    I welcome projects such as libfreevec and all other VMX optimized projects: the problem (or so it seems, your mileage might vary) is that mixing regular FXU/FPU work and VMX (Integer and/or FP) work seems to be a bad performance combination as you cannot exchange data across FP and VMX registers (for example) directly and you have to go higher up in the memory hierarchy.
    If you were already using VMX quite extensively you might want to give it as much work as possible, but going back and forth between FXU/FPU and VMX might lower performance rather than increase it so you have to be careful with that.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...