Introduction to SGX

Discussion in 'Mobile Graphics Architectures and IP' started by Lazy8s, May 10, 2009.

  1. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    This is however independent of the firmware, the driver may or may not use software vertex shading regardless (and hardware vertex shading may not always be a win, well it would if the cpu paired with wouldn't have a FPU...).
     
  2. Loewe

    Newcomer

    Joined:
    Feb 7, 2002
    Messages:
    41
    Likes Received:
    0
    Location:
    Germany
    I have not sayed that the intel driver do not use TBDR, I have sayed:
    I can't give you a good translation, but let us say it should be read as "it seems that they not using it correct".
    I know that there is a better driver today, if only for Linux, I don't know. The newer driver is better, let us say it have the niveau from the PowerVR driver, but realy good is that not!
    I am sure, the SGX core can do more in this old games.
    Give me the new driver and we will see. :)
    Ok, it is clear that hardware T&L must not be faster.
    The shader unit from this SGX core is rather smal and so it is important to use it clever. The guys at imgtec have a long tradition in design T&L units and I am sure that they know what they do. AFAIK they say that the SGX530 is well ballanced and the SGX535 has a second texture unit as the importent difference. So I don't think that the shader unit is cabable to do the T&L.
    It's a pity that you can't help me here. I am a layman regarding computer grafic, but I try to learn where I can.

    Let us hope that we will see anitime a good working driver for the GMA500.
     
  3. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Don't forget SGX may also be coupled with cpus which have very slow (or no) fpu at all, so being able to do hw tnl on the gpu is a must for the chip. Should also use less power even in case your cpu could potentially do it faster.

    Oh, I wasn't referring to any bogus terms per se, just that some details don't really seem to fit the chip very well. But even powervr employees here don't comment on such things...
     
  4. Loewe

    Newcomer

    Joined:
    Feb 7, 2002
    Messages:
    41
    Likes Received:
    0
    Location:
    Germany
    However, the SGX must do the T&L. If you use the PowerVR driver it can do it fast, see the results from Q3A, SeSa and OGLVillageMark. AFAIK is there no special T&L detection in the PowerVR OGL driver and this driver run significantly faster.
    The D3D driver from intel use the same codebase as the D3D driver from PowerVR and this codebase is not be great. On the other site is Vista sending everything down to the SGX as full f32 data and so the fillrate goes down to one quarter!
    Let me say it again, with an good driver this core should be outgunning KYROII in most cases.
    I know that this is not possible in newer games or in the new 3DMark tests, the shader load is to big for this little core.

    I am not to optimistic, but let us hope that Intel will give us a good, must not be great (but why not?), working driver, also for D3D and OpenGL!
    I don't know what you refer to. All details regarding the 3D core in my last article are from the Intel SCH manual.
     
  5. Arun

    Arun Unknown.
    Legend

    Joined:
    Aug 28, 2002
    Messages:
    5,023
    Likes Received:
    302
    Location:
    UK
    This reminds me - SGX is capable of 1xFP32 or 2xFP16 or 4xINT8. This makes perfect sense for OGL ES 2.0 given its specs (which themselves were surely influenced by IMG) but does anyone know if that last one is actually enough for DX8 PS1.1? All DX8 GPUs ever released had at least 9-bit ALUs iirc (that was NV, ATI/Matrox had more and Rampage would have had more too, no idea about VIA/SiS). And I think PS1.4 requires more mantissa bits than FP16 has at least (although iirc that didn't stop NVIDIA in the NV3x era).

    Regarding using the CPU for T&L: remember Atom comes in different variants. What may not be a bottleneck in 1.6GHz netbooks might be one in 800MHz MIDs (iirc some of those at least have SGX clocked similarly)... So nothing is quite that simple.
     
  6. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    Who cares anyway? Would Intel accept a fully working new D3D driver from IMG? I severely doubt it. It's too damn convenient to mark it as inefficient and at the same time overlook your own inefficiencies in your own neck of woods ;)
     
  7. Loewe

    Newcomer

    Joined:
    Feb 7, 2002
    Messages:
    41
    Likes Received:
    0
    Location:
    Germany
    I have never said that it will be simple!
    That what they doing with the actuell driver is simple! Take ever the biggest format and you can't do wrong!
    You can do this, if you have millions over millions of transistors and it is not important how much power you need. But if you must calculate every transitor and efficiency is the only thing thats helps, than you need not only an efficiency core, you need also an efficiency driver. :)
     
  8. JohnH

    Regular

    Joined:
    Mar 18, 2002
    Messages:
    595
    Likes Received:
    18
    Location:
    UK
    SGX also supports 4xFixed point 10 bit as per OGLES2.0 LOWP so if the OS chose to expose the original shaders to the driver it would have the opportunity use it.

    Note that the numbers you quote above are for a single pipeline, and FP32 is actually up to 2x ops per clock per pipe.

    The reality is even more complicated than this, use of the CPU for TnL duty results in its caches be blasted by geometry data on every frame, so although it may appear faster in simplistic cases it can fall down in real applications. Further to this, I think there is something else going on in the D3D case, as is clearly evidenced by the much better OpenGL performance (with IMG drivers).

    Cheers,
    John.
     
  9. darkblu

    Veteran

    Joined:
    Feb 7, 2002
    Messages:
    2,642
    Likes Received:
    22
    ps1.4 is fx12, so yes, 11 bits of mantissa would not cover it.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...