Inquirer spreading R420 info

Discussion in 'Architecture and Products' started by 991060, Apr 16, 2004.

  1. {Sniping}Waste

    Regular

    Joined:
    Jan 13, 2003
    Messages:
    833
    Likes Received:
    29
    Location:
    Garland TX
    Do yall know who Sireric is? :shock:
    He of all ppl here should know the R3XX cores and what it can do.
     
  2. T2k

    T2k
    Veteran

    Joined:
    Jun 12, 2002
    Messages:
    2,004
    Likes Received:
    0
    Location:
    The Slope & TriBeCa (NYC)
    No, I don't know. :roll:


    PS: Hint... have you noticed this smiley :lol: in my post? It's called 'lol'...
     
  3. {Sniping}Waste

    Regular

    Joined:
    Jan 13, 2003
    Messages:
    833
    Likes Received:
    29
    Location:
    Garland TX
    Sorry T2K. :(
    It must be insalting for sireric to read Chalnoth post.
     
  4. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    I read the quote above as, "if Microsoft would let us, we'd support 32 temporary registers."

    The caps bits have been available for supporting 32 temporary registers the entire time DX9 has been released.
     
  5. hoom

    Veteran

    Joined:
    Sep 23, 2003
    Messages:
    3,264
    Likes Received:
    813
    Umm hows about ps2.0b for finally getting round to enabling 32 temps Chalnoth?

    OpenGL guy, can you give us some indication of whether or not ps2.0b is actually an ATI profile or whether it was another company that had it prepared for them (SGI/PowerVI/NV???)
     
  6. OpenGL guy

    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,357
    Likes Received:
    28
    It says "we will raise that". Does it say "we will raise that to 32"? No. Does it give a timeframe? No. Get over it. You are the one in error.
    As I stated, we could expose 16, but it's pointless.

    Do you have some sort of need to always get the last word in even when you are wrong? I showed that you were wrong earlier. You're still wrong now. This is getting boring.
     
  7. Althornin

    Althornin Senior Lurker
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    1,326
    Likes Received:
    5
    huh, i guess the part that says "recommendation" has you confused.
    I also wonder why you think its 32 registers, and OpenGL guy says 16 for DX9 SM2.0
    I also have to comment on your comment about "treating developers as stupid".
    Make up your mind. When ATI treats em as smart, you bitch and say that they shouldnt leave them room to screw up. When ATi does that, you bitch. Which is it?
     
  8. DemoCoder

    Veteran

    Joined:
    Feb 9, 2002
    Messages:
    4,733
    Likes Received:
    81
    Location:
    California
    I'm curious if there is a performance drop if more than 12-16 are used. Curious that 12 was settled on as the API limit.
     
  9. sireric

    Regular

    Joined:
    Jul 26, 2002
    Messages:
    348
    Likes Received:
    22
    Location:
    Santa Clara, CA
    Not quite.

    1) Regret to inform you that R3xx still has 32 temps registers, as its always had.

    2) The GPRs exposed has stayed the same, mainly due to the fact that there's no need to expose more. It's not just simply changing a dword; there's a whole support aspect and then there's the optimizer to change as well as other parts of the driver that I won't get into. I did ask the SW guys, and they said they saw no need to increase it at this time.

    So, I guess that my full quote was not quite on the ball, but it's mainly true. I should of added to the second comment that we would expose it as the caps are available and as we feel the need for it.
     
  10. DemoCoder

    Veteran

    Joined:
    Feb 9, 2002
    Messages:
    4,733
    Likes Received:
    81
    Location:
    California
    Usually in any basic code block, there are few live registers anyway. AMD put out a paper awhile back where they analyzed thousands of lines of code in a C library, and the average number of live variables per function was 6.5. There is reason to believe this will be lower for GPUs, but also reasons to believe it could be higher. GPUs generally accumulate results in shaders and don't deal with datastructures. On the other hand, GPUs don't have memory to spill to, so may use lots of registers as a sort of "poor man's heap" or as a way of caching some results. I personally think the figure for most shaders is probably 6 or lower.

    If someone's using 16 or 32 registers (that are all live), I think they're probably using them to store a big datastructure.
     
  11. Deathlike2

    Regular

    Joined:
    Aug 17, 2003
    Messages:
    542
    Likes Received:
    5
    Note sig.

    It seems some people are too picky, without understanding responses provided.
     
  12. Hellbinder

    Banned

    Joined:
    Feb 8, 2002
    Messages:
    1,444
    Likes Received:
    12
    Last time i Checked sireric was a Hardware Engineer who thats Right actually Works for ATi. Not only that.. He actually Works on and aids in the designs etc of actual ATi hardware...

    Now imagine that. He might actually know more about R300 hardware than someone who is NOT a Hardware Engineer who whorks for ATi.

    ... I...... am actually..... having trouble with this... I cant Quite grasp it...

    I need an asprin....
     
  13. gandalfthewhite

    Newcomer

    Joined:
    Feb 21, 2004
    Messages:
    43
    Likes Received:
    0
    wow sir eric and Open GL guy i cant belive your even paying attention to chanloths replies damn

    alright for those who dont know i dont know why you dont know but whatever Sire Eric and Open GL guy WORK for ATI if their is any question about the architecture THEIR the ones that you will get informed answers from same with drivers

    edit: damn u hellbinder ya beat me to it
     
  14. nelg

    Veteran

    Joined:
    Jan 26, 2003
    Messages:
    1,557
    Likes Received:
    42
    Location:
    Toronto
    Would it be safe to assume that ATI’s decision to have 32 temp registers was made before the PS2.0 spec was finalized. If so, what were the considerations that led ATI to design the R300 with 3 times the amount of registers than what was eventually exposed what led to the original decision to have 32 as opposed to 12?
     
  15. OpenGL guy

    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,357
    Likes Received:
    28
    I'm not sure why 12 was settled upon when we can easily do 16, but I guess Microsoft did their own research on PS 2.0 shaders and concluded that 12 was enough. I mean, I suppose it could make things easier for other vendors *shrug*

    Under OpenGL, there's an opportunity to use more temps because the driver gets to attempt compilation and optimization before telling the app the shader is not suitable. If the driver can rearrange things to fit into the driver, then it will do so. Under D3D we don't have this option so we have to be more conservative, although 16 would still be fine.
     
  16. Geo

    Geo Mostly Harmless
    Legend

    Joined:
    Apr 22, 2002
    Messages:
    9,116
    Likes Received:
    215
    Location:
    Uffda-land
    What a world to live in where a beautiful theory is never slain by a cold hard fact. Bumble-bees walk, curve balls are optical illusions, and R300 has only 12 temps.
     
  17. Ostsol

    Veteran

    Joined:
    Nov 19, 2002
    Messages:
    1,765
    Likes Received:
    0
    Location:
    Edmonton, Alberta, Canada
    Damn. I'm late to the party. :( Anyway, I'll just chip in that ATI has exposed all 32 fragment/pixel shader registers. It's just that this is only under OpenGL. I just checked, too. . . :) There's certainly no reason why they would be unable to do the same for DirectX. The hardware supports it, so it's obviously some other reasons.

    Anyway, who needs all 32 registers? Heck, NVidia has proven that you can get by quite well on just a handful.
     
  18. Rolf N

    Rolf N Recurring Membmare
    Veteran

    Joined:
    Aug 18, 2003
    Messages:
    2,494
    Likes Received:
    55
    Location:
    yes
    Just out of curiosity ...
    Maybe they thought IHVs would do 16 registers, but might have a use of a few registers as scratch for macro expansion. Ie set aside four registers that aren't visible to the runtime, but only for the driver's internal use?

    Where would the other sixteen go? Reserved for texture coord registers, or sampler results (yes, I'm confused about PS2)?
     
  19. gunblade

    Newcomer

    Joined:
    Nov 16, 2002
    Messages:
    92
    Likes Received:
    2
    Wow, can't believe I actually read and manage to read that :oops:
    well, seldom see post like that here.
     
  20. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    Here:
    When did I say that?

    ATI shouldn't leave room for end users to screw up (to an extent...FSAA/AF settings are a good thing).
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...