[B3D Analysis] R600 has been unleashed upon an unsuspecting enthusiast community

Discussion in '3D Hardware, Software & Output Devices' started by Farid, May 14, 2007.

  1. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    That's a pretty old chip you've got there Rys.

    07 - 04 .. it even has all the info on top of the die instead of the shim. :)

    but it sure is an enjoyable read!
     
  2. Geeforcer

    Geeforcer Harmlessly Evil
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,320
    Likes Received:
    525
    I am struggling to understand how you arrived at that conclusion based on anything that was published today so far.

    R600 has a mixed record againt GTX is mixed, but what really would make it shine is 40 less ALUs, 3/4ths narrower bus, half the TMUs and 3/4ths less ROPs?
     
    #22 Geeforcer, May 14, 2007
    Last edited by a moderator: May 14, 2007
  3. Reverend

    Banned

    Joined:
    Jan 31, 2002
    Messages:
    3,266
    Likes Received:
    24
    Sorry, don't understand that but it may be just me. Hope you can expand this properly.
     
  4. psurge

    Regular

    Joined:
    Feb 6, 2002
    Messages:
    955
    Likes Received:
    52
    Location:
    LA, California
    Well, I don't see how there's a routing hit vs. binding a texture unit to a single 16x SIMD array (imagine the 80 ALUs laid out vertically, the buses to texture units laid out horizontally). Whether or not there is a caching hit depends on how quads are assigned to threads, but regardless I suspect that Enos is right - the way things are arranged it looks like one would get more even utilization of texture units. For instance you could schedule completely different thread types to different SIMD arrays (say a texture heavy pixel shader, and a vertex/geometry shader with zero texturing instructions) and still fully utilize the texture units.

    If I understand G80 correctly, you'd have to take care to have each cluster executing a mix of all the thread types to achieve the same TMU efficiency (and that would lower cluster I-cache efficiency).
     
  5. Sobek

    Sobek Locally Operating
    Veteran

    Joined:
    Dec 17, 2004
    Messages:
    1,774
    Likes Received:
    18
    Location:
    QLD, Australia
    Hmm... Were we really an 'unsuspecting enthusiast community'? :???:

    I haven't done a lot of checking, but is there any word on when R600 will be available in Australia?
     
  6. hesido

    Regular

    Joined:
    Mar 28, 2004
    Messages:
    553
    Likes Received:
    85
    I hope AMD has something to say in the mainstream segment, but then they still cling on to NDA's even for that segment. Things do not look bright on the high end. Bad thing is AMD has to sell their top card for so much less, this would greatly reduce profit margins as we would expect to see greatly reduce prices for the entire family (we would want the mainstream segment priced accordingly)

    We can also commend NVidia for coming up with state of the art filtering performance. Oh boy.

    AMD tried to play the HDMI, HDCP card well I guess. But then again, how many wants to watch blue-ray, hd-dvd movies on their monitors? I don't think many people would be connecting their pc's to their plasma TV's in their living room. Media PC's would surely benefit, but media pc concept might not be as popular as xbox360's or PS3's with HD playback capabilities, or dedicated players, for that matters.

    Is it possible to enhance the filtering performance of this arcitecture, so we can see increased performance on lower nm parts (semi-next gen) or will we have to wait for the next architecture?
     
  7. Reputator

    Newcomer

    Joined:
    Nov 6, 2006
    Messages:
    66
    Likes Received:
    0
    Location:
    Florida, US
    Is there any way to explain in as laymen terms as possible what makes these 320 superscalar ALUs different from NVIDIA's scalar ALUs, if they can still all work on separate instructions?

    In other words what binds them together as 4+1?
     
  8. _xxx_

    Banned

    Joined:
    Aug 3, 2004
    Messages:
    5,008
    Likes Received:
    86
    Location:
    Stuttgart, Germany
    Magic :yep2:
     
  9. XMAN26

    Banned

    Joined:
    Feb 17, 2003
    Messages:
    702
    Likes Received:
    1
    Imho

    Based on the bench numbers I've seen so far, ATI/AMD or DAAMIT should have

    A.) Released these cards last year and work on driver issues all along like Nvidia has

    or

    B.) Extended out the NDA more to get the drivers better as the ones that have been supplied have a 2nd Tier Highend(if XTX was around difference would be amount of memory and mem/core clocks) trading blows with a 2nd tier highend that has 3/4 the ALUs, 5/6 the bandwidth and memory of its bigger brother.

    So Either the G80 design is just simply that good or ATI screwed the pouch. 3-4 more months may tell a different story after several driver releases, but right now, for me and I'm sure others aswell, the HD2900XT is nothing but a product with a hopeful future. It will be seen if it is a promising one. Glad I got my BFG8800GTS OC when I did. I've been gaming in bliss for about a month now. Glad I didn't wait to see the what if today, I'd have been pissed.
     
  10. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,245
    Likes Received:
    4,465
    Location:
    Finland
    Such a shame, if HD2-series card ever ends up in my computer, narrow & wide tent AA modes are out of the question, edge detect is the only possibly usable CFAA mode, at least there's still 2 to 8 MSAA modes aswell.
     
  11. JeffK

    Newcomer

    Joined:
    Oct 17, 2004
    Messages:
    43
    Likes Received:
    1
    Location:
    Palo Alto
    Right, I understand that a SIMD will feed into a single texture unit, but why the preference of SIMDs across different clusters sharing a texture unit versus SIMDs from the same cluster sharing a texture unit? Is this just to evenly mix the workload going to the texture units which could potentially be coming from different clusters doing drastically different types of work?
     
  12. tEd

    tEd Casual Member
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,105
    Likes Received:
    70
    Location:
    switzerland
    Dito , though there is still things i like on ati's side. EATM and performance/quality aAA modes.
     
  13. Geeforcer

    Geeforcer Harmlessly Evil
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,320
    Likes Received:
    525
    One issue I will take with "waiting for the better drivers" suggestion is that it almost assumes that while 2900 drivers improve drastically, G80 will lag behind. This, however, is hardly a replay of G70 Vs R520, which many seem to be drawing the erroneous parallel with, where former was an development of a well-established architecture while the latter was much more novel design.

    I would argue that now, the opposite is actually the case: at this point, AMD software engineers should be much more familiar with R600's inner workings (after all, this is "the second generation USA" and the ring bus has been around for two years now) than their counterparts at Nvidia, because on many fronts G80 represents a much bigger departure from their respective preceding designs than R600. In other words, if you want to aplly R520/G70 parrallel, G80 would seem to be a better candidate for the part of R520.
     
  14. Pete

    Pete Moderate Nuisance
    Moderator Legend

    Joined:
    Feb 7, 2002
    Messages:
    5,777
    Likes Received:
    1,814
    Some here: a decidedly mixed bag, at this point.

    Geeforcer, IIRC, R5x0 made the greatest advances with AA thanks to purported work on the MC. I'm guessing we'll see the same again, though this time they're not starting from as level a theoretical playing field WRT pixels exiting the ROPs per clock, so really no way they'll catch G80 GTX in ROP-limited scenarios. But G80 seems to use the same basic crossbar MC as before, while R600, though it's a second gen "ring bus," still seems to have changed somewhat from R5x0. So I'm thinking R600 will show more improvements with AA, but still won't reach G80GTX levels.

    But if you're talking about shader or scheduler efficiency, isn't the argument that R600's VLIW superscalar approach means a greater onus on the compiler, whereas G80's scalar ALUs may enjoy greater (natural) efficiency from the start? I guess the next question is how much efficiency is left to exploit for each GPU, and how exploitable is it (tho R600 may have more room for improvement, does AMD have the manpower/time to ameliorate this sooner rather than later in R600's life cycle?).
     
  15. Rys

    Rys Graphics @ AMD
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,182
    Likes Received:
    1,579
    Location:
    Beyond3D HQ
    Actually, I think the driver does use EATM in place of AAA. I'll check and make sure, but I think the AAA perf fix was to enable EATM instead.
     
  16. HKS

    HKS
    Newcomer

    Joined:
    Apr 26, 2007
    Messages:
    32
    Likes Received:
    17
    Location:
    Norway
  17. Galduta

    Veteran

    Joined:
    Apr 13, 2004
    Messages:
    1,046
    Likes Received:
    7
  18. ChrisRay

    ChrisRay <span style="color: rgb(124, 197, 0)">R.I.P. 1983-
    Veteran

    Joined:
    Nov 25, 2002
    Messages:
    2,234
    Likes Received:
    26
    So the driver no longer supports SS AA on alphas?
     
  19. tEd

    tEd Casual Member
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,105
    Likes Received:
    70
    Location:
    switzerland
    oh ok.
     
  20. SugarCoat

    Veteran

    Joined:
    Jul 17, 2005
    Messages:
    2,091
    Likes Received:
    52
    Location:
    State of Illusionism
    I will give the R600 this, it seems to be chewing up OpenGL games much better then any of the previous cores, all the way back to the R300, which i doubt is a coincidence. AOE3 performance is vastly improved as well which is also another title that caused all their previous cards to stumble. The performance seems very hit and miss which really strikes me as strange.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...