The NEXT LAST R600 Rumours & Speculation Thread

Discussion in 'Pre-release GPU Speculation' started by Geo, Mar 1, 2007.

Thread Status:
Not open for further replies.
  1. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    I agree. What Rys says implies he considers it a driver fault.

    I'd just like to see them tested, so far nought. No conception of the theoretical performance of G80's GS architecture and no idea how G80 is dealing with the variety of possibilities there.

    I guess we'll just have to see...

    Jawed
     
  2. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    It's also easier for someone to ask for an explanation than to go research the topic themselves and try to better their own understanding so that they can avoid making such silly statements in the future.

    First of all, instruction re-ordering is handled by the shader compiler so your statement there has no merit, at least I've never heard of OOE in a GPU. And even if it did, I fail to see how that would be relevant to an R600 to G80 comparison. Let's assume for kicks that GPU's did OOE in hardware. How is R600's ALU configuration more amenable to this than G80's?

    And that bit about storing temporary data for in-flight threads - that's the theme that all modern GPU's are built around. Not even sure how to respond to your comments about the number of instructions or branching since they really make no sense to me. Maybe somebody is willing to take a shot at it for ya.
     
  3. Rys

    Rys Graphics @ AMD
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,182
    Likes Received:
    1,579
    Location:
    Beyond3D HQ
    We can push peak filter rates (and for more than INT8 surfaces) out of R600 at this point (large and small textures too) with a new tester (w00t!), so it seems the driver and hardware is running freely in that respect, so it would depend on what the app is doing to poop on that somehow.

    Don't look forward to it too much! I get nervous around this time that I'm not going to have the time to put everything in there. I've already cut some stuff (which we'll talk about at some point though)......

    Which seems to say I should spend less time drinking tea and reading this thread, and more time hacking it up :lol:
     
  4. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    Are those peaks high peaks or low peaks? :smile:

    Awwwww :cry:
     
  5. Razor1

    Veteran

    Joined:
    Jul 24, 2004
    Messages:
    4,232
    Likes Received:
    749
    Location:
    NY, NY

    interesting

    LOL
     
  6. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    Unfortunately explaining is not simple.

    You need to think of time-sliced batch scheduling as the primary mechanism, with the GPU operating on a set of tens or hundreds of batches (per cluster). Instruction parallelism is something that's pretty much hidden and not (in my opinion) relevant to a discussion of load-balancing and overall batch throughput.

    This is a good starting point:

    http://www.beyond3d.com/content/articles/4/8

    Jawed
     
  7. SugarCoat

    Veteran

    Joined:
    Jul 17, 2005
    Messages:
    2,091
    Likes Received:
    52
    Location:
    State of Illusionism
    The R580 wasnt a massive leap in real world performance over the X1800XT in any respect. I believe the biggest advantages came at ultra high resolutions but for the most part the R520 was a good strong core and the R580 was simply a refresh of that giving a speed boost. In this case, with the R600, its looking like the problem is with the core itself so if what we're waiting on is substantial clock increases at .65nm then thats not very hopeful considering nVidia can do the same thing.

    The R520 also had a really good advantage, to me anyway, in improved high IQ performance, and substantially so over Geforce 7 parts, so if the IQ hasnt been improved yet again to something beyond that of the G80 even if its slower, then this card is a pass to me.

    Still got the inhouse tech demos to look forward too!
     
  8. tEd

    tEd Casual Member
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,105
    Likes Received:
    70
    Location:
    switzerland
    All the bench leaks , i'm surprised no driver has been leaked yet
     
  9. aeryon

    Newcomer

    Joined:
    Oct 5, 2006
    Messages:
    85
    Likes Received:
    3
    Location:
    France / China
    for what ? actually they last no more than 2 or 3 days before a new release comes out :lol:
     
  10. Razor1

    Veteran

    Joined:
    Jul 24, 2004
    Messages:
    4,232
    Likes Received:
    749
    Location:
    NY, NY
    Rys, just had a question, if the threads in flight are reduced when GS is being used, that will effect everything else right? It will become a "systemic" problem?
     
  11. 3dcgi

    Veteran Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    2,493
    Likes Received:
    474
    Unless the driver/compiler can detect that the shader will do no work GS threads must still be run and memory must be allocated. This will be true of both G80 and R600.
     
  12. Galduta

    Veteran

    Joined:
    Apr 13, 2004
    Messages:
    1,046
    Likes Received:
    7
    Very interesting .... R6 Las Vegas ,

    R6: Las Vegas, maximun settings

    2900XT/8800GTS:
    1024x768
    min:38/27
    med:74/60
    max:111/95

    1280x960
    min:26/18
    med:53/42
    max:83/68

    1600x1200
    min:19/13
    med:37/30
    max:70/48
     
    #5312 Galduta, May 12, 2007
    Last edited by a moderator: May 12, 2007
  13. Rys

    Rys Graphics @ AMD
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,182
    Likes Received:
    1,579
    Location:
    Beyond3D HQ
    In that it'll affect the performance of the entire chip? It has the potential to, of course, if there aren't enough available threads to keep throughput up.

    I think there are also cases (currently, and on G80) where a GS shader can have the thread count increasingly reduced to the point where only a relatively small number of them are running and possibly only on one cluster, because it's doing increased amounts of amplification.

    I think worst cases like that are possible on dynamically load-balanced architecture which has fixed resources, though. I haven't tested a shader like that heavily though, nor on R600 yet.
     
  14. mboeller

    Regular

    Joined:
    Feb 7, 2002
    Messages:
    923
    Likes Received:
    3
    Location:
    Germany
    From what I have read so far I conclude that the AA of the HD 2900XT is still "broken" because without AA the HD 2900XT is significantly faster than the GTS but with AA the card is slower.

    can someone confirm this?
     
  15. IbaneZ

    Regular

    Joined:
    Apr 15, 2003
    Messages:
    743
    Likes Received:
    17
    Interesting indeed.

    Why the hell doesn't R600 kick the living crap out of the 8800 GTS?

    Please, someone has to remind the 320 stream processors that's it's showtime. Wakey wakey. :lol:
     
  16. R300King!

    Newcomer

    Joined:
    Aug 4, 2002
    Messages:
    231
    Likes Received:
    5
    I think this is a HD2900XT here.

    [​IMG]

    FEAR
    [​IMG]

    CoH
    [​IMG]


    Are these good scores? :D
     
  17. Galduta

    Veteran

    Joined:
    Apr 13, 2004
    Messages:
    1,046
    Likes Received:
    7
    #5317 Galduta, May 12, 2007
    Last edited by a moderator: May 12, 2007
  18. Andrew Lauritzen

    Andrew Lauritzen Moderator
    Moderator Veteran

    Joined:
    May 21, 2004
    Messages:
    2,632
    Likes Received:
    1,250
    Location:
    British Columbia, Canada
    I specified a maximum of 3 vertices....

    Honestly on one hand I understand that there are a lot of really hard things to do with respect to the geometry shader. It really does break the parallelism and can easily be coded to bring *any* card to a halt. That said, they must have thought that some of the easy cases could be accelerated efficiently or else they wouldn't have added it to the spec (assuming MS isn't just totally off in left field). Still, the current performance of the G80 GS leaves something to be desired, but I'm willing to conceded that it could be partially or entirely driver related at this point.

    Still I won't be surprised if geometry shading is another dynamic branching wrt. NV40 vs R520...
     
  19. R300King!

    Newcomer

    Joined:
    Aug 4, 2002
    Messages:
    231
    Likes Received:
    5
  20. w0mbat

    Newcomer

    Joined:
    Nov 18, 2006
    Messages:
    234
    Likes Received:
    5
    single card WR!
    [​IMG]
     
Loading...
Thread Status:
Not open for further replies.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...