Huddy says "R600"

Discussion in 'Pre-release GPU Speculation' started by Geo, May 25, 2006.

Thread Status:
Not open for further replies.
  1. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    For how long now have people been asking about this tech? My bet is that they're already using it. That, or they never will.
     
  2. BByte

    Newcomer

    Joined:
    Jun 28, 2004
    Messages:
    83
    Likes Received:
    2
    Why are unified pipes smaller than traditional ones?
     
  3. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    Like I said, I'm thinking they'll be somewhat comparable to G71's pipes, so not just ALU's. I'm guessing similar to the ALU + (ALU or TEX) kind of shader pipe they've got going right now.

    I'm thinking 32 if they go triple issue or 40-48 if they stay dual issue. They'll be a bit more simplified from the current setup, maybe taking 4 cycles for FP16/I16/I32 filtering (I'm crossing my fingers for high precision integer filtering), initial AF setup, cube map calcs, etc. Maybe they won't provide enough register space to hit that peak most of the time in order to stay compact.

    I know that sounds like overkill, but again this ties into my theory that they'll leverage the current setup that they've perfected. With this design, I don't see them saving much space by reducing the texture rate, since the address calcs are done so fast, though I may be underestimating the ease of fetching data from the texture cache.

    Of course, I could be way off, and NVidia could decouple the TMUs like ATI did. Then they'd sort of be abandoning their current architecture, though, as your gut feeling dictates. My gut feeling says they won't go that route until they decide to go unified.
     
  4. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    I think SugarCoat was comparing to NVidia's pipes and pre-R5xx (with TMU logic). We already know that ATI added a lot of math power in R580 with only 60M transistors. This, of course, assumes they didn't take anything out of R520 that was "overbuilt".
     
  5. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    I don't think more than 2-cycle latency on FP16 makes sense.
     
  6. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    At first I was thinking the same thing, but when you do things on a quad by quad basis you can slow things down quite a bit while still saving space. I was just throwing ideas around, really. Xenos does FP16 filtering by converting to 16.16 and doing integer filtering at 1/4 rate, I assume by cascading the filtering logic (the filtering weights remain 8-bit or less). Some clamping could happen since FP16 range is more than 2^32, but it's a decent compromise IMO.
     
  7. Xmas

    Xmas Porous
    Veteran Subscriber

    Joined:
    Feb 6, 2002
    Messages:
    3,344
    Likes Received:
    176
    Location:
    On the path to wisdom
    Why not? Considering that the most common case of texturing involves compressed textures and that G70 is completely bandwidth limited for 32bpt and above, I'd say it makes perfect sense.
     
  8. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    Well, if you just consider that the G7x already is an extremely efficient piece of hardware in terms of die space, why should nVidia simplify things further?
     
  9. pc999

    Veteran

    Joined:
    Mar 13, 2004
    Messages:
    3,628
    Likes Received:
    31
    Location:
    Portugal

    IIRC it could only be used in a brand new architeture, so I would say it is R600 or never.
     
  10. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    Why? It's just an ALU layout library.
     
  11. Kombatant

    Regular

    Joined:
    May 29, 2003
    Messages:
    639
    Likes Received:
    19
    Location:
    Milton Keynes, UK
    Actually G80 is really Rampage 2 - but shhh! don't tell anyone :lol:
     
  12. Xmas

    Xmas Porous
    Veteran Subscriber

    Joined:
    Feb 6, 2002
    Messages:
    3,344
    Likes Received:
    176
    Location:
    On the path to wisdom
    To be able to add more units.
    Extremely efficient compared to what? It's not efficient to have units that can rarely sustain their peek processing rates. Maybe the additional cost was low. Maybe it wasn't, but c&p design was cheap.
     
  13. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    Compared to their competitor.
     
  14. EasyRaider

    Regular

    Joined:
    Oct 1, 2002
    Messages:
    431
    Likes Received:
    2
    Location:
    Norway
    edit: never mind
     
    #74 EasyRaider, May 27, 2006
    Last edited by a moderator: May 27, 2006
  15. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    My gut feeling also says one step at a time ;)
     
  16. rwolf

    rwolf Rock Star
    Regular

    Joined:
    Oct 25, 2002
    Messages:
    968
    Likes Received:
    54
    Location:
    Canada
    That transistor advantage might be moot when they slap GDDR4 on the R580.
     
  17. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    I don't see what that would have to do with anything. It's just a memory interface like any other.
     
  18. Geo

    Geo Mostly Harmless
    Legend

    Joined:
    Apr 22, 2002
    Messages:
    9,116
    Likes Received:
    215
    Location:
    Uffda-land
    Have we heard anything on ROPs for R600? Obviously they wouldn't be using the ones from Xenos. . . They "touched" the ones for R5xx, so one would think they'd want to leverage most of that work forward. . .but I'd think they're going to have to touch them again anyway for R600, certainly for de-coupling them since they won't be tied to quads anymore. Might anything else go in there as well since they have to revisit them anyway?
     
  19. pjbliverpool

    pjbliverpool B3D Scallywag
    Legend

    Joined:
    May 8, 2005
    Messages:
    9,235
    Likes Received:
    4,259
    Location:
    Guess...
    4 samples per pixel in a single cycle like Xenos. It seems like this feature is of more use in the PC than a console anyway.
     
  20. nAo

    nAo Nutella Nutellae
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    4,400
    Likes Received:
    440
    Location:
    San Francisco
    Umh..are you sure? I think it makes more sense on Xenos (due to edram)
     
Loading...
Thread Status:
Not open for further replies.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...