AMD: R9xx Speculation

Discussion in 'Architecture and Products' started by Lukfi, Oct 5, 2009.

  1. Love_In_Rio

    Veteran

    Joined:
    Apr 21, 2004
    Messages:
    1,627
    Likes Received:
    226
    So, what do you think, guys? After a glance at the tidbits of the new architecture, could this indeed be a new R300, as rumoured?
     
  2. ECH

    ECH
    Regular

    Joined:
    May 24, 2007
    Messages:
    692
    Likes Received:
    30
    I think the performance of the cards would need to be seen first.
     
  3. Harison

    Newcomer

    Joined:
    Mar 29, 2010
    Messages:
    195
    Likes Received:
    0
    Indeed, but if AMD hasn't screwed up, it's possible to extrapolate rough estimates. I would be mighty surprised if, with such stats, the 6970 isn't faster than the GTX 580. Meanwhile the 6990, as someone joked, would provide a "useless" amount of firepower :wink:
     
  4. AlphaWolf

    AlphaWolf Specious Misanthrope
    Legend

    Joined:
    May 28, 2003
    Messages:
    9,470
    Likes Received:
    1,686
    Location:
    Treading Water
    It's not going to be an R300, because the GTX 580 is already out, and it's not NV30.
     
  5. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Don't forget that even with only one 32-bit int mul per clock, the absolute number is still (somewhat) higher than what a GTX 580 can do (which runs 32-bit int at half rate). For 32-bit int adds, that's more than twice as fast as the GTX 580 (unless, of course, that's scalar, in which case it'll drop to one 32-bit int add as usual). So that still looks plenty fast to me.
     
  6. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    And it can't be a new R300 (imho) anyway, since that was such a big leap in all areas, not only in performance but also feature-wise. Cayman is probably a nice improvement in performance (and it could be a decent improvement in perf/W too, which is getting more important), but it doesn't really bring anything new to the table feature-wise, methinks.
     
  7. Gipsel

    Veteran

    Joined:
    Jan 4, 2010
    Messages:
    1,620
    Likes Received:
    264
    Location:
    Hamburg, Germany
    That's a mistake. It can do four 32bit integer adds per cycle. An add can be done in each VLIW slot, same as in Cypress (and everything since R600).

    Edit:
    The ISA looks like this (no difference between generations):
    Code:
    x: ADD_INT     R0.x,  R1.x,  R2.x
    y: ADD_INT     R0.y,  R1.y,  R2.y
    z: ADD_INT     R0.z,  R1.z,  R2.z
    w: ADD_INT     R0.w,  R1.w,  R2.w
     
  8. hoom

    Veteran

    Joined:
    Sep 23, 2003
    Messages:
    3,261
    Likes Received:
    813
    Hmm, somehow I'd gotten into my mind that NV is doing fullspeed 32bit INTs :eek:
     
  9. Gipsel

    Veteran

    Joined:
    Jan 4, 2010
    Messages:
    1,620
    Likes Received:
    264
    Location:
    Hamburg, Germany
    That's what they said; tests have shown something else, iirc.
     
  10. Alexko

    Veteran Subscriber

    Joined:
    Aug 31, 2009
    Messages:
    4,541
    Likes Received:
    964
    Well, I hadn't even noticed that one! If it can do 4 FMAs/cycle, there's really no reason it shouldn't be able to do 4 ADDs, anyway.

    By the way, does anyone know when the NDA actually expires? I mean the NDA for this presentation, not benchmarks.
     
  11. ECH

    ECH
    Regular

    Joined:
    May 24, 2007
    Messages:
    692
    Likes Received:
    30
    That's my thinking as well. It appears that the 6950 should be on par (win some, lose some) with the competing current-gen card, but that remains to be seen. Another point of contention is the improved IQ with MLAA and EQAA, and what kind of performance one will get from them on those cards.
     
  12. digitalwanderer

    digitalwanderer Dangerously Mirthful
    Legend

    Joined:
    Feb 19, 2002
    Messages:
    18,987
    Likes Received:
    3,529
    Location:
    Winfield, IN USA
    Unfortunately, the people who know that also couldn't tell us even if they wanted to; NDAs are tricky that way. ;)
     
  13. eastmen

    Legend Subscriber

    Joined:
    Mar 17, 2008
    Messages:
    13,878
    Likes Received:
    4,724
    Future proofing doesn't exist. GF1x0 may be fast with tessellation, but in upcoming games another part of the hardware may be bottlenecked, resulting in those games performing as badly as they might on the Cypress platform. Or Cypress leading in other areas of performance might let it pull away.

    Anyway, there is a thread on tessellation.
     
  14. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Oh you're right. The whitepaper said only DP can't be dual issued but two int instructions can. Either that's just not true or it could be artificially limited for consumer parts?
     
  15. caveman-jim

    Regular

    Joined:
    Sep 19, 2005
    Messages:
    305
    Likes Received:
    0
    Location:
    Austin, TX
    Are those different?
     
  16. hoom

    Veteran

    Joined:
    Sep 23, 2003
    Messages:
    3,261
    Likes Received:
    813
    How and why would they make such a mistake?
    If they use only the mantissa part of the FP unit, it can do only 24-bit INT, unless they have 48-bit FP capability.
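
    To illustrate the 24-bit point: an IEEE 754 single-precision float carries a 24-bit significand (23 stored bits plus an implicit leading one), so integers are exact only up to 2^24. The sketch below is a host-side check in plain Python, not a statement about how any GPU datapath is actually built; the helper names `to_float32` and `exact_in_float32` are made up for this example.

```python
import struct


def to_float32(n: int) -> float:
    """Round-trip an integer through IEEE 754 single precision."""
    return struct.unpack("<f", struct.pack("<f", float(n)))[0]


def exact_in_float32(n: int) -> bool:
    """True if n survives the float32 round trip unchanged."""
    return to_float32(n) == n


limit = 2 ** 24  # one past the largest 24-bit unsigned value

# Integers up to 2**24 fit in the 24-bit significand; 2**24 + 1 does not.
print(exact_in_float32(limit - 1))
print(exact_in_float32(limit))
print(exact_in_float32(limit + 1))

# The product of two 24-bit significands can need up to 48 bits,
# which is where a 48-bit intermediate width would come from.
print(((limit - 1) ** 2).bit_length())
```

    The last line is only the bit-width arithmetic behind a 48-bit intermediate result; whether a given chip exposes that width to integer instructions is a separate question.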
     
  17. EduardoS

    Newcomer

    Joined:
    Nov 8, 2008
    Messages:
    131
    Likes Received:
    0
    And in many cases for the entire kernel. Half-rate 32-bit add would be unacceptable; fortunately, that's not the case.

    32-bit mul at quarter rate is OK but not good; half rate would be good. Since the hardware is capable of 52-bit multiplies at quarter rate, couldn't it be modified a little to allow 32-bit mul at half rate? :smile:
     
  18. EduardoS

    Newcomer

    Joined:
    Nov 8, 2008
    Messages:
    131
    Likes Received:
    0
    With FMA they need 48-bit adders for correct results.

    Anyway, 32-bit adders are way too cheap not to include...
     
  19. RecessionCone

    Regular Subscriber

    Joined:
    Feb 27, 2010
    Messages:
    505
    Likes Received:
    189
    Actually, Fermi has full-rate 32-bit int add operations. I just wrote a CUDA kernel to test it out on my GTX 480, and got 644 Giga integer adds/second. The full-rate peak would be 1.4 GHz * 480 CUDA cores = 672 Giga integer adds/second.

    Trying the same kernel with 32-bit int mul operations gave 331 Giga integer muls/second, which does appear to be half rate.
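
    For anyone who wants to redo the arithmetic, here is a small sketch using only the figures quoted in this post (1.4 GHz shader clock, 480 lanes for the GTX 480's CUDA cores, 644 and 331 Gops/s measured). The function names are made up for this example, and it assumes one operation per lane per clock at full rate.

```python
def peak_gops(clock_ghz: float, lanes: int, ops_per_lane_per_clock: float = 1.0) -> float:
    """Theoretical peak in giga-operations per second."""
    return clock_ghz * lanes * ops_per_lane_per_clock


def fraction_of_peak(measured_gops: float, peak: float) -> float:
    """Measured throughput as a fraction of the theoretical peak."""
    return measured_gops / peak


# Figures quoted in the post above.
full_rate_peak = peak_gops(clock_ghz=1.4, lanes=480)
add_fraction = fraction_of_peak(644.0, full_rate_peak)
mul_fraction = fraction_of_peak(331.0, full_rate_peak)

print(f"full-rate peak: {full_rate_peak:.1f} Gops/s")
print(f"int add: {add_fraction:.1%} of peak")
print(f"int mul: {mul_fraction:.1%} of peak")
```

    The add result lands at roughly 96% of the full-rate peak and the mul result at roughly 49%, which is what you would expect from full-rate adds and half-rate muls.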
     
  20. hoom

    Veteran

    Joined:
    Sep 23, 2003
    Messages:
    3,261
    Likes Received:
    813
    Ahh, but still why would ATI be quoting 24bit only?

    Perhaps for a few but when you have 1920 of them?
     