AMD: R7xx Speculation

Discussion in 'Architecture and Products' started by Unknown Soldier, May 18, 2007.

Thread Status:
Not open for further replies.
  1. AlphaWolf

    AlphaWolf Specious Misanthrope
    Legend

    Joined:
    May 28, 2003
    Messages:
    8,474
    Likes Received:
    322
    Location:
    Treading Water
    They took it down shortly after it appeared, it was a couple hours ago.
     
  2. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    "Overall, the card is roughly fifteen percent faster than Radeon 4850, and it will sell for $329 in U.S. e-tail"

    We know he's wrong about the performance, but is it really going to sell for $329?
     
  3. Rys

    Rys PowerVR
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,156
    Likes Received:
    1,433
    Location:
    Beyond3D HQ
    You'll only be waiting for 2.5 hours.

    Random question answering from the thread and notes on things, since I won't have anything for NDA expiry (it's already 2am here, I'm not done, and I have work in the morning sadly):

    • FP16 filtering is half speed, and the samplers are limited by available interpolators (only 32 texcoords/clk) when processing INT8 at full speed
    • 260mm2, 960M transistors or so
    • Huge focus on area efficiency and perf/watt
    • Chip was pad limited in the beginning***, so the last couple of SIMDs are value adds that weren't originally planned for. Explains the first point a little bit.
    • ROPs are mostly 2x everywhere measured with MSAA and really help the chip go fast. ROP MSAA downfilter this time.
    • Seems to be 64KiB L1 per sampler with huge L1 bandwidth, new design
    • Finding peak rates everywhere on the chip has been easy. I've seen 1Tflop FP32, full bilinear rates and peak INT8 blend and Z-only writes (64 Zs/clock, yay!)
    • GDDR5 is really what lets the chip kick some real ass
    • GS perf is up compared to RV670, maybe a new (bigger) coalescing stream out cache there, and more threads in flight
    • Colour cache is per ROP, same as R6
    • 16KiB per SIMD shared memory + 16KiB global (not the SMX)
    • All 800 SPs can do integer, fat one only for specials still. 1 FP32 MAD or 1 FP<->INT or 1 integer op/clock for all
    • New caching scheme for the RF I think
    • Orlando-led design but very distributed in terms of teams. Scott Hartog led, and he worked on i740 at Intel
    • Over 5MiB of SRAM on-chip if you count every pool
    • New UVD block with 7.1 over HDMI
    • No ring bus MC, new controller nice and efficient due to new ROP design

    It's the single most impressive graphics processor (and pairing with a memory technology, nice one Joe!) I've ever seen, when looked at as a whole. I don't say that lightly either, there have been some winning chips over the years.

    Deeply impressive and really deserves to get ATI back on the map when it comes to performance 3D graphics. Sorry I won't have anything more filling for arch NDA expiry, go read hardware.fr, Tech Report and Morgoth's pieces if you want more data.

    *** :lol:^infinity, that's honestly the best thing ever
     
    #4723 Rys, Jun 25, 2008
    Last edited by a moderator: Jun 25, 2008
  4. AlphaWolf

    AlphaWolf Specious Misanthrope
    Legend

    Joined:
    May 28, 2003
    Messages:
    8,474
    Likes Received:
    322
    Location:
    Treading Water
    Thanks for filling in some blanks, I look forward to the article.
     
  5. Mat3

    Newcomer

    Joined:
    Nov 15, 2005
    Messages:
    163
    Likes Received:
    8

    Wow! So instead of 1 out of 5 being more capable, now they're all the same? That's amazing they added so many SPs and at the same time increased their functionality.

    Is there anything about the 4870X2?
     
  6. Rangers

    Legend

    Joined:
    Aug 4, 2006
    Messages:
    12,322
    Likes Received:
    1,120
    I believe 9800GX2 is EOL.
     
  7. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,015
    Likes Received:
    112
    Ahh that explains the int8 numbers... Test apps using the same coords for multiple textures shouldn't be showing this, though (are there any?)

    Not sure I follow this. Why would it be pad limited if it's larger than rv670? Shouldn't it have only a slightly larger pad count?
    Really? Surprising to see AMD would get back to fixed function resolve (for basic modes) - with tons of shader alus what's the point?

    Hmm I find that hard to believe. The pics don't indicate that.
     
  8. Farhan

    Newcomer

    Joined:
    May 19, 2005
    Messages:
    152
    Likes Received:
    13
    Location:
    in the shade
    zomg
     
    #4728 Farhan, Jun 25, 2008
    Last edited by a moderator: Jun 25, 2008
  9. Dave Baumann

    Dave Baumann Gamerscore Wh...
    Moderator Legend

    Joined:
    Jan 29, 2002
    Messages:
    14,079
    Likes Received:
    648
    Location:
    O Canada!
    FYI. This is the "96 Shader" chip Orton talked about some time ago. Things change.
     
  10. Arty

    Arty KEPLER
    Veteran

    Joined:
    Jun 16, 2005
    Messages:
    1,906
    Likes Received:
    55
    It is ~2MB for GT200?
     
  11. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    10,430
    Likes Received:
    432
    Location:
    New York
    Nice! It's hilarious that they had to add ALUs just to fill up space. That's impressive :lol:

    No doubt. It does look like one sleek mofo. Good job AMD/ATI.
     
  12. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    Where the heck are the texcoords interpolated? Odd that it doesn't scale with SIMD count.
    Why this wasn't always the case is beyond me...
    INT8 Z-only? What do you mean by that?
    Ridiculous. We gotta see some GPUBench numbers.
    Is this partly due to 1 ROP quad per memory channel, like NVidia's products since G80?

    That's some serious praise! I gotta agree with you, though, particularly when looking at ATI's recent track record. Before RV770, I honestly thought NVidia was just more talented.
     
  13. Rys

    Rys PowerVR
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,156
    Likes Received:
    1,433
    Location:
    Beyond3D HQ
    INT8 blend, sorry about that. My specials rate is wrong, only the fatter unit can do that (corrected that too). Integer for them all though.

    The controller seems designed around that being the case, where each quad ROP connects to a 64-bit memory partition and its L2 pool (and other caches).
     
  14. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    I think lots of apps do this, but I'm not sure about test apps.

    I thought 3DMark06 was like this because it always showed higher texture rate than D3DRightMark for G84 onwards, but obviously I was wrong.

    I guess it's the power/ground pins, and maybe some more for R700... :wink:

    Maybe the rate at which samples can be fed back to the shader is limited. Another reason could be that the shaders can do something else in the meantime.
     
  15. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    Okay. So you still mean 256 Z/s per clock? Are you talking about reads/tests (i.e. z-rejection rate) or writes?

    Does that mean 16xAA with almost no perf hit? :lol:
     
  16. ZerazaX

    Regular

    Joined:
    Oct 29, 2007
    Messages:
    280
    Likes Received:
    0
    So about an hour and a half left or so huh? This anticipation is killin me
     
  17. Rys

    Rys PowerVR
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,156
    Likes Received:
    1,433
    Location:
    Beyond3D HQ
    Let's pretend I didn't brainfart and x4 shall we :lol:
     
  18. Pete

    Pete Moderate Nuisance
    Moderator Veteran

    Joined:
    Feb 7, 2002
    Messages:
    4,943
    Likes Received:
    347
    Thanks for the talking points, Rys, and looking forward to the article (duh). Was AMD (read: Fusion) at all responsible for some of the left-field area efficiency increases, or is it all "ATI"?
     
  19. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    Sure :)

    The "96-shader" version of this (i.e. 480 SPs) that Dave alluded to is what R600 should have been, IMO. Probably would have been almost the same size, too.
     
  20. Rys

    Rys PowerVR
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,156
    Likes Received:
    1,433
    Location:
    Beyond3D HQ
    It's presumably all ATI, given the timescales for design, Nice question for us to ask Scott, though, or maybe Wavey knows and can spill the beans. 4am here and the sun is coming up, I'm out for a few hours.
     
Loading...
Thread Status:
Not open for further replies.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...