AMD: R9xx Speculation

Discussion in 'Architecture and Products' started by Lukfi, Oct 5, 2009.

  1. Dave Baumann

    Dave Baumann Gamerscore Wh...
    Moderator Legend

    Joined:
    Jan 29, 2002
    Messages:
    14,090
    Likes Received:
    694
    Location:
    O Canada!
    He actually means PHY. Both Redwood and Juniper are (or or less) pad limited 128-bit designs, given the smaller engine on Barts the higher speed PHY is not required anyway.
     
  2. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Well but they are indeed using parts rated for 5Ghz anyway it seems...
    Good point. Some key figures Cypress vs. Barts:
    Die size: 334mm² - 255mm²: -24%
    Transistors: 2.15B vs. 1.7B: -21%
    SIMDs: 20 vs. 14: -30%

    The trouble is we don't know the size of a simd (though Dave essentially said 2 simds are as big as 16 rops...). Based on the numbers quoted for rv770 from Jawed, taking into account scaling to 40nm and that they are possibly slightly more complex, I'd reckon around 6-7mm²? So skipping 6 of them would indeed only save around 35-40mm², i.e. only half the effective difference.
    There are 3 identified other sources of savings:
    - MC designed for lower clock
    - removal of DP
    - removal of second Xfire port
    How much those 3 items saves is anyone's guess (the tesselation/thread management improvements OTOH also probably have some hardware cost, though I'd guess it's small). It is also worth noting imho that transistor density went up a bit.
     
  3. Mintmaster

    Veteran

    Joined:
    Mar 31, 2002
    Messages:
    3,897
    Likes Received:
    87
    I'm pretty sure IHVs will find a way to save cost by clocking at 4.2GHz.

    Yeah, that's basically what I did in saying 10%, but maybe we underestimated it a bit. Could be 50 mm2, or 15% of the die.

    Still, there are things that can cost die space, too: faster tessellation, higher clock speed, better CrossFire scaling. All in all, Barts is pretty impressive.
     
  4. ECH

    ECH
    Regular

    Joined:
    May 24, 2007
    Messages:
    692
    Likes Received:
    30
    Hmm looks like the 10.10 hotfix is specifically for 6870 and 6850 after all.
     
  5. Broken Hope

    Regular

    Joined:
    Jul 13, 2004
    Messages:
    483
    Likes Received:
    1
    Location:
    England
    All previous cards are included in the INF, so it will install on other cards at least.
     
  6. ECH

    ECH
    Regular

    Joined:
    May 24, 2007
    Messages:
    692
    Likes Received:
    30
    Thanks!
     
  7. Kaarlisk

    Regular Subscriber

    Joined:
    Mar 22, 2010
    Messages:
    293
    Likes Received:
    49
    Maybe they simply do not think that bothering with power saving during UVD operation (and validating it) is worth the cost?

    I had two different HD4670 cards, their UVD clocks were identical to load clocks, and one had DDR3-800, the other GDDR3-1000. The UVD memory clocks were identical to load clocks, even though its apparent that they could have underclocked the memory on the faster card.
    Also, there does not seem to be a difference between the HD4650 and HD4670 in UVD capabilities, so it should at least be possible to clock HD4670 in UVD mode down to HD4650's clocks.

    The same argument works with the HD5770 - if the HD5570's memory bandwidth (900MHzx2x128bit) is enough for perfect UVD/HPTC operation, then it should be possible to downclock HD5870 to 225MHzx4x256bit for the memory.
    So it does seem that AMD simply do not consider it worthwhile to conserve memory power during video decoding; but this is a complete guess and they probably have excellent reasons for this.

    One cannot assume that they could downclock the cores that much, even if they have four times as many shaders, because the UVD unit may require a minimum frequency to work.
     
  8. Kaarlisk

    Regular Subscriber

    Joined:
    Mar 22, 2010
    Messages:
    293
    Likes Received:
    49
    For CPUs, various parts (caches, cores, memory controllers) have different transistor densities.
    So did the density go up because they removed the less dense units, or because they removed some extra redundancy due to a more mature 40nm process?
     
  9. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,708
    Likes Received:
    2,132
    Location:
    London
    I measure 36.8mm² of GDDR5 interface (just area for interface pads, "PHY") on RV770. Say 40% bigger than that in Cypress? That's around 15mm².

    Hints point to the 17th, redundancy, VLIW-5 block being dropped in Cypress. Texturing is also a bit more complex. LDS is double the capacity, has atomics logic and a more complex read/write infrastructure.

    The cores in RV770 are ~41% of the die. The dual-engine nature of Barts must cost quite a bit of extra die space in comparison with RV770's layout, but there's no real way to work that out. No idea how much that would scale with core count.
     
  10. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,708
    Likes Received:
    2,132
    Location:
    London
    :lol: We all forgot, Cypress gained girth due to doubled vias, supposedly - so that's another thing we can't account for in Barts's diet.
     
  11. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    The struggle over HAWX 2 seems to originate from the argument whether to use Adaptive Tessellation or not:

    1-Nvidia & Ubisoft say "No"
    2-AMD says "Yes"

    http://translate.google.com/translate?js=n&prev=_t&hl=en&ie=UTF-8&layout=2&eotf=1&sl=fr&tl=en&u=http%3A%2F%2Fwww.hardware.fr%2Farticles%2F804-17%2Fdossier-amd-radeon-hd-6870-6850.html

    Personally , I think Adaptive Tessellation could be the perfect replacement for LOD ,at least it should solve the annoying problem of geometry and detail pop-ups that plague all games nowadays .
     
  12. Harison

    Newcomer

    Joined:
    Mar 29, 2010
    Messages:
    195
    Likes Received:
    0
    Could Dave enlighten us if we'll see MLAA on 4xxx and 5xxx series cards? Several review sites mentioned its possible.
     
  13. BRiT

    BRiT (>• •)>⌐■-■ (⌐■-■)
    Moderator Legend Alpha

    Joined:
    Feb 7, 2002
    Messages:
    20,502
    Likes Received:
    24,399
  14. Sontin

    Banned

    Joined:
    Dec 9, 2009
    Messages:
    399
    Likes Received:
    0
    I run the benchmark and i think they are using adaptive tessellation. It would make no sense because nVidia is promoting it with their tessellation demos - oh and Stone Giant is using it, too.

    Maybe AMD thinks it could be more aggressive.
     
    #4014 Sontin, Oct 22, 2010
    Last edited by a moderator: Oct 22, 2010
  15. Harison

    Newcomer

    Joined:
    Mar 29, 2010
    Messages:
    195
    Likes Received:
    0
  16. brain_stew

    Regular

    Joined:
    Jun 4, 2006
    Messages:
    556
    Likes Received:
    0
    Any shots showing off the MLAA algorithm used in conjunction with MSAA surfaced, yet?
     
  17. EduardoS

    Newcomer

    Joined:
    Nov 8, 2008
    Messages:
    131
    Likes Received:
    0
    Wasn't the "Adaptive" part the initial goal of doing tesselation to start with?
     
  18. Tamlin

    Newcomer

    Joined:
    Jan 5, 2010
    Messages:
    146
    Likes Received:
    0
    Seems its hacked already for 5000 series cards. :D Gotta love those guy's at 3dcenter!

    Here's in English how:
    http://forums.guru3d.com/showthread.php?t=331108
     
  19. eastmen

    Legend Subscriber

    Joined:
    Mar 17, 2008
    Messages:
    13,878
    Likes Received:
    4,724
    So whats with the new FSAA option ? Will we get it on the 5x00 series ? I don't see either of these cards as a replacement for my 5850 at 5870 speeds if anything its a side step and currently i don't need anything more powerfull for my gaming. It be nice if the new fsaa comes to the 58x0 series , it seems like a really nice thing for eyefinity resolutions
     
  20. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,112
    Location:
    New York
    I find it amusing that as an AMD fan you aren't appalled that in this very thread AMD is telling you that it made more sense to reduce the default filtering quality on 5800 cards because raising the other cards (and the 6800 series) up to the 5800's level would affect more users.....I'm still trying to figure out what Dave was thinking by even making that statement in public. :) I guess the 5800's texture quality was also overboard and unnecessary.

    Anyway, you are proving my point. AMD is complaining/talking/whining about overblown tessellation workloads. Besides the irony that they found themselves in this position in the first place I'm saying instead of talking they should be out leveraging their 7 generations of tessellation hardware experience to influence devs in doing it the right way. You know, doing more than just talking.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...