ATI RV740 review/preview

Discussion in 'Architecture and Products' started by LunchBox, Feb 25, 2009.

  1. Silent_Buddha

    Legend

    Joined:
    Mar 13, 2007
    Messages:
    19,426
    Likes Received:
    10,320
    Aren't the transistors for the ALU's significantly more densely packed than the rest of the chip? If so, isn't it possible that it has more transistors than just the die area alone would suggest?

    Regards,
    SB
     
  2. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    Transistor density will vary all over - typically memory is much denser. e.g. Cell has 4x the density in its local store memory than for general logic within each SPE.

    With the clusters there's a lot of memory - out of those 389M transistors there's 2.5MB of register file (thats counting 16 of the 17 "pixels" - the 17th is for redundancy) and a load of L1 cache and some LDS and GDS.

    Outside of the clusters there's L2 cache and various buffers, including those associated with the RBEs.

    A quick check with a die photo indicates that what looks like register file is ~32% of the area of a "pixel". Without knowing the real ratio between the density of the memory and the rest we're stuck.

    So the transistor density guessing game is pretty naive and really just for entertainment.

    Jawed
     
  3. Arty

    Arty KEPLER
    Veteran

    Joined:
    Jun 16, 2005
    Messages:
    1,906
    Likes Received:
    55
    Derek Wilson from Anandtech's Video Card Buyer's Guide - Spring 2009

    Seems like a strong premature recommendation for the RV740 unless they are talking about the 9800GT Tipexx edition.
     
  4. keritto

    Newcomer

    Joined:
    Apr 3, 2009
    Messages:
    143
    Likes Received:
    0
    This makes you sound like some ati's marketing executive :lol: I don't se why would be i too happy about die shrinkage if i don't get anything for it. Pretty much the same power consumption and what bothers me and seeem to be the fact only an single precision operation on RV740?! How can they call it dx10.1?? envidia wins again another match like with D3c which make it's way out after R200/R300 generation.:???:

    And 256mm2 divided by 2 (in best circumctances, not even with obviousleakage-power problem on 40nm they mentioned) isn't small as 125mm2. And shaders are lot smaller in weight on overall die size than TMUs and RBEs that are roughly the same (40vs32 TMUs?) Well after all that's what makes ATi's architecture win over nV.
     
  5. AlexV

    AlexV Heteroscedasticitate
    Moderator Veteran

    Joined:
    Mar 15, 2005
    Messages:
    2,535
    Likes Received:
    144
    So you know how the RV740 looks, that's good. The bolded part makes no sense, please elaborate.
     
  6. AnarchX

    Veteran

    Joined:
    Apr 19, 2007
    Messages:
    1,559
    Likes Received:
    34
    G92b (55nm) is 270mm² and RV740 136mm² according to AMD.

    Of course still a big difference, but I think we can consider, that 40nm waferprices are higher than 55nm ones and the yields also should be lower.

    On the other side, folks @Chiphell reported that HD4770 needs a 8-layer PCB, while 98 GT is now at the most partners 6-layer.
    Both cards use 8 memory chips, HD4770 I think because of lower prices of 512Mbit 0.55ns chips over 0.5ns 1Gbit ones.

    And in the end, GT215/214 (NVs 40nm competitor) should not be so far away.
     
  7. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    What you mean is 3dc, and it was in R4xx generation. And it did not disappear, but was integrated into DirectX, because it was found to be quite useful. :)
     
  8. keritto

    Newcomer

    Joined:
    Apr 3, 2009
    Messages:
    143
    Likes Received:
    0
    Even their presenter calls that dce a hog :grin: anyway it'll be pretty interesting when microcrap shouled try to explain to us why they use dce on obsolete engines when they're released dx9.0b on Vista that introduced dx10, and dx10.1 on NT7 that'll presumably have new dx11.

    On the other hand they have "compatibility desktop mode" for Vista for pre-dx9.0b based cards so they'll have it for in their new bloat :cry: and they'll ebven not try to fix it cause as we know dx10 is not so much prehistoric to dx10.1 as dx9.0b might be for ms sake to dx7+

    And that on rv740 losing double precision of their older r700 gang sisters. Doesn't that make it only dx10 compatible cause dx10.1 requires double precision shader operation?

    Unfortunately i read that moderately long thread. And i give you referrence above. And it seems you lock me from posting so i dont see how you mean to elaborate it.

    Hm that doesnt's sounds reasonable to use 8-layer pcb on 128-bit wide memory bus. Shouldn't they jump over onto 128-bit just to reduce that pcb layers from 6 to 4 or something?
     
  9. Dave Baumann

    Dave Baumann Gamerscore Wh...
    Moderator Legend

    Joined:
    Jan 29, 2002
    Messages:
    14,090
    Likes Received:
    694
    Location:
    O Canada!
    Double Presicion is FP64 - thats not a DX requirement yet. DPFP is just something thats being optionally implemented for GPGPU purposes.
     
  10. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,245
    Likes Received:
    4,465
    Location:
    Finland
    Win7 is NT6.1, not NT7, and DX11 is coming to both it and Vista
    Where did you pull DX9.0b to this? Basic DX9 PS2.0 support is only thing required (among with memory) for Aero / DCE

    Like already said, FP64 isn't required by DX10.1
    Also, these ARE 128bit, so no, it doesn't reduce the layers to 6 or 4 or something necessarily.
     
  11. v_rr

    Newcomer

    Joined:
    Apr 30, 2007
    Messages:
    147
    Likes Received:
    0
    Glad to see that there are still people with faith and hope.

    RV740 is going on Desktop and notebook. In news the AMD wafers to TSMC will jump in production for the next months coincident with RV790 and RV740 release.
     
  12. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    I'm not so sure about yields. Even if defects are higher per area on 40nm, the size difference could make that difference disappear (though that's assuming that defects per area aren't sky-high of course). And even if yields are somewhat lower and waferprice is higher, a factor 2 in size is quite something.
    This is interesting. With only 128bit memory bus you'd think routing would be easier (sure memory bus runs at high frequency but gddr5 should also help with that, with the compensation for unequal trace length).
    Both Hynix and Samsung also offer slower 1Gbit chips - no idea about prices...
    Using 8 chips would also make it possible to use the same pcb layout for 1GB cards. Dunno if we'd see anything like that, maybe cost of gddr5 is too high (that's really the big unknown factor here I guess).
    It has been really quiet lately about GT212/214/216, I wonder what's up with that...
     
  13. Vincent

    Newcomer

    Joined:
    May 28, 2007
    Messages:
    235
    Likes Received:
    0
    Location:
    London

    I thought this time that both Nvidia and ATI will come up with multi-chip solution from mid-range to high end. The cost/performance has been attested by the GTX280 with its 512bit bus, which only outperform RV770XT by small margin, not only in theortical benchmark, but also in real testing game.


    My 2010 mainstream prediction : GTX280 level GPU with 128bit GDDR5 7GHz
     
  14. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    ATI's ALUs are 16-in-17 redundant, so defects in that portion of the die have to be very severe to kill it.

    Jawed
     
  15. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Is that a known fact?
     
  16. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    Not if the relevant patent documents and die photos aren't enough evidence.

    Jawed
     
  17. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Thanks - must've missed all that.
     
  18. rjc

    rjc
    Regular

    Joined:
    Oct 27, 2008
    Messages:
    270
    Likes Received:
    0
    A few pages back Anarchx posted it168 link showing the current and proposed amd lineup. There were 512M and 1G variants of the 4850(a $20 difference) and 4870(a $30 difference) so very very roughly GDDR5 is 50% more expensive. Last year gpu cafe reckoned it carried a 20-40% premium. In february they are saying samsung and hynix are in mass production, still looking for any cards actually carrying this memory. A couple of weeks ago samsung announced they are shipping their 50nm DDR3 parts, so i suppose the previously announced 50nm GDDR5 shouldnt be too far behind.

    The GT218 is supposed to debut next month, the GT216 shortly after, i think these are mainly intended as oem parts. The GT215(a 192 bit G92) is delayed till they can clear the G92/G94 stock, it is approx the same cost to produce as current 55m parts so nvidia are not going to hurry with it.
     
    #298 rjc, Apr 6, 2009
    Last edited by a moderator: Apr 6, 2009
  19. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    But current 55nm parts cost nothing to produce, since they're nventory.

    Jawed
     
  20. rjc

    rjc
    Regular

    Joined:
    Oct 27, 2008
    Messages:
    270
    Likes Received:
    0
    Unless the inventory is written off, the production and maybe storage costs are still attached. There are 2 problems intertwined 1) maybe 100 days worth of inventory 2) unit cost of replacement part is roughly the same.
    This means normal strategy of producing new part to reduce the overall average unit costs of the inventory wont work. ie old part $30, new part $25 therefore if can produce enough new part average unit cost approaches $25 => will sell faster as can sell old stock at new lower price.

    I think they are hoping 40nm yields will improve enough with time to get the above situation happening. Is a warning to others not to try a large chip on 40nm.

    Back on topic - from other thread previously posted amd is ramping its wafers quite a bit over the next couple of months....what are they producing? Only a small proportion will be RV790, is it RV740 or something else?
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...