AMD: R9xx Speculation

Discussion in 'Architecture and Products' started by Lukfi, Oct 5, 2009.

  1. no-X

    Veteran

    Joined:
    May 28, 2005
    Messages:
    2,455
    Likes Received:
    471
    Sorry, but isn't GF1xx obsolete as well? Or does its delayed launch make it fresher? nVidia still has no counterpart to the HD5770, and you blame ATi?

    Given the size of Juniper's die, ATi can reduce its price quite significantly. Juniper also has the full tessellation performance of the HD5870, and that's really not bad. Especially in DX11 games, Juniper offers better performance than GF106.
     
  2. Robert Varga

    Newcomer

    Joined:
    Jan 13, 2010
    Messages:
    26
    Likes Received:
    0
    Juniper is the sexiest one from the Evergreen family for me. Looking at the sales, I'm not alone...
     
  3. PSU-failure

    Newcomer

    Joined:
    May 3, 2007
    Messages:
    249
    Likes Received:
    0
    It's not "elegant", it's:

    - efficient (internal vertex data loop, via L1/L2?)
    - brutal (16 tessellation units)

    It's almost certain Evergreen has been engineered to reduce complexity, so NI has a medium to high chance of speeding up in this area.

    On that note, R600, with its ring bus and programming model (not "graphics" oriented at all, as gpusa shows), was very elegant, much more so than R700 and Evergreen.

    I think what Evergreen lacks is a general-purpose R/W L1, allowing the same efficiency as GF100 without the absurd waste.
     
  4. liolio

    liolio Aquoiboniste
    Legend

    Joined:
    Jun 28, 2005
    Messages:
    5,724
    Likes Received:
    195
    Location:
    Stateless
    And for me the sexiest GPU out of ATI's production in the last few years was the RV740.
     
  5. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    805
    Likes Received:
    1,634
    Maybe I've missed something, but what's elegant about the R600 programming model? It doesn't even support the very reduced CS4 preset.
     
    #2385 OlegSH, Sep 28, 2010
    Last edited by a moderator: Sep 28, 2010
  6. Dave Baumann

    Dave Baumann Gamerscore Wh...
    Moderator Legend

    Joined:
    Jan 29, 2002
    Messages:
    14,090
    Likes Received:
    694
    Location:
    O Canada!
    R/W aside, what do you think EV does differently in tessellation from GF100, according to that slide?
     
  7. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    Sorry?

    Edit: I mean, what do you mean by "I'd get Jack on a GeForce"? You're right about non-amplified stuff not staying in the SM. What I meant, but unfortunately did not write, was: it stays inside the SM, mostly without touching global memory and with no need to go back to the general scheduler.
     
    #2387 CarstenS, Sep 28, 2010
    Last edited by a moderator: Sep 28, 2010
  8. caveman-jim

    Regular

    Joined:
    Sep 19, 2005
    Messages:
    305
    Likes Received:
    0
    Location:
    Austin, TX
    I think he said that when no amplification is processed, nothing else happens either (it stalls), and the relevant contents of the SM get flushed to L2 and re-read?
     
  9. AlexV

    AlexV Heteroscedasticitate
    Moderator Veteran

    Joined:
    Mar 15, 2005
    Messages:
    2,535
    Likes Received:
    144
    I mean that DP isn't the only thing that gets trimmed on consumer cards.
     
  10. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    So, non-amplified geometry also takes unnecessary twists and turns through the chip's innards?
     
  11. PSU-failure

    Newcomer

    Joined:
    May 3, 2007
    Messages:
    249
    Likes Received:
    0
    From this slide, data flow *could* be different (feed-forward? I was under the impression it was a given), but it's some marketing slide after all.

    What I find elegant is the way everything is considered "data", plus the arch in itself.

    Shader model is clearly irrelevant there, as it's more of an implementation detail; the elegance is mostly a conceptual thing. I'd not be surprised if we were to see the resurgence of some of R600's characteristics some day.
     
    #2391 PSU-failure, Sep 29, 2010
    Last edited by a moderator: Sep 29, 2010
  12. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    On the surface this statement "feels wrong", but given the cyclical nature of things it's certainly possible. After all, some of the architectural concepts from P4 are now being revisited in Sandy Bridge (trace cache -> micro-op cache).
     
  13. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    It's certainly overkill for games but in targeted tessellation workloads the measured advantage over Cypress is over 6x and that's not even a fully enabled GF100. If you look at throughput per clock, the advantage goes up to 7.5x.

    Are the 15 polymorph engines and 4 GPCs of the GTX 480 taken together 6x larger than the front-end in Cypress? The efficiency question is hard to answer unless you know the relative costs.
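
    The per-clock figure above is just the measured ratio rescaled by the clock ratio. A rough sketch, assuming the comparison is a GTX 480 at a 700MHz core clock against an HD5870 at 850MHz (the post names neither clock, so both are assumptions), with a hypothetical helper `per_clock_advantage`:

```python
def per_clock_advantage(measured_ratio: float,
                        gf100_clock_mhz: float,
                        cypress_clock_mhz: float) -> float:
    """Rescale a measured throughput ratio to what it would be at equal clocks."""
    return measured_ratio * cypress_clock_mhz / gf100_clock_mhz

# Assumed clocks, not taken from the post.
GF100_MHZ = 700.0
CYPRESS_MHZ = 850.0

# An exact 6x measured advantage, normalised per clock.
normalised = per_clock_advantage(6.0, GF100_MHZ, CYPRESS_MHZ)

# The measured ratio that would correspond to 7.5x per clock.
implied_measured = 7.5 * GF100_MHZ / CYPRESS_MHZ

print(f"{normalised:.2f}x per clock, {implied_measured:.2f}x measured")
```

    With those assumed clocks, an exact 6x comes to roughly 7.3x per clock, and 7.5x per clock corresponds to a measured ratio of about 6.2x, which is consistent with "over 6x".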
     
  14. racca

    Newcomer

    Joined:
    Apr 3, 2010
    Messages:
    51
    Likes Received:
    0

    That's not true. If it were, there would have to be quite a few exceptions; in fact, almost as many as there are XTs called XX70.

    Juniper  (LE)  5670 (640SP version)
    Redwood  PRO   5570
    RV790    XT    4890
    RV740    N/A   4770
    RV710    N/A   4350/4550
    RV620    PRO   3470
    All X2 parts are codenamed RXX0 instead of RVXX0

    There's some truth to your claim here:
    They are NOT linked to each other. But on the other hand, there was indeed no XT chip marketed as XX50/XX30.
     
  15. racca

    Newcomer

    Joined:
    Apr 3, 2010
    Messages:
    51
    Likes Received:
    0
    Without saying the specs are true,
    if you actually look at them, 240TP@850MHz vs 288TP@725MHz is almost a tie in raw power. Add some tweaks here and there, increased ROP/rasterizer throughput etc., and it can be true.
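
    A quick sanity check of the "almost a tie" claim, as a minimal sketch: it assumes raw power scales as unit count times clock and ignores per-unit width, ROPs, bandwidth and everything else. The function name `raw_throughput` is made up for illustration; the figures are the rumoured ones quoted above.

```python
def raw_throughput(units: int, clock_mhz: int) -> int:
    """Unit count times clock, in unit-MHz (a crude proxy, not a benchmark)."""
    return units * clock_mhz

config_a = raw_throughput(240, 850)   # 240TP @ 850MHz
config_b = raw_throughput(288, 725)   # 288TP @ 725MHz

# Relative gap between the two configurations, as a fraction of the first.
gap = (config_b - config_a) / config_a

print(config_a, config_b, f"{gap:.1%}")   # 204000 208800 2.4%
```

    Under that assumption the two differ by only about 2%, which is the "tie" in question.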

    I call it bs or fud. Pure smoke. And I think, for our health's sake, we should quit smoking. :twisted:

    Ooooow, that's not good. 1H'11? We might as well be expecting a 28nm version of Cayman (~200sqmm) by then. A GloFo part if we are really lucky (out of curiosity about how good it can be).
     
  16. LordEC911

    Regular

    Joined:
    Nov 25, 2007
    Messages:
    877
    Likes Received:
    208
    Location:
    'Zona
    I just haven't heard anything overly concrete, but I would still expect it in Q1.
    The door isn't completely closed on 28nm Turks/Caicos, unless they are already in mass production and just need to build up massive stock for OEMs.
     
  17. no-X

    Veteran

    Joined:
    May 28, 2005
    Messages:
    2,455
    Likes Received:
    471
    I'm talking about high-end / midrange. Your examples are mainly low-end products, where a single GPU is used for SKUs of several product lines. It's not logical to apply principles valid in the low-end segment to the high-end.

    PRO / XT
    HD3850 / HD3870
    HD4850 / HD4870
    HD5850 / HD5870

    There's not a single reason to expect that HD6850 and 6870 will be based on different GPUs.
     
  18. Arnold Beckenbauer

    Veteran Subscriber

    Joined:
    Oct 11, 2006
    Messages:
    1,756
    Likes Received:
    722
    Location:
    Germany
    Nope. :twisted:
    HD2900...
    HD2000/3000(HD4200?) with UVD: you can use their UVD chip for decoding, CPU encodes the video.
    HD4000: SPs encode the video, and if you want, you can let UVD decode the video.
     
  19. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
    SPs are only being used for intermediate image processing (like deinterlacing, re-sampling, denoise), not the encoding phase?!
     
  20. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,245
    Likes Received:
    4,465
    Location:
    Finland
    If that's true, how on earth did they conjure up a transcoder which is leaps and bounds faster than any other encoder/transcoder I've ever used (I have an HD3800 series card)?
    Or are you suggesting that the majority of transcoding time goes into decoding rather than encoding?
     
  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.