AMD: Southern Islands (7*** series) Speculation/ Rumour Thread

Discussion in 'Architecture and Products' started by UniversalTruth, Dec 17, 2010.

  1. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
    I'm more disappointed by the lack of Z-rate improvements, given the steady bump in memory throughput.
     
  2. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    The ROPs on AMD do 4x more z/stencil ops, and this has been there for a long time now, presumably by using the 4 channels. So that much seems unlikely to change. Which leaves the Z rate a function of ROPs.
     
  3. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,247
    Likes Received:
    4,465
    Location:
    Finland
    Can't be that; we already saw the board with 2x6pin, which means 7950, and it had 12 mem chips.
     
  4. anexanhume

    Veteran

    Joined:
    Dec 5, 2011
    Messages:
    2,078
    Likes Received:
    1,535
    And with a supposedly better architecture in GCN, it's hard to swallow that it's only 20% better than the 580.
     
  5. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    We should wait until we see the die size and TDP confirmed.

    The target die size is lower and the stock power limits probably cap performance in a power-dominated situation.
     
  6. DarthShader

    Regular

    Joined:
    Jul 18, 2010
    Messages:
    350
    Likes Received:
    0
    Location:
    Land of Mu
    Maybe those transistors went into power saving features and GPGPU features: the caches, ECC, etc.
     
  7. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    At least some of the transistors went there. Some power-saving techniques such as clock or power gating can also consume more die area than transistors, since they involve physically larger gates and circuits.

    I am curious if the < 3W idle power means there is power gating involved.
     
  8. anexanhume

    Veteran

    Joined:
    Dec 5, 2011
    Messages:
    2,078
    Likes Received:
    1,535
    My impression was that AMD was decidedly less focused on the GPGPU side and that GCN was supposed to be more optimized for gaming workloads compared to their 4+1 scheme of yore.

    Wouldn't it HAVE to be? With that many transistors, leakage alone would exceed that figure without mitigation techniques like power gating, in my mind.

    edit: Quick calc. It's safe to assume at least 10% power loss due to leakage. On a 300W card, that's 30W. Even if you downclock to 1/10 of the frequency and leakage tracks with frequency, you're still looking at 3W, and you've not allowed for anything but leakage in that 3W. I just don't see how it's possible without power gating.
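
    A minimal sketch of the quick calc above, using only the figures from the post (300W card, 10% leakage, clocks cut to 1/10). The function name and the linear leakage-versus-clock scaling are assumptions made for illustration; real leakage depends mostly on voltage and temperature, so treat this as back-of-envelope arithmetic only.

```python
def idle_leakage_watts(board_power_w, leakage_fraction, clock_scale):
    """Leakage power under the post's simplifying assumption that
    leakage scales linearly with clock frequency (no power gating)."""
    full_load_leakage_w = board_power_w * leakage_fraction
    return full_load_leakage_w * clock_scale


# Figures taken from the post: 300 W card, 10% leakage, clocks at 1/10.
full_load_leakage = idle_leakage_watts(300.0, 0.10, 1.0)
idle_leakage = idle_leakage_watts(300.0, 0.10, 0.10)

print(f"leakage at full clocks: {full_load_leakage:.1f} W")
print(f"leakage at 1/10 clocks: {idle_leakage:.1f} W")
```

    Under these assumptions the leakage term alone already uses up the whole rumoured < 3W idle budget, which is the point of the argument.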
     
    #1388 anexanhume, Dec 16, 2011
    Last edited by a moderator: Dec 16, 2011
  9. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    4+1 was a better match to game shaders, since it devoted the +1 to a lot of specialized functionality that either did not get used in scientific computing or lacked the precision to be used for serious computation.
     
  10. anexanhume

    Veteran

    Joined:
    Dec 5, 2011
    Messages:
    2,078
    Likes Received:
    1,535
    Interesting. I'd love to see two cards as closely matched as can be (ALU count, core and memory clocks etc.) run the same benchmarks and see how GCN fares.
     
  11. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    A lot of other features of the design have changed outside of the ALU composition, so I would doubt GCN would lose unless the 4+1 design also inherited the new scheduling hardware and memory pipeline.

    When comparing the 69xx to the 68xx series there were some games where the 4+1 architecture's slightly higher peak performance helped it match or beat the VLIW4.
    In terms of specialized functions, at least synthetic benchmarks showed that the penalty to FP throughput due to formerly T-slot instructions occupying the ALUs in VLIW4 was measurable.
    The problem was worse for some instructions than others, as some like sin and cos required setup code that took up slots anyway.
     
  12. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
    The VLIW-5 construct was a compromise for both pixel and vertex workloads in a unified shader architecture. It matched pretty well the dominant co-issue types in graphics, like vec3+1, vec4 and vec4+1 in a singular instruction bundle.
    Abstract compute workloads have much more variable combinations.
     
  13. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,247
    Likes Received:
    4,465
    Location:
    Finland
    From what I can recall about those different architectures, VLIW5 was definitely best for gaming workloads, while VLIW4 was better suited to GPGPU without sacrificing much gaming speed.
    GCN is a bit of a mystery; it was definitely made with GPGPU in mind, but should be great for games too. We can see from nVidia products that 1D at least can work great with games and with GPGPU, but GCN isn't quite the same even if it is 1D.
     
  14. Man from Atlantis

    Regular

    Joined:
    Jul 31, 2010
    Messages:
    961
    Likes Received:
    855
  15. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    With "only" 65% more transistors than the 6970, something like ~50% faster still doesn't sound that great (especially considering Cayman didn't have the best perf/area ratio), though it's not too bad. The speculated die size (similar to Cayman) is indeed rather large though; I might be missing some details, but theoretically you could fit twice as many transistors on 28nm compared to 40nm within the same area.
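
    A rough sketch of the scaling arithmetic behind that estimate. The 65% and ~50% figures are the rumoured numbers quoted in the post; the assumption that area scales with the square of the node name, and all variable names, are illustrative idealizations rather than anything AMD or TSMC has stated.

```python
def ideal_density_gain(old_node_nm, new_node_nm):
    """Ideal transistor-density gain from a shrink, assuming area
    scales with the square of the node name (an idealization)."""
    return (old_node_nm / new_node_nm) ** 2


ideal_gain = ideal_density_gain(40.0, 28.0)  # roughly 2x, as the post says
rumoured_transistor_gain = 1.65              # "65% more transistors"
rumoured_speedup = 1.50                      # "~50% faster"

# Share of the ideal shrink actually used if the die area is unchanged.
density_utilisation = rumoured_transistor_gain / ideal_gain
# Performance per transistor relative to Cayman (1.0 means unchanged).
perf_per_transistor = rumoured_speedup / rumoured_transistor_gain

print(f"ideal density gain 40nm -> 28nm: {ideal_gain:.2f}x")
print(f"share of ideal density used:     {density_utilisation:.0%}")
print(f"perf per transistor vs Cayman:   {perf_per_transistor:.2f}")
```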
     
  16. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,247
    Likes Received:
    4,465
    Location:
    Finland
    Wait what?
    "around 30% faster than 6970, should put it somewhere around 6990" :???:

    Unlike OBR, Fudzilla says it's better in gaming though.

    edit:
    For clarification, the 6990 is around 60% faster even at a mere 1920x1200.
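
    To put numbers on the mismatch, a small sketch with the 6970 normalized to 1.0. The 30% and 60% figures are the ones quoted in this post; the variable names are just for illustration.

```python
# Relative performance with the 6970 normalized to 1.0.
hd6970 = 1.00
hd6990 = hd6970 * 1.60          # "around 60% faster" than the 6970
rumoured_card = hd6970 * 1.30   # "around 30% faster than 6970"

# How much faster the 6990 would still be than the rumoured card.
gap_to_6990 = hd6990 / rumoured_card

print(f"6990 lead over a card 30% faster than the 6970: "
      f"{(gap_to_6990 - 1) * 100:.0f}%")
```

    With those inputs the 6990 would still hold a lead of over 20%, so "around 6990" does not follow from "30% faster than 6970".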
     
  17. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    In theory it could be almost 2x the transistors. They weren't going to get 1/2 the power consumption per transistor to be able to use them.
    Density would also have been impacted by the wider interface and memory controller, since those don't contribute as much to the transistor count, but do consume area.
    Power gating, if in use, also takes up more physical area than it contributes in transistors.
     
  18. air_ii

    Newcomer

    Joined:
    May 2, 2007
    Messages:
    134
    Likes Received:
    0
    Unless AMD miscalculated the transistor count by some 1 billion (as with Bulldozer) ;).
     
  19. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    The bandwidth is 50% higher, so that's an upper bound right there.
     
  20. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,382
    Does anyone, by any chance, have a pointer to sites with benchmark shmoos for shader and memory clocks? Especially where wide ranges are used, not just minor overclocks.

    It'd be interesting to see how big of a factor external BW really is.
     