AMD GPU14 Tech Day Event - Sept 25'th

Discussion in 'Architecture and Products' started by Dave Baumann, Sep 20, 2013.

  1. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    What happens when the game coded with low-level details of the new revision runs on the old hardware?
    Exposing low-level details means there's no intermediary to intercept your mistakes. CPU architectures are rife with low-level details that stick around because you can't wantonly crash software on your platform because you suckered people into using feature X in a certain way.

    Maintaining a higher level of abstraction means the software doesn't have details on how the hardware achieves an end result, so if it's a new vector ALU, a re-ordered data path, or turtles, it's none of the software's business.

    Consoles already have low-level access, and they barely change the core architecture because they need consistency. The 360 shrink that put the GPU and CPU on the same die actually spent silicon on a fake bus unit that pretended to be a slow external bus so that the chip behaved identically to the old ones.
     
  2. Davros

    Legend

    Joined:
    Jun 7, 2004
    Messages:
    17,879
    Likes Received:
    5,330
    It falls back to dx11 (or whatever ) I guess
     
  3. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    The hypothetical is a new revision of a console, which wouldn't have that fallback.
    I don't think it would be acceptable even if there were. Unless Mantle brings nothing to the table--and in that case why bother, a fallback would lead to older consoles having an inferior experience.
     
  4. Dave Baumann

    Dave Baumann Gamerscore Wh...
    Moderator Legend

    Joined:
    Jan 29, 2002
    Messages:
    14,090
    Likes Received:
    694
    Location:
    O Canada!
    Mantle already deals with a number of architectural revs.
     
  5. Dominik D

    Regular

    Joined:
    Mar 23, 2007
    Messages:
    782
    Likes Received:
    22
    Location:
    Wroclaw, Poland
    These numbers are fairly meaningless w/o CPU usage numbers. Submitting from a single hardware thread at some point you'll become CPU limited - which is what these numbers indicate. If you calculate time per drawcall, you'll see that somewhere between 300 and 2100 drawcalls dude became CPU-bound.

    [​IMG]

    There's a mostly fixed CPU cost of each drawcall which is indicated by the almost flat line at the bottom. The reason it's not completely flat is that once you try pushing more and more drawcalls per frame, you decrease the number of "houskeeping" operations pushed down the pipe (clear, present, etc.) which also take time. So this time saved on, say, less presents gives you some extra time to perform draw calls, which shows up on the graph as decreased drawcall time.

    The bottom line is: yes, draws take time. This time is consumed by pushing data from UMD to KMD, memory operations (allocations, mapping, building paging buffers and what not) and can't be avoided if you want multitasking operating system that's responsive and works with more than one GPU type.
     
  6. lanek

    Veteran

    Joined:
    Mar 7, 2012
    Messages:
    2,469
    Likes Received:
    315
    Location:
    Switzerland
  7. Nemo

    Newcomer

    Joined:
    Sep 15, 2012
    Messages:
    126
    Likes Received:
    23
    One DP can up to three 4K displays via MST?
     
  8. Davros

    Legend

    Joined:
    Jun 7, 2004
    Messages:
    17,879
    Likes Received:
    5,330
    it says "1 display port with mst"
    Does that mean they supply a mst (they are $100)
    or that there is a mst built in and you just need some sort of dumb splitter
    it also says use "any of the outputs" that was not the case with the 6950 you had to use the displayport
    with an "active" adapter if you didnt have a dp monitor, this caused a lot of confusion for some people.

    ps: are the 2 dvi ports dvi-i or 1xdvi-i + 1xdvi-d as on the 6950 (bloody penny pinchers)
     
  9. Lille

    Newcomer

    Joined:
    Aug 17, 2012
    Messages:
    9
    Likes Received:
    0
    In a Linus Tech Tips video on youtube he mentions that it has 2 dual link dvi-d.
     
  10. sir doris

    Regular

    Joined:
    May 9, 2002
    Messages:
    708
    Likes Received:
    165
    Side says 2 x dual link DVI, so that would surely mean dvi -d.
     
  11. lanek

    Veteran

    Joined:
    Mar 7, 2012
    Messages:
    2,469
    Likes Received:
    315
    Location:
    Switzerland
    Looking the numbers of adaptaters including Active you have in the 7900 boxes.. I think its not a problem for them to include an Multi stream or splitters for DP .. should cost even less.
     
  12. KKRT

    Veteran

    Joined:
    Aug 10, 2009
    Messages:
    1,040
    Likes Received:
    0
    I was not CPU bound.

    This is my CPU utilization with less than 100 draw calls
    http://i2.minus.com/iOPjPdyK3T9a5.jpg

    With 23k draw calls (that spike before the end of the graph)
    http://i4.minus.com/iJq4S1WZS18z6.jpg

    With 100k draw calls
    http://i1.minus.com/ibaWNyioP6DyO5.jpg
     
    #332 KKRT, Sep 30, 2013
    Last edited by a moderator: Sep 30, 2013
  13. Psycho

    Regular

    Joined:
    Jun 7, 2008
    Messages:
    746
    Likes Received:
    41
    Location:
    Copenhagen
    Yes you were. CPU utilization falling to 25% on a quad core screams being limited by the draw submit thread.
     
  14. KKRT

    Veteran

    Joined:
    Aug 10, 2009
    Messages:
    1,040
    Likes Received:
    0
    It felt down to 25%, because it minimized SDK window when i took a screenshots of task manager.
     
  15. Dominik D

    Regular

    Joined:
    Mar 23, 2007
    Messages:
    782
    Likes Received:
    22
    Location:
    Wroclaw, Poland
    You could change core usage to 1 in msconfig and retest stuff with high priority process to get better results.
     
  16. KKRT

    Veteran

    Joined:
    Aug 10, 2009
    Messages:
    1,040
    Likes Received:
    0
    You mean setting Editor to only use one core? Editor is maxing out one core from the get go. With two cores enabled there is no performance increase.
     
  17. karlotta

    karlotta pifft
    Veteran

    Joined:
    Jun 7, 2003
    Messages:
    1,292
    Likes Received:
    10
    Location:
    oregon
    CoreParking?
     
  18. Andrew Lauritzen

    Andrew Lauritzen Moderator
    Moderator Veteran

    Joined:
    May 21, 2004
    Messages:
    2,629
    Likes Received:
    1,227
    Location:
    British Columbia, Canada
    If you weren't CPU bound on draw calls/setup (which I agree has yet to be proven), what is the point of the test? What are you even measuring that has any relevance to Mantle?

    Also this obviously isn't a great way to measure API overhead, since there's a lot more going on in an engine that relates to objects/instancing than just draw calls/3D API interaction. Typically you want to set up a microbenchmark that changes a specific set of state between each draw call (and the cost of the draw call will vary depending on which state that is!) and ensure that everything is offscreen/culled on the GPU.
     
  19. KKRT

    Veteran

    Joined:
    Aug 10, 2009
    Messages:
    1,040
    Likes Received:
    0
    That was only test how CryEngine 3 reacts to increasing draw calls, when nothing more is affected, except by creating some sprites.
     
  20. BRiT

    BRiT (>• •)>⌐■-■ (⌐■-■)
    Moderator Legend Alpha

    Joined:
    Feb 7, 2002
    Messages:
    20,502
    Likes Received:
    24,397
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...