AMD: R8xx Speculation

Discussion in 'Architecture and Products' started by Shtal, Jul 19, 2008.

?

How soon will Nvidia respond with GT300 to upcoming ATI-RV870 lineup GPUs

Poll closed Oct 14, 2009.
  1. Within 1 or 2 weeks

    1 vote(s)
    0.6%
  2. Within a month

    5 vote(s)
    3.2%
  3. Within couple months

    28 vote(s)
    18.1%
  4. Very late this year

    52 vote(s)
    33.5%
  5. Not until next year

    69 vote(s)
    44.5%
  1. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
    Meh, it's too late. The genie is out of the [strike]whoopass can[/strike] bottle. :smile:
     
  2. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    That's what marketing's all about, isn't it? And AMD has done a fantastic and highly efficient job wrt to marketing for HD5k.
     
  3. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,112
    Location:
    New York
    That's what that whole tiff boiled down to. Whether the definition of superscalar implies dynamic scheduling/issuing of instructions by the hardware to the various available execution units. As far as I know, it does.
     
  4. bridgman

    Newcomer Subscriber

    Joined:
    Dec 1, 2007
    Messages:
    62
    Likes Received:
    123
    Location:
    Toronto-ish
    Yeah, that's the problem in a nutshell. The term "dynamic superscalar" clearly implies dynamic extraction of ILP. Once you remove the word "dynamic" things get fuzzy; you see papers talking about "static superscalar" meaning VLIW.

    I even found one paper that distinguished between "static superscalar" and VLIW depending on whether instruction N could use the results of instruction N-1. By that definition "VLIW" needed the equivalent of a delay slot while "static superscalar" did not. Perversely enough, since the 6xx/7xx shaders can always access the results from the immediately preceding ALU instruction via the PS/PV registers by that definition the 6xx+ shaders are superscalar and *not* VLIW :D

    http://courses.ece.ubc.ca/476/www200.../Lecture29.pdf

    I looked at maybe 50 links; roughly 2/3 seem to say that VLIW is *not* a type of superscalar architecture and the rest said that it was.
     
  5. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    \appeal to authority

    Hennesey Patterson, page 115,

    they describe 3 kinds of superscalar processors

    • static, aka in order eg, ARM,
    • dynamic, aka ooo but no speculation, no examples
    • speculative, aka ooo with speculation, modern x86

    so plain superscalar is an ambiguous term. When somebody uses just the word superscalar, I take it to mean static superscalar. Your definitions/conventions/tastes may vary...
     
  6. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    I've not seen anyone turn up their nose at a design that can fetch, issue, and execute multiple instructions or the equivalent of multiple independent instructions at once, regardless of method, which is an implementation detail.

    When I hear or read someone describe a core as being superscalar, I assume that the design can generally process more than one instruction at a time.
    I say generally because designs typically are not set up to support full issue/decode/execution for every combination of instructions possible at their given width, and some are much more limited than others.


    I am curious where people would put a design capable of fetching and issuing multiple instructions, with the caveat that the design eschews dependence checking by doing a scalar fetch from multiple threads.
     
  7. dkanter

    Regular

    Joined:
    Jan 19, 2008
    Messages:
    360
    Likes Received:
    20
    That's not superscalar - the best example is Niagara 2 which is decidedly not superscalar.

    Superscalar implies that you can (under most circumstances) fetch, issue, execute and retire multiple instructions in a single cycle from a single thread.

    David
     
  8. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    I meant within a core, and at that level Niagara is single-issue.

    edit: or is it? That's how I remember it being presented.

    edit edit: Sorry, I read that too fast, you said Niagara 2.
     
  9. kresek

    Newcomer

    Joined:
    Apr 4, 2007
    Messages:
    14
    Likes Received:
    4
    Think of the first Pentium - an in-order, superscalar core. The same applies to early UltraSPARCs, Alphas, or even IBM's POWER6 - superscalar, albeit in-order; not VLIW at the same time. But as for VLIW machines, ILP extraction relied on compile time instruction scheduling. Guess some of you would call this "static superscalar".
     
  10. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    The operative question is whether anyone would be confused by just calling a superscalar core "superscalar".

    If the chip exploits ILP (per the earlier clarification) by fetching and executing multiple instructions from a single thread at the same time, it is superscalar.
     
  11. kresek

    Newcomer

    Joined:
    Apr 4, 2007
    Messages:
    14
    Likes Received:
    4
    After beating the "what makes a thread" topic to death, one can get really confused even when looking at relatively simple terms.
    It just seems to me that plain "superscalar" is quite often mistakenly taken as "OoO superscalar" or - "dynamic superscalar".
     
  12. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    OoO and dynamic are terms that are fully orthogonal to whether a design is superscalar.

    The great "thread" debate centers on a weakening of language that I do not see a parallel for in the usage of superscalar.
    That debate was a question over whether a given entity in a set implementation counted as a thread.

    It has been accepted any scheme that extracts ILP by fetching, issuing, and executing multiple instructions per cycle is superscalar, and this has been an acceptable usage for designs that have been in-order, VLIW, EPIC, OoO for decades.
     
  13. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,112
    Location:
    New York
    Mr Demers seems to disagree.

    Also, by your own definition I don't see how VLIW qualifies. After all the hardware is only fetching and decoding a single instruction isn't it?
     
  14. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    Disagree on what? That AMD units are VLIW superscalar?
     
  15. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    Some early VLIWs didn't even decode, the instruction word was the set of command signals that would have come out of a decoder, if it were present.

    I'll leave the long-instruction word items off my list if they don't fit.
     
  16. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,112
    Location:
    New York
    Yeah he makes the distinction here.

    bridgman, do you work for AMD? I see you refer to them as "us" over at Phoronix.
     
  17. FrameBuffer

    Banned

    Joined:
    Aug 7, 2005
    Messages:
    499
    Likes Received:
    3
    to be fair the quote: "Eric: Actually, it's not really superscalar...more like VLIW", doesn't explicitly say otherwise,.. in particular the "it's not really" and "like VLIW".
     
  18. bridgman

    Newcomer Subscriber

    Joined:
    Dec 1, 2007
    Messages:
    62
    Likes Received:
    123
    Location:
    Toronto-ish
  19. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    Now that I've had a night to sleep on it and review that section of Patterson and Hennessy, I admit that my ealier VLIW/superscalar confusion was some kind of brain fart. The distinction has been made between the two methods of extracting parallelism from the instruction stream.
     
  20. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,055
    Likes Received:
    3,112
    Location:
    New York
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...