ATI seems to have uploaded the OpenCL beta drivers again but without the 5900 driver ID's in the inf's. I'm guessing they weren't supposed to be letting us know about the 5900 series yet.
To me superscalar architecture is just one type of ILP extraction. VLIW architectures (amd gpu's, Itanium) extracts this at compile time. Dynamic superscalar (pentium and all modern CPUs) extract this at run time.
That's what that whole tiff boiled down to. Whether the definition of superscalar implies dynamic scheduling/issuing of instructions by the hardware to the various available execution units. As far as I know, it does.
I've not seen anyone turn up their nose at a design that can fetch, issue, and execute multiple instructions or the equivalent of multiple independent instructions at once, regardless of method, which is an implementation detail.
When I hear or read someone describe a core as being superscalar, I assume that the design can generally process more than one instruction at a time.
I say generally because designs typically are not set up to support full issue/decode/execution for every combination of instructions possible at their given width, and some are much more limited than others.
I am curious where people would put a design capable of fetching and issuing multiple instructions, with the caveat that the design eschews dependence checking by doing a scalar fetch from multiple threads.
After beating the "what makes a thread" topic to death, one can get really confused even when looking at relatively simple terms.The operative question is whether anyone would be confused by just calling a superscalar core "superscalar".
After beating the "what makes a thread" topic to death, one can get really confused even when looking at relatively simple terms.
It just seems to me that plain "superscalar" is quite often mistakenly taken as "OoO superscalar" or - "dynamic superscalar".
Some early VLIWs didn't even decode, the instruction word was the set of command signals that would have come out of a decoder, if it were present.Also, by your own definition I don't see how VLIW qualifies. After all the hardware is only fetching and decoding a single instruction isn't it?
Disagree on what? That AMD units are VLIW superscalar?
Eric: Actually, it's not really superscalar...more like VLIW
Yeah he makes the distinction here.
bridgman, do you work for AMD? I see you refer to them as "us" over at Phoronix.
bridgman, do you work for AMD? I see you refer to them as "us" over at Phoronix.