Linpack isn't the best case for GPUs, specifically nvidia GPUs. More GPU friendly algorithms definitely exist. And there are even problems, which run with higher efficiency on GPUs than on CPUs.So far, all the data points to the GPUs having even worst efficiency ratios compared to CPU when doing things that are not linpack. solving huge systems of linear equations is basically best case for GPUs.
If you start that way, I would recommend you to first getting a dictionary to look up the meaning of the word "scalar" and after that have a look at the history how the whole thing of scalar vs. vector vs. superscalar vs. VLIW vs. whatever developed.Superscalar has nothing to do with data. your whole entire argument is from a perspective of ignorance and invalid. I would suggest some reading on actual computer architecture and terminology.
Or I will shut up if you explain, your next statement a bit more in detail:
How didn't the individual instructions operate on scalar data for the CPUs in the last 20+ years besides the SIMD extensions I specifically mentioned (and which appeared later in mainstream CPUs)?No, not really. Superscalar and uniscalar refer to ISSUE not data. By your own argument, superscalar and uniscalar never applied to any CPU that has shipped in the last 20+ years as well...
Or to make it clearer, uniscalar (single issue) and superscalar (multi issue) refer of course to the instruction issue, but the scalar part originates from the scalar vs. vector processor question (clearly referring to the data the instructions operate on). As superscalar processors are normally seen as an evolution of (uni)scalar processors, they simply inherited this property. "Super" as a word simply means above or beyond and designates the issue of multiple instructions in an otherwise scalar ISA (opposed to VLIW, where the ISA and the compiler provide the means for issuing multiple parallel operations, superscalar is an µarchitectural implementation detail of a scalar ISA which is transparent). And in complete analogy there are also single issue, multi issue, and also VLIW vector processors as well as there are multi issue CPUs with a mixed scalar/vector instruction set (the majority right now). Those things are basically orthogonal properties of a processor.
Last edited by a moderator: