Jawed
Legend
The co-issue, MUL and dependent-ADD, I don't really understand. Is that producing two resultants in one cycle, i.e. the result of the MUL and the result of the MUL+ADD?There is slightly better co-issue, slightly better precision for FP, an additional instruction, etc.
There's been a patent application for something like the SAD instruction for years now. Can I be bothered to dig it up and check it out?...
This is important for addressing arithmetic, so will produce a very welcome speed-up in compute.Then there is the reduced-precision integer math on the slim units that appears to be a step up from RV770, where only shifts were available.
It's vastly more expensive than the double-precision arithmetic, so could be waiting a long time.This might mean that as time drags on that AMD might inch towards 32-bit integer operations across all units in the SIMD, as process nodes afford more space.
20 clusters instead of 10?I'm still waiting on what has doubled the complexity of Evergreen's scheduler.
Jawed