I've been trying to think why bother with a DP pipeline, if, as everyone says, you can do enough MADDs in the same time.darkblu said:jawed,
dp is strictly horizontal. if you had a vertical dp (i.e. multi-vector dp) that would have been a madd/macc, not a dp.
Do misaligned instructions occur often enough in the instruction stream to warrant the extra expenditure in resources to build a dedicated DP unit into the processor?Jawed said:From the MPR article on Cell.
On that basis, a dedicated DP pipeline makes sense, doesn't it
Jawed said:Pages 19 and 20 describe PPE. But the floating point portion of the pipeline is missing in action, sadly. I don't know of any document that describes the entire PPE pipeline.
Jawed
Jawed said:I've been trying to think why bother with a DP pipeline, if, as everyone says, you can do enough MADDs in the same time.
Guden Oden said:Do misaligned instructions occur often enough in the instruction stream to warrant the extra expenditure in resources to build a dedicated DP unit into the processor?
Seems not, from looking at what was actually built, rather than compared to peoples' preferred ideal wet dream architecture...
Sentence seems mangled - are you saying "reduces code size and likely increases execution time, on a lot of game code" or "decreases"?ERP said:DP is probably the single most common vector instruction in or outside graphics, providing it as a core instruction, reduces code size and like increases execution time, on a lot of game code.
I get what you're saying, but didn't IBM state DP was difficult to implement in high-speed FPUs, at least without it getting very expensive hardware-wise?ERP said:DP is probably the single most common vector instruction in or outside graphics, providing it as a core instruction, reduces code size and like increases execution time, on a lot of game code.
Guden Oden said:I get what you're saying, but didn't IBM state DP was difficult to implement in high-speed FPUs, at least without it getting very expensive hardware-wise?
It's one thing what's really convenient for programmers, and another what's realistic to implement unfortunately - much to the chagrin of all hardware geeks.
Seems like M$ asked IBM to disregard the hardware expense and make a seriously long pipeline...Guden Oden said:I get what you're saying, but didn't IBM state DP was difficult to implement in high-speed FPUs, at least without it getting very expensive hardware-wise?
Jawed said:So, is this DP faster or slower than executing a DP as MADDs?
Jawed
ERP said:Depends if you can group them into sets of 4 trivially, cost should be more or less the same in that case. In graphics it's usually not very hard to do the grouping, in other areas it can be difficult or impossible.
This is why in general peak numbers have no real bearing on real numbers.
blakjedi said:What does it mean when you guys use the term "trivial" to describe operations, shading etc...?