Missing from the diagram is the interconnect or even whether there is more than one APU per node.
Wouldn't this part, "the EHP is coupled to a second level of off package memory" hint at more than one APU per node?
Missing from the diagram is the interconnect or even whether there is more than one APU per node.
Wouldn't this part, "the EHP is coupled to a second level of off package memory" hint at more than one APU per node?
It is highly unlikely that this is in any way a valid way to "advertise" yourself for a governmental project (or any non-hobbyist work really). I know it is commonly accepted wisdom in the great Internet echo-chamber that governmental purchases are done by barely literate navel-gazers that need NICE BIG LETTERS TO UNDERSTAND ANYTHING, but the purchasing/financing process is a bit more involved. You'd better hope that AMD "advertised" itself actively for a pretty long time to even be in the running, and not by way of fluffy papers. In no small part because Bill Dally and his guys have been talking about pretty similar things for many years now. And Intel's Xeon Phi is narrowing down on those ideals while being an in-the-silicon product, as opposed to a vague "this is awesome" theoretisation.Like I said, this just looks like an advertisement for AMD to be a part of the US Government's new exascale computer project, rather than any actual project. "Hey look what we could do if we had money!" and the title of the paper says EXASCALE COMPUTING in nice big letters so the government wonks can understand it. Considering the amount of press this got, I'd say it's a pretty cheap marketing success for AMD so far.
Like I said, this just looks like an advertisement for AMD to be a part of the US Government's new exascale computer project, rather than any actual project. "Hey look what we could do if we had money!" and the title of the paper says EXASCALE COMPUTING in nice big letters so the government wonks can understand it. Considering the amount of press this got, I'd say it's a pretty cheap marketing success for AMD so far.
As far as I have read, PIM in this concept was meant for the high-capacity, off-package layer of memory in the first place.AMD's using an 8-stack interposer does go against what the PIM proposal had, for a number of reasons. Part of the PIM's benefit was that it adopted an HMC-like interface with the GPU and negated the need for an interposer.
Having an interposer wouldn't necessary mean you couldn't have PIM, but it takes away a benefit that might have pushed it over the top if other problems like software complexity and cost to implement outweighed its performance benefit in the workloads it worked well in.
Additionally, the PIM's projected power ceiling, coupled with there being 8 stacks, would devote a huge chunk of the node's power budget to the PIM stacks.
Clarification: The EHP paper itself didn't explicitly tell, but the prior work is AFAIR going along this direction.As far as I have read, PIM in this concept was meant for the high-capacity, off-package layer of memory in the first place.
The TOP-PIM paper placed PIM in opposition to HBM and WideIO, and it was originally evaluated as the primary memory pool.As far as I have read, PIM in this concept was meant for the high-capacity, off-package layer of memory in the first place.
https://asc.llnl.gov/fastforward/AMD-FF.pdfThe TOP-PIM paper placed PIM in opposition to HBM and WideIO, and it was originally evaluated as the primary memory pool.
It was also evaluated in terms of being placed under a stack of DRAM, which was a significant constraint on its power ceiling.
A non-volatile standard might be able to accept a higher power budget, although if that happened it would increase the amount of power budget taken away from the central APU.
The original proposal is years-old at this point, so it might predate AMD's current belief that it needs a tiered memory pool. It does seem to predate the idea that the off-package memory would be non-volatile.
That could be an evolution of the concept, although the paper cites the original concept.https://asc.llnl.gov/fastforward/AMD-FF.pdf
This also worths a read.