AMD Vega 10, Vega 11, Vega 12 and Vega 20 Rumors and Discussion

Kaotik · Aug 27, 2017

digitalwanderer said:
Hey, I would take David Kanter's best guess over anyone else's facts! The man knows what he's talking about, and I would count his estimates as extremely accurate.

I could agree if he was talking about 2x 8-Hi, but $150 for 2x 4-Hi? I don't care if he hand-crafted the chips, I still wouldn't believe him without an actual bill of materials + work.

Rootax · Aug 27, 2017

If you don't know what height your chip will be, how will watercooling companies like EK or XSPC provide a waterblock (and EK already does)?

Deleted member 13524 · Aug 27, 2017

Is a 0.1mm difference in height enough to stop thermal paste from working properly?

Rootax · Aug 27, 2017

ToTTenTranz said:
Is a 0.1mm difference in height enough to stop thermal paste from working properly?

I was thinking more about mounting pressure, stuff like that. If the chip is higher than anticipated, maybe it could damage it, I don't know...

CarstenS · Aug 27, 2017

Tom's Hardware has an opinion on this matter, complete with slides from AMD illustrating the differences. It is a couple of days old, but apparently not everyone has had a chance to take a look at it.
http://www.tomshardware.com/news/amd-vega-package-problem,35281.html
(This is also where they cite a source saying that Vega 56's memory comes from SK Hynix.)

Rootax · Aug 27, 2017

CarstenS said:
Tom's Hardware has an opinion on this matter, complete with slides from AMD illustrating the differences. It is a couple of days old, but apparently not everyone has had a chance to take a look at it.
http://www.tomshardware.com/news/amd-vega-package-problem,35281.html
(This is also where they cite a source saying that Vega 56's memory comes from SK Hynix.)

Thanks!

Deleted member 2197 · Aug 27, 2017

Kaotik said:
I could agree if he was talking about 2x 8-Hi, but $150 for 2x 4-Hi? I don't care if he hand-crafted the chips, I still wouldn't believe him without an actual bill of materials + work.

I think we can speculate the cost for 2x 4HI would be in the neighborhood of $150+ based on this HBM1 breakdown.

https://forum.beyond3d.com/posts/1982501/

Kaotik · Aug 27, 2017

pharma said:
I think we can speculate the cost for 2x 4HI would be in the neighborhood of $150+ based on this HBM1 breakdown.

https://forum.beyond3d.com/posts/1982501/

The same Electroiq site published an analyst estimate saying that the 4 stacks of HBM1 on Fiji cost $48, plus $25 for the interposer and $30 for substrate + packaging. That $150 could be correct if it includes those, but definitely not for just the two 4-Hi HBM2 stacks.

Deleted member 13524 · Aug 27, 2017

Kaotik said:
The same Electroiq site published an analyst estimate saying that the 4 stacks of HBM1 on Fiji cost $48, plus $25 for the interposer and $30 for substrate + packaging. That $150 could be correct if it includes those, but definitely not for just the two 4-Hi HBM2 stacks.

And if 4Hi HBM2 stacks cost 2.5x more than 4Hi HBM1 stacks, then the price for Vega would be (48/2)*2.5+25+30 = 60 + 25 + 30 = $115
And this is assuming the interposer costs the same between Fiji and Vega 10, though being smaller it should be cheaper too. Packaging for half the chips should be significantly cheaper too.

Regardless, $115 is a far cry from the $175 cost that Gamers Nexus released, and that later in the article went magically up to $200.

Now if that value refers to the 8Hi stacks in Vega FE, then it makes a lot more sense.
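
For anyone who wants to vary the assumptions, the back-of-envelope estimate above can be sketched in a few lines of Python. The Fiji figures ($48 for four HBM1 stacks, $25 interposer, $30 substrate + packaging) are the analyst numbers quoted in the thread; the 2.5x HBM2 premium is only the hypothetical from this post, and the function and parameter names are made up for illustration.

```python
def memory_system_cost(
    hbm1_cost_four_stacks=48.0,  # analyst estimate for Fiji's 4 HBM1 stacks
    stacks=2,                    # Vega 10 uses two stacks
    hbm2_premium=2.5,            # ASSUMPTION: HBM2 costs 2.5x HBM1 per stack
    interposer=25.0,             # assumed unchanged from Fiji
    packaging=30.0,              # assumed unchanged from Fiji
):
    """Rough memory-system cost in dollars under the stated assumptions."""
    per_stack_hbm1 = hbm1_cost_four_stacks / 4
    hbm_cost = per_stack_hbm1 * stacks * hbm2_premium
    return hbm_cost + interposer + packaging


# Fiji baseline: four HBM1 stacks at no premium.
fiji_total = memory_system_cost(stacks=4, hbm2_premium=1.0)

# Vega 10 guess: two 4-Hi HBM2 stacks at the assumed 2.5x premium.
vega_total = memory_system_cost()

print(f"Fiji: ${fiji_total:.0f}, Vega 10 (2x 4-Hi): ${vega_total:.0f}")
```

Lowering `interposer` or `packaging` models the point that a smaller interposer and half the stacks should cost less than on Fiji.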

xEx · Aug 27, 2017

Kaotik said:
The same Electroiq site published an analyst estimate saying that the 4 stacks of HBM1 on Fiji cost $48, plus $25 for the interposer and $30 for substrate + packaging. That $150 could be correct if it includes those, but definitely not for just the two 4-Hi HBM2 stacks.

In the video they clearly talk about total cost of implementation not just the stacks.

Deleted member 13524 · Aug 27, 2017

xEx said:
In the video they clearly talk about total cost of implementation not just the stacks.

In the article they say it's 150 for the stacks plus 25 for the interposer:

Regardless, we’re at about $150 on HBM2 and $25 on the interposer, putting us around $175 cost for the memory system.

Alexko · Aug 28, 2017

Jawed said:
Maybe because that's what customers of Google's cloud AI service want for the time being: NVidia's platform within that service? No reason to suppose that is a long term prospect.

Why would Google use NVidia for its own internal processes, now that it has TPU v2? In other words, why build TensorFlow and TPU? Do you think it's one of those beta things that Google will abandon after a couple of years?

For one thing, Google's TPU has limited FP32 capabilities and no FP64 support that I know of, which makes it great for inference but of limited use for training, assuming it's even usable at all for such purposes.

CarstenS · Aug 28, 2017

Google specifically mentioned DNN training as a design goal for TPU v2. So you could conclude that TPUv1 was about inference only. In fact, its MACs were 8-bit only, while the adders are (FP)32-bit.

Jawed · Aug 28, 2017

Alexko said:
For one thing, Google's TPU has limited FP32 capabilities and no FP64 support that I know of, which makes it great for inference but of limited use for training, assuming it's even usable at all for such purposes.

This was built explicitly to include training

https://www.blog.google/topics/google-cloud/google-cloud-offer-tpus-machine-learning/

so perhaps you'd like to explain why it is of limited use?

CarstenS · Aug 28, 2017

Maybe it is not clear who's talking about TPUv1 and who about TPUv2?

Jawed said:
This was built explicitly to include training

https://www.blog.google/topics/google-cloud/google-cloud-offer-tpus-machine-learning/

so perhaps you'd like to explain why it is of limited use?

That blog is about TPUv2 and specifically mentions:

While our first TPU was designed to run machine learning models quickly and efficiently—to translate a set of sentences or choose the next move in Go—those models still had to be trained separately.

Jawed · Aug 28, 2017

Yes, Google is quite explicit that it is replacing GPUs with TPUv2 for training of its own systems. It may be that it's not a complete replacement and that will have to wait until v3 or later. With v2 offered in the cloud, I suppose we'll hear what v2's limitations are.

Anarchist4000 · Aug 28, 2017

Alexko said:
For one thing, Google's TPU has limited FP32 capabilities and no FP64 support that I know of, which makes it great for inference but of limited use for training, assuming it's even usable at all for such purposes.

FP64 support could probably come from the app using host CPUs. I'd think accumulating in FP32 and flushing to system memory for FP64 accumulation would be sufficient for deep learning. There may be some HPC apps where that breaks down, but it's difficult to imagine apps with that large a precision delta for performance-critical work.
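
To make the idea concrete, here is a minimal Python sketch of accumulating in FP32 and periodically flushing into an FP64 total. It is only an illustration of the numerics, not how a TPU or any particular framework does it: FP32 is emulated by round-tripping through `struct`, and `to_f32`, `chunked_sum` and `chunk_size` are hypothetical names.

```python
import struct


def to_f32(x):
    """Round a Python float (FP64) to the nearest FP32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def chunked_sum(values, chunk_size):
    """Sum values in emulated FP32, flushing into an FP64 total every chunk_size items."""
    total = 0.0    # FP64 accumulator, standing in for the host side
    partial = 0.0  # emulated FP32 accumulator, standing in for the device side
    for i, v in enumerate(values, 1):
        partial = to_f32(partial + to_f32(v))
        if i % chunk_size == 0:
            total += partial  # flush the FP32 partial sum into FP64
            partial = 0.0
    return total + partial


# 2**24 is where FP32 can no longer represent every integer, so adding 1.0
# to it in FP32 has no effect.
values = [16777216.0] + [1.0] * 100
exact = sum(values)

fp32_only = chunked_sum(values, chunk_size=len(values) + 1)  # never flushes early
flushed = chunked_sum(values, chunk_size=10)

print(exact - fp32_only, exact - flushed)
```

With these inputs the FP32-only sum drops every one of the small terms, while flushing every 10 items loses only those that share a chunk with the large value. How often to flush is the tuning knob.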

silent_guy · Aug 28, 2017

Alexko said:
For one thing, Google's TPU has limited FP32 capabilities and no FP64 support that I know of, which makes it great for inference but of limited use for training, assuming it's even usable at all for such purposes.

I don't think anyone uses FP64 for training.

CarstenS · Aug 28, 2017

silent_guy said:
I don't think anyone uses FP64 for training.

Maybe if you're in preparation for world championships or olympics?

Alexko · Aug 28, 2017

silent_guy said:
I don't think anyone uses FP64 for training.

What about FP32? It doesn't seem to have support for proper FP32 IEEE 754 ops, unless I missed something. But perhaps it's still usable in many cases. I'm really not knowledgeable enough about deep learning to say more.
