I thought it was on a HotChips presentation, but I didn't see any mention in that PDF. We knew the instruction/constant caches/buffers increased without specifics. Vega had 45MB total SRAM (mentioned by Mantor), but limited reference from Polaris/Fiji to extrapolate.The general layout is similar to Fiji, but I see the instruction and constant caches are 50% up in size for Vega. Was that mentioned in some document or paper?
Not sure if that was the source, but I did mention this in our RX Vega Q&A - but under another premises: Transistor count.The general layout is similar to Fiji, but I see the instruction and constant caches are 50% up in size for Vega. Was that mentioned in some document or paper?
I can count up to 30MB of SRAM in Vega for all known pools. I think the rest is in the front-end macro block in between the multiprocessor array. In that region, I can assume max 1 or 2 MB of parameter cache for sure in there, but there's a lot more visible.Vega had 45MB total SRAM (mentioned by Mantor), but limited reference from Polaris/Fiji to extrapolate.
The bottom-right looks like the SERDES phy used in Zen, so it must be used for the PCIe interfacing. AMD implemented it in Polaris too.hm.... top-right = display, bottom right = PCI-E ?
hm.... top-right = display, bottom right = PCI-E ?
That's roughly the number reached last time we attempted to add everything up. Still a good chunk of SRAM unaccounted. I can't recall where, but there was some anecdotal evidence the parameter cache may be larger than 1-2MB. That could make sense depending on how NGG was implemented in hardware. There would exist extra metadata for culling that might be outside of the normal paths in addition to simply having a larger cache for vertices.I can count up to 30MB of SRAM in Vega for all known pools. I think the rest is in the front-end macro block in between the multiprocessor array. In that region, I can assume max 1 or 2 MB of parameter cache for sure in there, but there's a lot more visible.
I believe AMD still supplies the drivers, it's just an Intel skinThat mixed Intel/Radeon driver gave me shivers to be honest. One can only wonder how long it will take for Intel to update their driver releases. .
There needs to be some magic for the shared TDP between the cpu and gpu. Albeit that doesn't rule out a standard AMD driver - presumably there's just some intel driver which monitors and adjusts the power bits of the gpu via some more or less official AMD api...I believe AMD still supplies the drivers, it's just an Intel skin
So Vega M here:
http://playwares.com/pcreview/56285625#
If AMD keeps this ratio of memory/CU's/ROP's of Vega M specs going forward and they can push past what appear to be previous unit count limits things might not look so "bad" for a navi thats still heavily on the traditional GCN path.
Goddamn I want one of those so bad. So lewd with all those ports too
Performance is right where I expected for a 3.5 to 4.0 TFLOP (with OC) AMD GPU.
Seems to me to be performing significantly above what "traditionally configured" 3.5 to 4 TFLOP amd cards would perform. Take RX 580 as an example its only somewhere like 15% behind it and has 66% of the FLOPS. More ROPs per ALU and more bandwidth per FLOP seem to result in significantly better performance per flop and per watt.
The most important factor isn't performance per flop, but performance per square milimeter. The ROPs are quite big and as a result performance per square milimeter is very close to Polaris 10. Those benchmark results are in the range of Tonga / R9 380X and Polaris 10 / RX 480; quite close to RX 470 at the average. I can't see any change in efficiency with the exception of performance per watt.Seems to me to be performing significantly above what "traditionally configured" 3.5 to 4 TFLOP amd cards would perform. Take RX 580 as an example its only somewhere like 15% behind it and has 66% of the FLOPS. More ROPs per ALU and more bandwidth per FLOP seem to result in significantly better performance per flop and per watt.
i completely disagree, people dont buy based on performance per mm.The most important factor isn't performance per flop, but performance per square milimeter. The ROPs are quite big and as a result performance per square milimeter is very close to Polaris 10. Those benchmark results are in the range of Tonga / R9 380X and Polaris 10 / RX 480; quite close to RX 470 at the average. I can't see any change in efficiency with the exception of performance per watt.
I can't see any change in efficiency with the exception of performance per watt.