Can I get my hopes up a little or will they be crushed again, that's all I want to know. :/
You can, just so long as you don't jinx it for the rest of us
Can I get my hopes up a little or will they be crushed again, that's all I want to know. :/
yeah, i didn´t notice they already showed slimmer design.That's what it is.
Are people still actually using PSUs where PCIe leads have something else than 8 or 6+2 plugs?A bit of a random question: if a card has 2 x 8pin connectors while it has a TGP of 220W-250W, do we really need to plug in 2 x 8pin? Could it work with 1 x 6 pin + 1 x 8 pin?
A bit of a random question: if a card has 2 x 8pin connectors while it has a TGP of 220W-250W, do we really need to plug in 2 x 8pin? Could it work with 1 x 6 pin + 1 x 8 pin?
Most likely the card will check that both are plugged in properly
Are people still actually using PSUs where PCIe leads have something else than 8 or 6+2 plugs?
Most likely the card will check that both are plugged in properly even if 8+6 would be enough for the power draw
Unless you got paperclips! Paperclips can easily make your 6-pin connectors 8-pin compatibleIn general, all power connectors have to be plugged into the GPU even if it does not really need it (i.e. my V56 Nitro+ LE has 3x8 pin, it never goes over 260W at stock settings, but it won't work without the third 8 pin).
Well, you can also use those 2xmolex => 8 pin cables, but they are very shoddy in general (and the PSUs which don't have the required connectors will probably die as my 600W, 9 years old FSP PSU did after a month with heavily OC'd Vega), so personally I'd not go with thatPaperclips can easily make your 6-pin connectors 8-pin compatible
The 256 MiB BAR has been exposed for a while already in Vulkan
Microsoft WDK documentation on GPUMMU states the 256 Mbyte limit is simply a default value set by device firmware to fit into 32-bit virtual adress space, and Vulkan is implemented as a user-mode driver which has to work through WDDM 2.0 kernel-mode driver (DXGK).The Ryzen 5000/RX 6000 series (Zen 3/RDNA 2) enables the host to seamlessly stream data into any memory region on the device by removing this 256MB limit
knowing that the host is PCI-E 4.0 and the GPU is PCI-E 4.0 ... might enable slightly different optimisations in the driver stack etc...
all non-driver allocated memory is CPU-cacheable and hence you need to enable cache-snooping for that memory on the GPU, which makes the GPU accesses to this memory slower. When you allocate using the driver you can choose for cacheable or non-cacheable. (see the 2 memory types for heap 1)
From a development standpoint would studios to need to change to specific CPU+GPU combinations? Or is it a "modify and forget" type code change not requiring any testing or validation?Yea it's not anything proprietary, just a slight performance increase similar to smartshift in laptops. And AMD is obviously trying to incentivize people to buy their own CPU+GPU combinations, nothing wrong with it. The competition are free to do something of their own.
Well, you can also use those 2xmolex => 8 pin cables, but they are very shoddy in general (and the PSUs which don't have the required connectors will probably die as my 600W, 9 years old FSP PSU did after a month with heavily OC'd Vega), so personally I'd not go with that
As per AMD it should provide some gains without any optimization from the developer end, but can provide even bigger gains if specifically optimized for. From what we've seen so far, it doesn't seem to be something than can't be enabled on Intel & Nvidia, so it might be possible on those platforms in the future. We should find out more information during the architecture deep dive around the launch.From a development standpoint would studios to need to change to specific CPU+GPU combinations? Or is it a "modify and forget" type code change not requiring any testing or validation?
Yes, it's not about the bandwidth, it's about cache coherence.
If the GPU can use system memory just like its own local video memory (and vice versa, the CPU can use local video memory as if it was system memory), you have to either synchronise the GPU and CPU caches using some cache coherence protocol over PCIe bus - preferably something more complex than bus snooping - or completely disable caching for this physical memory pool, with a detrimental effect on perfromance.
So is that what SAM is actually doing then? Allowing each device to see the others memory pool as if it were it's own and keeping the caches between both CPU and GPU coherent? So this becomes similar to a UMA?
Microsoft WDK documentation on GPUMMU states the 256 Mbyte limit is simply a default value set by device firmware to fit into 32-bit virtual adress space, and Vulkan is implemented as a user-mode driver which has to work through WDDM 2.0 kernel-mode driver (DXGK).
The PCIe standard supports BAR Size from 1 MB to 512 GB.
I don't think PCIe Resizable BAR is exclusive to RDNA architecture either; need to look into Linux driver code for a list of supporting hardware though.
Allowing each device to see the others memory pool as if it were it's own and keeping the caches between both CPU and GPU coherent? So this becomes similar to a UMA?
People interpreted it as support for Resizable BAR and PCIe 4.0 bandwidth - but AFAIK these two technologies are not really exclusive to RDNA2 (RX 6000) and Zen3 (Ryzen 5000).In conventional Windows-based PC systems, processors can only access a fraction of graphics memory (VRAM) at once, limiting system performance. With AMD Smart Access Memory, the data channel gets expanded to harness the full potential of GPU memory - removing the bottleneck to increase performance.
Only for system memory pools, but not for local video memory pools.No, the caching stuff was available before this already since forever (pre-GCN at least)
NVMe is a block I/O protocol for disk devices - it uses LBA sector numbers to access disk data, which are remapped to actual flash memory addresses by the NVMe controller. It can use PCIe memory mapping for the optional Host Memory Buffer (HMB) feature in entry-level DRAM-less controllers, but flash memory is not visible to the host.So what is stopping you from ... using a 512GB NVME drive and ... having SAM reconfigure the BAR to include some of the NVME drive
Interesting. I wondered what that toggle was for in the Intel X99 MB bios.Above 4G decoding option is available on all AM4 motherboards.