DavidGraham
Veteran
Elon Musk is builing a new AI project: a gigafactory comprising of a 100K H100 GPUs, he couldn't wait for H200 or B100/B200.
How about they give us fully bindless (REAL pointers in shaders) hardware too because the recent Marvel's Spider-Man games in RT mode do ~1 million descriptor updates/copies per frame and SM6.6 style dynamic resource binding model (ResourceDescriptorHeap/SamplerDescriptorHeap) isn't good enough for that purpose ...
It'd be a nice bonus as well to support RGBE encoded render target formats as well because Xbox developers have been raving for this feature too even though it's only available on recent Xbox consoles ... (it was that good to expose a PC D3D12 extension for it even if only one HW vendor currently supports it) ...
Nvidia h/w support RGBE just fine.lack of RGBE
Nvidia h/w support RGBE just fine.
A probable issue for work graphs /w mesh nodes on their implementation is that there's going to be potentially a lot of transitions between compute/mesh nodes which will cause WFI/subchannel switches (implict barriers) ...Sorry I meant what more should they be doing on work graphs to meet your bar.
If you look at his profiling data you'll notice that one hotspots involves the CopyDescriptorsSimple API so now that we games (Monster Hunter Rise exhibits similar issues) that are going against their one of API usage recommendations I think it's time that we stop kneecapping other architectures that are capable of loading descriptors from memory directly. D3D12's descriptor 'heap' abstraction model has not aged well in the bindless era ...What’s the source for the Spider-man descriptor issue? Is it being handled differently/better on other architectures?
https://wccftech.com/nvidia-mediate...computex-2024-setting-a-new-era-of-computing/
Computex 2024 will be a huge deal for the AI PC segment, as MediaTek & NVIDIA are expected to present their AI PC SOC solution at the event.
It's a shame about that SVE/2 implementation will still only be 128-bits. Just means that no ARM vendors are actually serious behind the idea of being an alternative to modern AAA PC gaming ...MediaTek uses off the shelf ARM designs so any secret sauce would likely come from Nvidia. Will be interesting to see how well the arch scales down. The Windows on ARM AI PC market is non-existent but they must know something we don’t. If Intel can deliver with Lunar Lake this ARM renaissance will be over before it begins.
one thing is interesting, although I don´t know if it´s true, but is suppose to be manufactured at Intel fab using Intel 3 processMediaTek uses off the shelf ARM designs so any secret sauce would likely come from Nvidia. Will be interesting to see how well the arch scales down. The Windows on ARM AI PC market is non-existent but they must know something we don’t. If Intel can deliver with Lunar Lake this ARM renaissance will be over before it begins.
it looks like chinese cooking festival meeting
Blackwell Ultra and X800Ultra is next "year".nVidia will increase the NVLink speed to 3.6 TB/s next year with Rubin. Was there ever another company doing such an leap with networking? Since 2016 with Pascal the speed would have gone up from 300 GB/s to 3.6 TB/s...
Yeah, missed the year.Blackwell Ultra and X800Ultra is next "year".
Rubin, Vera, and X1600 is the "year" after, 2026(?) which had the 3.6TB/s NVLink 6.