Nvidia: BlueField family of DPUs for data centers

Nvidia unveils new BlueField family of DPUs for data centers

https://seekingalpha.com/news/36198...tm_campaign=rta-stock-news&utm_content=link-3

From the article:

During today's GPU Technology Conference, Nvidia (NASDAQ:NVDA) announces DPUs, or data processing units, a new kind of "data-center-infrastructure-on-a-chip-architecture."

The company outlined a three-year DPU roadmap, including the new BlueField-2 of DPUs and the DOCA SDK for building applications.

Nvidia says one BlueField-2 DPU can deliver the same data center services that could consume up to 125 CPU cores.

Server manufacturers including Lenovo, ASUS, and Dell plan to integrate Nvidia DPUs into their enterprise server offerings.

VMware (NYSE:VMW) is working with Nvidia as part of its recently announced Project Monterey to deeply integrate Kubernetes into the vSphere.

IBM's (NYSE:IBM) Red Hat will offer DPU support through its open hybrid cloud portfolio components Red Hat Enterprise Linux and Red Hat OpenShift.

Check Point (NASDAQ:CHKP) will integrate Bluefield-2 DPU in its cybersecurity technologies.

Any speculation on what Bluefield DPUs really are?
 
There, we said that the endgame was to put GPUs on the network, potentially bypassing traditional x86 servers. With today’s BlueField-2 and BlueField-2X launch, along with DOCA software, NVIDIA showed its vision and roadmap for the future clearly. This includes DPUs that combine Arm compute cores, NVIDIA GPU IP, and Mellanox derived networking. At GTC 2020 (#2) we get the roadmap to a unified server DPU SoC.


NVIDIA-BlueField-2X-DPU-Overview.jpg



https://www.servethehome.com/nvidia-shows-dpu-roadmap-combining-arm-cores-gpu-and-networking/
 
NVIDIA Shows DPU Roadmap Combining Arm Cores GPU and Networking

https://www.servethehome.com/nvidia-shows-dpu-roadmap-combining-arm-cores-gpu-and-networking

In our piece covering NVIDIA announcing its intent to acquire Mellanox, we did not hide where we thought that acquisition signaled as a direction for the company. See our NVIDIA to Acquire Mellanox a Potential Prelude to Servers. There, we said that the endgame was to put GPUs on the network, potentially bypassing traditional x86 servers. With today’s BlueField-2 and BlueField-2X launch, along with DOCA software, NVIDIA showed its vision and roadmap for the future clearly. This includes DPUs that combine Arm compute cores, NVIDIA GPU IP, and Mellanox derived networking. At GTC 2020 (#2) we get the roadmap to a unified server DPU SoC.

...

Looking ahead, we see in 2022 that we will have another generation. This time we will move to 400Gbps networking. While a PCIe Gen4 x16 link can handle a 200Gbps interface, it cannot handle 400Gbps. As we discussed in The 2021 Intel Ice Pickle How 2021 Will be Crunch Time around late 2021 or early 2022 we expect to see PCIe Gen5. We can thus assume we will see a ConnectX-7 with 200Gbps/ 400Gbps networking and PCIe Gen5 support by 2022. This will be required to hit the BlueField-3 performance targets in 2022. We can also see about a doubling of performance. That means updated and perhaps more Arm cores. Some of NVIDIA’s DPU competitors are already pushing ahead with 16 core designs in this generation with newer cores.

We can also expect a PCIe Gen5 generation GPU from NVIDIA in 2022. This DPU roadmap chart is effectively tipping that update as well. That new PCIe Gen5 GPU will pair nicely with an updated DPU to make BlueField-3X. We can see there is still a PCIe link between the DPU and GPU. This will also be the era we get CXL so that opens some interesting options in this generation as well.

In 2023, we get to what is the near-term endgame. BlueField-4. We no longer see an “X” and something else is missing. There is no link between the DPU SoC and the GPU die. This is the vision we first discussed when we discussed BlueField (1) in the context of the Mellanox acquisition. NVIDIA is moving to deliver accelerated DPUs using CUDA GPU cores, Arm Cores, Mellanox Networking IP all on a single package.

NVIDIA BlueField-4 is being shown as a 2023 product but with a SPECINT number well beyond today’s x86 CPUs (assuming this is CPU2017 being used.) As you can see from the above chart, NVIDIA is effectively claiming that we will see a BlueField-4 DPU with the equivalent performance of around four AMD EPYC 7742 2019 generation processors.

Final Words

Overall, this is an extremely exciting development. This is how NVIDIA can start to move its GPUs from accelerators within servers to resources on the network managed by DPUs. For large data centers, this is the model we are moving towards. It is also impressive that NVIDIA has plans to move the integer performance higher by over 14x by 2023. That is an enormous gain in only three years.

The other side to getting a 14x gain in three years on the Integer side, assuming it is coming from the Arm CPU cores, is that NVIDIA basically needs something that is faster than today’s x86 CPUs. That makes the CPU cores in the BlueField-4 SoC more than just an offload engine in three years.

NVIDIA is already discussing how this type of card will push not just data center applications, but will also bring the EGX Edge AI platform to reality. NVIDIA can use these DPUs to deliver AI acceleration to the edge with products such as the BlueField-2X.

At STH we have been following the BlueField line since 2017. It started as a storage acceleration SoC. Our best bet is that over the next 3 years, NVIDIA is going to add to its impressive AI (CPU), Arm (CPU), and Networking (Mellanox) stable by adding a software-defined storage solution that can run atop future BlueField processors. Stay tuned for that story.
 
Back
Top