GH100 got the Gaming-Ampere SMs. So you have to compare Hopper to Volta. For example A100 FP32 and FP64 went only up 30% while the transistor budget increased by 2.6x.
AD102 has 16x more L2 cache, 71% more compute units, 71% more GPCs (rasterizer and ROPs), improved RT Cores (2x the triangle intersection, two new hardware features), new geometry processing (Micro-Meshes), improved TensorCores, new shader reordering function, new optic flow accelator, new video encoder, 40%+ higher clocks and i guess a lot of the Hopper's compute features. Looking at the raw performance of a "full" AD102 it doesnt look so bad.
AD102 has 16x more L2 cache, 71% more compute units, 71% more GPCs (rasterizer and ROPs), improved RT Cores (2x the triangle intersection, two new hardware features), new geometry processing (Micro-Meshes), improved TensorCores, new shader reordering function, new optic flow accelator, new video encoder, 40%+ higher clocks and i guess a lot of the Hopper's compute features. Looking at the raw performance of a "full" AD102 it doesnt look so bad.
Last edited: