Benchmarks aren't the same as real world use, but they can give a good idea of what’s to come, and Nvidia's Hopper GPU performance is impressive. Nvidia has released performance data for its forthcoming Hopper generation of GPUs, and the initial benchmarks are tremendous. The metrics are based on MLPerf Inference v2.1, an industry-standard benchmark that analyzes the performance of inferencing tasks using a machine-learning model against new data. Nvidia claims its Hopper-based H100 Tensor Core GPUs delivered up to 4.5x greater performance than its previous A100 Ampere GPUs. (Read more about Hopper: Nvidia unveils a new GPU architecture designed for AI data centers) It’s a remarkable jump in just one generation. For comparison, CPU benchmarks often grow 5% to 10% from one generation to the next. Nvidia’s performance leap comes with a caveat, however. The 450% boost came on a single benchmark; there were a total of six benchmarks run. The other benchmarks yielded at or below two-fold improvements. Still, a doubling of performance in one generation is impressive. The top gains came on the BERT-Large benchmark, which measures natural-language processing of the BERT AI model developed by Google and used in Google’s search engine, among other things. Nvidia says the BERT performance leap is due to Hopper’s Transformer Engine, which is specifically designed to accelerate training transformer models. Ampere isn’t the only older Nvidia technology getting trounced. The company also benchmarked Jetson AGX Orin, its Ampere-based SoC for robotics and edge systems and a replacement for the Jetson AGX Xavier processor. In those tests, Orin ran up to 5x faster than Xavier while delivering an average of 2x better energy efficiency. But I’m not writing the Ampere A100 obituary just yet. Thanks to improvements in Nvidia’s AI software, it is saying MLPerf figures for the Ampere have advanced by 6x since the A100 was first benchmarked two years ago. Orin is available now. Hopper, which was first introduced in March, is due later this year. Related content news AMD holds steady against Intel in Q1 x86 processor shipments finally realigned with typical seasonal trends for client and server processors, according to Mercury Research. By Andy Patrizio May 22, 2024 4 mins CPUs and Processors Data Center news Broadcom launches 400G Ethernet adapters The highly scalable, low-power 400G PCIe Gen 5.0 Ethernet adapters are designed for AI in the data center. By Andy Patrizio May 21, 2024 3 mins CPUs and Processors Networking news HPE updates block storage services The company adds new storage controller support as well as AWS. By Andy Patrizio May 20, 2024 3 mins Enterprise Storage Data Center news ZutaCore launches liquid cooling for advanced Nvidia chips The HyperCool direct-to-chip system from ZutaCore is designed to cool up to 120kW of rack power without requiring a facilities modification. By Andy Patrizio May 15, 2024 3 mins Servers Data Center PODCASTS VIDEOS RESOURCES EVENTS NEWSLETTERS Newsletter Promo Module Test Description for newsletter promo module. Please enter a valid email address Subscribe