Supermicro's NVIDIA HGX B200 Systems Lead MLPerf Inference Benchmarks

Supermicro's NVIDIA HGX B200 systems have achieved top performance in the MLPerf Inference v5.0 benchmarks, generating more than three times the tokens per second of the previous generation.

Supermicro has announced that its NVIDIA HGX B200 systems achieved industry-leading performance in the MLPerf Inference v5.0 benchmarks, according to a press release. The systems, featuring 8-GPU configurations, delivered more than three times the token generation per second of previous-generation systems.

The benchmarks highlighted the performance of both air-cooled and liquid-cooled systems, with the air-cooled B200 system matching the liquid-cooled system's performance within normal operating margins. Supermicro's systems posted strong results across a range of benchmarks, including Llama2-70B and Llama3.1-405B, with significant gains in token generation rates.

Supermicro's systems, such as the SYS-421GE-NBRT-LCC and SYS-A21GE-NBRT, took top positions in several categories, including the Mixtral 8x7B mixture-of-experts inference benchmark. Both the air-cooled and liquid-cooled NVIDIA B200 systems generated over 1,000 tokens per second on large models such as Llama3.1-405B.

The company continues to offer a comprehensive AI portfolio of over 100 GPU-optimized systems, with both air-cooled and liquid-cooled options to meet diverse workload requirements. Supermicro's collaboration with NVIDIA ensures that its systems remain at the forefront of AI performance and innovation.
