Microsoft Azure Launches First NVIDIA GB300 NVL72 Supercomputing Cluster for OpenAI

October 10, 2025

Microsoft Azure has introduced the NDv6 GB300 VM series, the world’s first production-scale cluster based on NVIDIA GB300 NVL72 systems, purpose-built for OpenAI’s AI workloads. The deployment integrates over 4,600 NVIDIA Blackwell Ultra GPUs connected through Quantum-X800 InfiniBand networking to accelerate large-scale AI training and inference.

Microsoft Azure Launches First NVIDIA GB300 NVL72 Supercomputing Cluster for OpenAI

Microsoft Azure has introduced the NDv6 GB300 VM series, the world’s first production-scale cluster built on NVIDIA GB300 NVL72 systems, according to a Microsoft announcement. The new cluster, designed for NVIDIA’s Blackwell Ultra GPUs, is optimized for OpenAI’s most demanding inference workloads.

The Azure system integrates more than 4,600 NVIDIA Blackwell Ultra GPUs connected through the NVIDIA Quantum-X800 InfiniBand platform, providing 800 Gb/s bandwidth per GPU. Each rack-scale unit combines 72 Blackwell Ultra GPUs with 36 Grace CPUs, delivering up to 1.44 exaflops of FP4 Tensor Core performance and 37 terabytes of fast memory per VM. The NVLink Switch fabric enables 130 TB/s of bandwidth within each rack for high-speed intra-rack communication.

Microsoft Azure engineered the cluster with custom cooling, power distribution, and reworked software for orchestration and storage to support large-scale AI workloads. NVIDIA noted in its blog that the system is designed for reasoning models, agentic AI, and multimodal generative AI, providing unified memory and compute capacity for frontier model development.

The GB300 NVL72 deployment marks the next phase of Microsoft’s and NVIDIA’s collaboration to scale AI infrastructure globally. Microsoft plans to expand the rollout to hundreds of thousands of Blackwell Ultra GPUs across its data centers to support next-generation AI training and inference at supercomputing scale.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Enterprise AI Brief, Silicon Brief or Daily AI Brief.

Also, consider following us on social media:

Enterprise AI AI Chips & Datacenters AI Brief AI Brief (X)

More from: Enterprise

03/05

Aquant Earns Microsoft Certified Software Designation for Manufacturing AI

03/05

Senhwa Biosciences Signs AI-Driven Drug Development Partnership with CellType

03/05

BonData and Elad Systems Partner on Automated Data Harmonization

03/05

Accel-KKR Takes Majority Stake in Whip Around to Support Growth and Acquisitions

03/05

United Planners Consolidates Compliance and Cybersecurity on SurgeONE.ai Platform

More from: Data Centers

03/05

Huawei’s OceanStor Dorado All-Flash Storage Validated by ESG at MWC 2026

03/05

Tencent Cloud Expands European Infrastructure with New Frankfurt Availability Zone

03/05

Flex Begins U.S. Production of AMD Instinct MI355X AI Platform

03/05

Vertiv Expands PowerBar Track Busway for High-Density AI Data Centers

03/04

SK Telecom, Supermicro, and Schneider Electric Sign MOU for Modular AI Data Centers

Subscribe to Silicon Brief

Weekly coverage of AI hardware developments including chips, GPUs, cloud platforms, and data center technology.

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Categories

Companies

Resources

Microsoft Azure Launches First NVIDIA GB300 NVL72 Supercomputing Cluster for OpenAI

We hope you enjoyed this article.

More from: Enterprise

More from: Data Centers

Subscribe to Silicon Brief

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

You May Also Like

Supermicro Expands AI-RAN and Sovereign AI Infrastructure at MWC Barcelona

OpenAI Raises $110 Billion with Backing from Amazon, NVIDIA, and SoftBank

Cisco Introduces Silicon One G300 to Boost AI Data Center Performance

EPRI, Nvidia, Prologis, and InfraPartners to Develop Smaller AI Data Centers

OpenAI Partners with Tata Group for 100MW AI Data Center in India

UAE Unveils Wafer-Scale AI Chip with Four Trillion Transistors

Akash Systems Launches Diamond-Cooled AI Servers with AMD Instinct GPUs

Semidynamics Unveils 3nm AI Inference Silicon and Full-Stack Systems

OpenAI and Microsoft Join UK’s AI Alignment Coalition

Infosys and Intel Expand Collaboration to Scale Enterprise AI Deployments

Johnson Controls Unveils Thermal Management Guides for Gigawatt-Scale AI Data Centers

OpenAI Launches Frontier Platform for Enterprise AI Agents