CoreWeave Sets New AI Benchmark with NVIDIA GB200 Superchips

CoreWeave has set a new record in AI inference benchmarks using NVIDIA GB200 Grace Blackwell Superchips, as announced in a press release. The company reported delivering 800 tokens per second (TPS) on the Llama 3.1 405B model, one of the largest open-source models, using a CoreWeave instance equipped with two NVIDIA Grace CPUs and four NVIDIA Blackwell GPUs.

Additionally, CoreWeave submitted results for its NVIDIA H200 GPU instances, achieving 33,000 TPS on the Llama 2 70B model, a 40% improvement over its previous NVIDIA H100 instances. These results underscore CoreWeave's position as a leading provider of cloud infrastructure optimized for AI workloads.
