DeepSeek Clarifies True Cost of Training AI Model R1

September 22, 2025
DeepSeek has clarified that the actual cost of training its AI model R1 was significantly higher than initially reported, with the total expenses reaching approximately $5.87 million.

The clarification follows confusion over initial reports. The company initially stated that the training cost was $294,000, but this figure covered only the reinforcement learning phase. The complete training process, including the foundational work on the V3 base model, cost around $5.87 million.

The misunderstanding arose from supplementary information released with a paper in January. It detailed the use of 512 Nvidia H800 GPUs for 198 hours to train the preliminary R1-Zero release, but it did not cover the extensive pre-training of the V3 base model, which required 2.79 million GPU hours.
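The figures above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the flat rate of $2 per H800 GPU-hour is an assumption that does not appear in this article, and the variable names are made up for the example.

```python
# Figures quoted in the article.
R1_ZERO_GPUS = 512
R1_ZERO_HOURS = 198
V3_GPU_HOURS = 2_790_000
RL_COST_USD = 294_000
TOTAL_COST_USD = 5_870_000

# ASSUMPTION: a flat rental rate per H800 GPU-hour (not stated in the article).
ASSUMED_RATE_USD = 2.0

# GPU-hours consumed by the R1-Zero run described in the supplementary information.
r1_zero_gpu_hours = R1_ZERO_GPUS * R1_ZERO_HOURS

# Cost of that run alone at the assumed rate.
r1_zero_cost_usd = r1_zero_gpu_hours * ASSUMED_RATE_USD

# Portion of the total attributable to the V3 base model.
v3_cost_usd = TOTAL_COST_USD - RL_COST_USD

# Hourly rate implied by the article's own numbers for V3 pre-training.
implied_rate_usd = v3_cost_usd / V3_GPU_HOURS

print(f"R1-Zero GPU-hours:      {r1_zero_gpu_hours:,}")
print(f"R1-Zero cost (assumed): ${r1_zero_cost_usd:,.0f}")
print(f"V3 share of total:      ${v3_cost_usd:,}")
print(f"Implied $/GPU-hour:     {implied_rate_usd:.2f}")
```

The article's totals imply a rate very close to the assumed $2 per GPU-hour. At that rate the R1-Zero run alone would come in below $294,000, which suggests the reinforcement learning figure covers more than that single run, although the article does not itemize the remainder.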

DeepSeek's R1 model was trained using a technique called Group Relative Policy Optimization (GRPO), a form of reinforcement learning. GRPO was applied only after the V3 base model had been developed, and building that base model accounted for most of the computational resources and cost. The company has emphasized that the $294,000 figure was only for the reinforcement learning phase, not the entire training process.
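For readers unfamiliar with GRPO, its central idea is to score each sampled response relative to the other responses generated for the same prompt, rather than against a separately trained value model. The following is a minimal sketch of that normalization step only, not DeepSeek's implementation: the function name, the example rewards, the use of population standard deviation, and the small `eps` term are all assumptions made for illustration, and the clipped policy objective and KL penalty used in full GRPO are omitted.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of responses to the same prompt.

    Each advantage is (reward - group mean) / (group std + eps).
    ASSUMPTION: population std and eps are illustrative choices.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four sampled answers to a single prompt
# (1.0 = judged correct, 0.0 = judged incorrect).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Responses that beat the group average receive a positive advantage and are reinforced, while below-average responses receive a negative one.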
