NVIDIA Introduces NVFP4 for Efficient 4-Bit AI Model Pretraining

August 26, 2025
NVIDIA has unveiled NVFP4, a 4-bit format designed to enhance AI model pretraining efficiency, offering the precision of 16-bit with the speed of 4-bit.

NVIDIA has introduced NVFP4, a new 4-bit format aimed at revolutionizing AI model pretraining by combining the precision of 16-bit with the speed and efficiency of 4-bit. This innovation was detailed in a company blog post, highlighting its potential to significantly improve training efficiency for large language models (LLMs).

NVFP4 is designed to address the growing demands of AI workloads, particularly in the pretraining phase of multi-billion-parameter models. By utilizing 4-bit precision, NVFP4 reduces memory requirements and boosts arithmetic throughput, enabling AI factories to process more tokens with the same hardware resources. This advancement is crucial for sustaining higher token throughput, a key factor in unlocking new model capabilities.

The NVFP4 format employs several techniques to maintain accuracy and stability during training, such as micro-block scaling and stochastic rounding. These methods ensure that even with reduced precision, the training process remains effective and efficient. NVIDIA is actively collaborating with major organizations like Amazon Web Services, Google Cloud, and Microsoft AI to explore and validate the potential of NVFP4 in large-scale model pretraining.

This development marks a significant step forward in AI model training, setting a new standard for efficiency and scalability in the field.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Silicon Brief or Daily AI Brief.

Also, consider following us on social media:

Subscribe to Silicon Brief

Weekly coverage of AI hardware developments including chips, GPUs, cloud platforms, and data center technology.

Whitepaper

Governing the Future: A Strategic Framework for AI Adoption in Financial Institutions

This whitepaper explores the transformative impact of artificial intelligence on the financial industry, focusing on the governance challenges and regulatory demands faced by banks. It provides a strategic framework for AI adoption, emphasizing the importance of a unified AI approach to streamline compliance and reduce operational costs. The document offers actionable insights and expert recommendations for banks with fewer than 2,000 employees to become leaders in compliant, customer-centric AI.

Read more