Microsoft Unveils BitNet: A Hyper-Efficient AI Model for CPUs

April 17, 2025

Microsoft researchers have introduced BitNet b1.58 2B4T, a hyper-efficient AI model that can run on CPUs, including Apple's M2, offering significant computational efficiency.

Microsoft researchers have introduced BitNet b1.58 2B4T, a hyper-efficient AI model designed to run on CPUs, including Apple's M2. This model, described as the largest-scale 1-bit AI model or 'bitnet' to date, is openly available under an MIT license. Bitnets are compressed models that quantize weights into three values: -1, 0, and 1, making them more memory- and computing-efficient than traditional models.

BitNet b1.58 2B4T, with 2 billion parameters, was trained on a dataset of 4 trillion tokens. It reportedly outperforms traditional models of similar sizes, such as Meta's Llama 3.2 1B and Google's Gemma 3 1B, on benchmarks like GSM8K and PIQA. The model is also noted for its speed, operating at twice the speed of other models of its size while using significantly less memory.

However, to achieve optimal performance, BitNet requires Microsoft's custom framework, bitnet.cpp, which currently supports only certain hardware, excluding GPUs. This limitation highlights a potential challenge in broader adoption, as GPUs are prevalent in AI infrastructure.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Daily AI Brief.

Also, consider following us on social media:

AI Brief AI Brief (X)

Subscribe to Daily AI Brief

Daily report covering major AI developments and industry news, with both top stories and complete market updates

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Categories

Companies

Resources

Microsoft Unveils BitNet: A Hyper-Efficient AI Model for CPUs

We hope you enjoyed this article.

Subscribe to Daily AI Brief

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

You May Also Like

Google DeepMind Unveils Gemma 3 270M for Efficient On-Device AI

Tencent Unveils Compact Hunyuan AI Models

NVIDIA Releases Llama Nemotron Super 49B v1.5 AI Model

Zhipu AI Unveils New Open-Source Model GLM 4.5

OpenAI Releases GPT-OSS Models for Laptops

xAI Completes Rapid Installation of 550,000 Nvidia GPUs

SuperX Unveils All-in-One Multi-Model Server Series

Alibaba Launches Qwen3-Coder, an Advanced AI Coding Model

Blaize Unveils AI Platform for Multi-Modal Intelligence at the Edge

Alibaba Unveils Wan2.2 AI Video Generation Model

Normal Computing Unveils World's First Thermodynamic Computing Chip

Skywork AI Open-Sources UniPic 2.0 for Multimodal AI