Microsoft Unveils BitNet: A Hyper-Efficient AI Model for CPUs

Microsoft researchers have introduced BitNet b1.58 2B4T, a hyper-efficient AI model that can run on CPUs, including Apple's M2, offering significant computational efficiency.

Microsoft researchers have introduced BitNet b1.58 2B4T, a hyper-efficient AI model designed to run on CPUs, including Apple's M2. This model, described as the largest-scale 1-bit AI model or 'bitnet' to date, is openly available under an MIT license. Bitnets are compressed models that quantize weights into three values: -1, 0, and 1, making them more memory- and computing-efficient than traditional models.

BitNet b1.58 2B4T, with 2 billion parameters, was trained on a dataset of 4 trillion tokens. It reportedly outperforms traditional models of similar sizes, such as Meta's Llama 3.2 1B and Google's Gemma 3 1B, on benchmarks like GSM8K and PIQA. The model is also noted for its speed, operating at twice the speed of other models of its size while using significantly less memory.

However, to achieve optimal performance, BitNet requires Microsoft's custom framework, bitnet.cpp, which currently supports only certain hardware, excluding GPUs. This limitation highlights a potential challenge in broader adoption, as GPUs are prevalent in AI infrastructure.

We hope you enjoyed this article.

Consider subscribing to one of several newsletters we publish. For example, in the Daily AI Brief you can read the most up to date AI news round-up 6 days per week.

Also, consider following us on social media:

Subscribe to Daily AI Brief

Daily report covering major AI developments and industry news, with both top stories and complete market updates

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Read more