IBM Releases Granite 4.1 Model Family for Enterprise AI

May 01, 2026
IBM has introduced the Granite 4.1 collection, its largest release of AI models to date, covering language, vision, speech, embedding, and guardian models designed for enterprise use. The models deliver improved performance in instruction following, transcription accuracy, and harm detection, and are available under the Apache 2.0 license.

IBM has introduced the Granite 4.1 family of models, its most extensive release to date, according to IBM Research. The collection includes new language, vision, speech, embedding, and guardian models designed for enterprise workloads, all available under the Apache 2.0 license.

At the core of the release are the Granite 4.1 language models, available in 3B, 8B, and 30B parameter sizes. These models deliver improved performance in instruction following and tool calling compared with previous Granite 4.0 versions. They are trained on around 15 trillion tokens and optimized for efficiency and reliability across enterprise applications.

Granite Vision 4.1 focuses on document understanding, including table, chart, and key-value pair extraction. It uses a training approach that combines real and synthetic enterprise data to enhance performance. IBM also released ChartNet, a large dataset created to strengthen chart comprehension.

Granite Speech 4.1 introduces multilingual speech recognition and translation models, offering options that balance throughput, latency, and transcription quality. One variant, Granite Speech 4.1 2B, achieves a 5.33 percent word error rate on the OpenASR Leaderboard. The models are designed for deployment in noisy or resource-constrained environments.

The Granite Guardian 4.1 model replaces version 3.3 and extends risk detection capabilities to evaluate safety, bias, and correctness in AI outputs. Meanwhile, Granite Embedding Multilingual R2 supports over 200 languages for efficient semantic search across large document sets.

All Granite 4.1 models can be accessed through platforms such as watsonx, Hugging Face, and Replicate, and are optimized for inference runtimes including vLLM, SGLang, and llama.cpp.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Enterprise AI Brief or Daily AI Brief.

Also, consider following us on social media:

Subscribe to Enterprise AI Brief

Weekly report on AI business applications, enterprise software releases, automation tools, and industry implementations.

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Read more