Baidu Releases ERNIE-4.5-VL-28B-A3B-Thinking Multimodal AI Model

November 12, 2025

Baidu has launched ERNIE-4.5-VL-28B-A3B-Thinking, an open-source multimodal AI model that processes text, images, and video with high efficiency. The system activates only 3 billion of its 28 billion parameters and is available under the Apache 2.0 license.

Baidu has introduced a new open-source multimodal AI model called ERNIE-4.5-VL-28B-A3B-Thinking, announced on its AI Studio platform. The model is designed to handle text, images, and video inputs while consuming significantly fewer computing resources than comparable systems from other major AI developers.

The model operates on a Mixture-of-Experts architecture with 28 billion total parameters but activates only 3 billion during inference. This selective activation allows it to perform complex reasoning tasks efficiently on a single 80GB GPU. Baidu states that the model matches the performance of leading systems while maintaining lower computational costs.

ERNIE-4.5-VL-28B-A3B-Thinking introduces several capabilities including visual reasoning, STEM problem solving, visual grounding, and video understanding. A distinctive feature, called “Thinking with Images,” enables the model to zoom in and out of images dynamically, improving its ability to analyze fine-grained visual details. It also supports tool calling for functions like image search and external data access.

The model is available under the Apache 2.0 license, allowing unrestricted commercial use. It supports deployment through multiple frameworks such as Transformers, vLLM, and Baidu’s FastDeploy toolkit. Developers can also fine-tune the model using ERNIEKit, Baidu’s training framework built on PaddlePaddle.

According to the official documentation, the model’s context window extends to 128,000 tokens and it supports both Chinese and English. Its design aims to make advanced multimodal reasoning more accessible to enterprises and researchers seeking efficient AI solutions.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Daily AI Brief.

Also, consider following us on social media:

AI Brief AI Brief (X)

Subscribe to Daily AI Brief

Daily report covering major AI developments and industry news, with both top stories and complete market updates

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Categories

Companies

Resources

Baidu Releases ERNIE-4.5-VL-28B-A3B-Thinking Multimodal AI Model

We hope you enjoyed this article.

Subscribe to Daily AI Brief

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

You May Also Like

Unisound Releases U2, a Native Agentic Large Model for Complex Task Execution

Neurometric Launches Automated Token Engineering Platform and Raises $4 Million

Z.ai Releases GLM 5.2 Open Model with 1M Context and MIT License

Brown Bacon AI Launches The Big Pan V2 Private AI Platform

VIDIZMO Highlights Local Control as Enterprise AI Platforms Turn to Foreign Models

Innovative Solutions Launches DarcyIQ Anywhere for Enterprise AI Context Integration

MGI Tech and Shanghai AI Lab Introduce Physical AI Systems for Life Sciences

OpenAI Begins Limited Preview of GPT-5.6 Series with Sol, Terra, and Luna Models

ShengShu Technology Unveils Vidu S1 for Real Time Interactive AI Video

OpenAI and Broadcom Reveal Jalapeño Inference Chip for LLMs

Pollo AI Launches Unified API for Access to 300 AI Video and Image Models

SAIHEAT Expands into AI Inference Services for Enterprises