Google Introduces Gemini 3.1 Flash-Lite for High-Volume AI Workloads

March 04, 2026
Google has launched Gemini 3.1 Flash-Lite, a new addition to the Gemini 3 series designed for high-volume developer workloads. The model offers faster performance and lower costs, now available in preview through the Gemini API, AI Studio, and Vertex AI.

Google has introduced Gemini 3.1 Flash-Lite, its fastest and most cost-efficient model in the Gemini 3 series. The model is designed for developers managing large-scale workloads and is now available in preview through the Gemini API in Google AI Studio and for enterprise users via Vertex AI.

Gemini 3.1 Flash-Lite is priced at $0.25 per million input tokens and $1.50 per million output tokens. It delivers 2.5 times faster response times and a 45% increase in output speed compared to Gemini 2.5 Flash, while maintaining similar or improved quality. According to benchmark results, it achieved an Elo score of 1432 on the Arena.ai leaderboard and scored 86.9% on GPQA Diamond and 76.8% on MMMU Pro, outperforming previous-generation models.

The model includes configurable “thinking levels” in AI Studio and Vertex AI, allowing developers to adjust reasoning depth for different tasks. This feature supports both high-frequency operations like translation and content moderation, and more complex tasks such as UI generation or simulation building.

Early-access users, including companies like Latitude, Cartwheel, and Whering, have reported efficient performance and strong instruction-following capabilities. Gemini 3.1 Flash-Lite is part of Alphabet’s ongoing expansion of the Gemini 3 lineup, aimed at scaling AI capabilities across diverse use cases.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Daily AI Brief.

Also, consider following us on social media:

Subscribe to Daily AI Brief

Daily report covering major AI developments and industry news, with both top stories and complete market updates

Industry analysis

2025 Global Business Services Agenda: Gen AI Takes Center Stage

The Hackett Group

This industry analysis by The Hackett Group explores the transformative impact of generative artificial intelligence (Gen AI) on global business services (GBS) in 2025. The study highlights the shift from exploration to acceleration of Gen AI initiatives, with 89% of executives advancing these projects to improve customer satisfaction, innovate products, and reduce costs. The report also discusses the challenges and strategies for successful Gen AI adoption, emphasizing the need for a technology-enabled operating model and the importance of reskilling the workforce.

Read more