Meta's Maverick AI Model Falls Short in Benchmark Rankings

Meta's Llama 4 Maverick AI model ranks below competitors on the LM Arena benchmark, following controversy over the use of an experimental version.

Meta Platforms, Inc. has faced scrutiny after its Llama 4 Maverick AI model ranked below competitors on the LM Arena benchmark. The result follows a controversy in which Meta used an experimental version of the model to achieve a high score, prompting LM Arena to revise its policies and evaluate the unmodified version instead. The vanilla Maverick model, 'Llama-4-Maverick-17B-128E-Instruct,' placed below models such as OpenAI's GPT-4o and Google's Gemini 1.5 Pro.

Meta's experimental version, 'Llama-4-Maverick-03-26-Experimental,' was optimized for conversational tasks, which initially earned it a high ranking. The unmodified version proved less competitive, however, placing 32nd on the benchmark. A Meta spokesperson said the company experiments with a variety of custom variants and is eager to see how developers will use the open-source version of Llama 4.

