Anthropic Releases Bloom, an Open-Source Framework for AI Behavior Evaluation

December 22, 2025
Anthropic has introduced Bloom, an open-source tool designed to automate behavioral evaluations of large AI models. The system generates and scores scenarios to measure behaviors like bias and self-preservation across multiple models.

Anthropic has released Bloom, an open-source framework for automated behavioral evaluations of frontier AI models, according to an announcement on the company's website. The tool allows researchers to specify a target behavior and automatically generate scenarios that test how often and how severely that behavior appears.

Bloom operates through four stages (understanding, ideation, rollout, and judgment) to produce evaluation suites that quantify the presence of specific behaviors. It integrates with research tools such as Weights & Biases for large-scale experiments and exports results in Inspect-compatible formats. Each run can generate fresh scenarios, while a seed configuration file keeps results reproducible.
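
The announcement does not publish Bloom's configuration schema, so the sketch below is purely illustrative: every field name is a hypothetical stand-in, meant only to show how a single seed file could pin down a run (target behavior, models under test, judge, random seed) while still letting each run generate new scenarios.

```python
# Hypothetical sketch only: Bloom's real configuration schema is not
# detailed in the announcement, and none of these field names are
# confirmed. This illustrates the idea, not the actual API.
seed_config = {
    "target_behavior": "self-preferential bias",  # behavior to elicit and score
    "pipeline_stages": [                          # the four stages named above
        "understanding", "ideation", "rollout", "judgment",
    ],
    "num_scenarios": 100,              # scenarios the ideation stage should produce
    "target_models": ["model-a", "model-b"],  # placeholder model identifiers
    "judge_model": "judge-model",      # placeholder judge for the judgment stage
    "random_seed": 1234,               # fixing this makes a run reproducible
    "export_format": "inspect",        # Inspect-compatible output, per the announcement
}
```

Under this framing, re-running with the same seed would reproduce a prior suite, while changing or omitting the seed yields fresh scenarios for the same target behavior.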

The company reported benchmark results for behaviors including delusional sycophancy, instructed long-horizon sabotage, self-preservation, and self-preferential bias across 16 models. Anthropic said these evaluations were built with Bloom within days, and the scores aligned closely with human-labeled judgments. In validation tests, Claude Opus 4.1 correlated most strongly with human scoring among the models evaluated as judges, achieving a Spearman correlation of 0.86.
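
For readers unfamiliar with the validation metric: Spearman correlation compares two rankings rather than raw values, which suits checking whether an automated judge orders transcripts by severity the way human labelers do. The snippet below uses made-up scores purely to show the computation; only the metric itself comes from the announcement.

```python
# Illustrative only: toy scores, not Anthropic's validation data.
from scipy.stats import spearmanr

human_scores = [1, 3, 2, 5, 4, 4, 2]  # hypothetical human severity labels
judge_scores = [1, 2, 3, 5, 4, 5, 2]  # hypothetical model-judge scores

rho, p_value = spearmanr(human_scores, judge_scores)
print(f"Spearman correlation: {rho:.2f}")  # 1.0 would mean identical rankings
```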

Bloom complements Anthropic’s earlier open-source tool, Petri, which explores AI models' behavioral profiles through simulated interactions. Researchers can access Bloom and its documentation via the official GitHub repository.
