
OpenAI Introduces Safety Evaluations Hub for AI Models
OpenAI has launched a new Safety Evaluations Hub to publish the results of its AI model safety tests more frequently. This initiative, announced on May 14, 2025, aims to increase transparency by providing ongoing updates on how OpenAI's models perform in various safety evaluations, including tests for harmful content generation, jailbreaks, and hallucinations.
The Safety Evaluations Hub will serve as a resource for exploring safety results, refreshed alongside major model releases. OpenAI says the hub will help readers track the safety performance of its systems over time and support community efforts to increase transparency across the AI field.
The hub includes evaluations that measure a model's ability to avoid generating harmful content, resist adversarial prompts, and maintain factual accuracy. OpenAI plans to update the hub periodically as part of its broader effort to communicate more proactively about AI safety.