Giskard Unveils Phare: A New Benchmark for Evaluating AI Models

February 19, 2025

Giskard has launched Phare, an open and independent benchmark to assess AI models on security dimensions like hallucination and bias, with Google DeepMind as a research partner.

Giskard Unveils Phare: A New Benchmark for Evaluating AI Models

Giskard has introduced Phare, a new open and independent benchmark designed to evaluate large language models (LLMs) on key security dimensions such as hallucination, factual accuracy, bias, and potential for harm. This announcement was made during the Paris AI Summit, with Google DeepMind collaborating as a research partner. The initiative aims to provide open measurements to assess the trustworthiness of generative AI models in real-world applications announced on their website.

Phare, which stands for "Potential Harm Assessment & Risk Evaluation," is designed to evaluate language models across multiple languages, initially including English, French, and Spanish. The benchmark will incorporate diverse linguistic and cultural contexts to ensure comprehensive assessments. The initial scope covers leading models from top AI labs such as OpenAI, Anthropic, Google DeepMind, Meta, Mistral, Alibaba, and DeepSeek.

The benchmark consists of modular test components focusing on four fundamental safety categories: hallucination, bias and fairness, intentional abuse by users, and harmful content generation. Giskard maintains full autonomy in determining the benchmark design, ensuring independence from model developers. The results from these assessments will be tracked on a public leaderboard, with future modules expanding to cover more languages and additional security aspects.

This collaborative effort is part of a broader initiative to improve AI security and robustness, encouraging practical developments in AI safety. Giskard plans to open-source a representative set of samples for each benchmarking module, enabling independent verification and private model testing.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like AI Policy Brief or Daily AI Brief.

Also, consider following us on social media:

AI Safety & Regulation AI Brief AI Brief (X)

More from: AI Safety

07/30

FAR.AI Launches AI Security Leaderboard for Frontier Model Safeguards

07/30

FAR.AI Opens First International Office in Singapore

07/29

Pangram Raises $9M for AI Content Detection Tools

07/29

Anthropic Says It Does Not Support Ban on Models With Open Weights

07/27

NVIDIA Starts Open Secure AI Alliance for AI Security Tools

Subscribe to AI Policy Brief

Weekly report on AI regulations, safety standards, government policies, and compliance requirements worldwide.

Whitepaper

Stanford HAI’s 2025 AI Index Reveals Record Growth in AI Capabilities, Investment, and Regulation

The 2025 AI Index by Stanford HAI provides a comprehensive overview of the global state of artificial intelligence, highlighting significant advancements in AI capabilities, investment, and regulation. The report details improvements in AI performance, increased adoption in various sectors, and the growing global optimism towards AI, despite ongoing challenges in reasoning and trust. It serves as a critical resource for policymakers, researchers, and industry leaders to understand AI's rapid evolution and its implications.

FAR.AI Launches AI Security Leaderboard for Frontier Model Safeguards

Jul 30, 2026 Cybersecurity

Sentient Index Labs Launches Independent AI Behavioral Risk Assessment

Jul 23, 2026 AI Safety

Categories

Companies

Resources

Giskard Unveils Phare: A New Benchmark for Evaluating AI Models

We hope you enjoyed this article.

More from: AI Safety

Subscribe to AI Policy Brief

Whitepaper

Stanford HAI’s 2025 AI Index Reveals Record Growth in AI Capabilities, Investment, and Regulation

You May Also Like

FAR.AI Launches AI Security Leaderboard for Frontier Model Safeguards

Sentient Index Labs Launches Independent AI Behavioral Risk Assessment