AI Safety

Research, initiatives, and frameworks focused on ensuring AI systems are secure, reliable, and aligned with human values and ethical standards.

Safe Pro Appoints Young J. Bang to Lead AI Integration for U.S. Military

Safe Pro Group has appointed Young J. Bang, former Principal Deputy Assistant Secretary of the Army, to spearhead the integration of AI technology into U.S. military systems, announced in a press release.

March 03, 2025

Anthropic Introduces Hierarchical Summarization for AI Monitoring

Anthropic has unveiled a new approach called hierarchical summarization to enhance AI monitoring, particularly for its computer use capabilities.

February 28, 2025

Infosys Introduces Open-Source Responsible AI Toolkit

Infosys has launched an open-source Responsible AI Toolkit to enhance trust and transparency in AI, announced in a press release. The toolkit is part of the Infosys Topaz Responsible AI Suite.

February 26, 2025

Leidos and SeeTrue Partner to Enhance AI Threat Detection

Leidos and SeeTrue have announced a collaboration to improve AI-powered threat detection technology for airport security and customs screenings.

February 25, 2025

OpenAI Bans Accounts Misusing ChatGPT for Surveillance

OpenAI has banned accounts from China and North Korea for using ChatGPT in surveillance and influence operations, according to Reuters.

February 23, 2025

Exabits and Phala Network Enhance AI Security with TEE-Enabled Infrastructure

Exabits has partnered with Phala Network to offer TEE-enabled GPU clusters for secure AI data processing, announced in a press release.

February 22, 2025

DeepSeek to Open-Source AGI Research Amid Privacy Concerns

DeepSeek, a Chinese AI startup, plans to open-source five repositories next week to promote transparency and community-driven innovation, amid ongoing privacy concerns.

February 22, 2025

Securiti and Databricks Collaborate to Enhance Enterprise AI Systems

Securiti has partnered with Databricks to integrate Databricks Mosaic AI and Delta tables into its Gencore AI solution, enabling safer enterprise AI development, according to a press release.

February 19, 2025

Giskard Unveils Phare: A New Benchmark for Evaluating AI Models

Giskard has launched Phare, an open and independent benchmark to assess AI models on security dimensions like hallucination and bias, with Google DeepMind as a research partner.

February 19, 2025

Mira Murati Launches Thinking Machine Labs with AI Focus

Former OpenAI CTO Mira Murati has launched a new AI startup, Thinking Machine Labs, with a team of top researchers and engineers, including many from OpenAI.

February 19, 2025

Pangea Launches AI Security Guardrails and $10,000 Jailbreak Competition

Pangea has announced the availability of AI Guard and Prompt Guard to enhance AI security, alongside a $10,000 jailbreak competition to highlight AI vulnerabilities.

February 18, 2025

OpenAI Co-Founder Sutskever's Startup Valued Over $30 Billion

Ilya Sutskever, co-founder of OpenAI, is raising over $1 billion for his startup Safe Superintelligence, which is now valued at more than $30 billion.

February 18, 2025

Caseware AiDA Receives Positive Evaluation for AI Safety Protocols

Caseware's AI digital assistant, AiDA, has been positively evaluated for its safety protocols by the Holistic AI Governance Platform, ensuring data security and compliance for accounting professionals.

February 13, 2025

ArisGlobal Joins EU AI Pact to Promote Ethical AI Practices

ArisGlobal has signed the EU AI Pact, reinforcing its commitment to ethical AI practices and preparing for the EU AI Act.

February 12, 2025

DeepSeek AI Model Faces Security Concerns After AppSOC Testing

AppSOC's testing reveals significant security vulnerabilities in DeepSeek's AI model, raising concerns over its use in enterprise applications.

February 12, 2025

ROOST Initiative Launches to Enhance AI Safety with Open-Source Tools

The ROOST initiative, launched at the AI Action Summit in Paris, aims to provide open-source safety tools for AI, focusing on child safety and leveraging large language models.

February 10, 2025

OpenAI CEO Sam Altman Discusses AI Accessibility and Future Plans

OpenAI CEO Sam Altman proposes a 'compute budget' to ensure AI benefits are widely distributed, while addressing the challenges of AGI development.

February 10, 2025

G42 and Microsoft Launch Responsible AI Foundation in Abu Dhabi

G42 and Microsoft have launched the Responsible AI Foundation in Abu Dhabi, focusing on promoting responsible AI standards in the Middle East and Global South.

February 09, 2025

Subscribe to AI Policy Brief

Weekly report on AI regulations, safety standards, government policies, and compliance requirements worldwide.