NewsGuard Launches FAILSafe to Protect AI from Foreign Disinformation

March 11, 2025

NewsGuard has introduced the FAILSafe service to shield AI models from foreign influence operations, particularly targeting Russian, Chinese, and Iranian disinformation.

NewsGuard Launches FAILSafe to Protect AI from Foreign Disinformation

NewsGuard has launched a new service called the Foreign Adversary Infection of LLMs Safety Service (FAILSafe) to protect AI models from foreign influence operations, announced in a press release. This initiative comes in response to reports of a pro-Kremlin program that has infiltrated AI models with disinformation.

FAILSafe provides AI companies with real-time data verified by NewsGuard's disinformation researchers. The service includes a continuously updated feed of false narratives spread by Russian, Chinese, and Iranian influence operations, as well as a database of websites and accounts involved in these operations. This data helps AI companies prevent their systems from repeating these narratives.

Additionally, FAILSafe offers periodic stress-testing of AI products to identify the extent of disinformation infiltration and provides continuous monitoring and alerts about emerging disinformation risks. This comprehensive approach aims to safeguard AI models against the manipulation of large language models by foreign influence networks.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like AI Policy Brief or Daily AI Brief.

Also, consider following us on social media:

AI Safety & Regulation AI Brief AI Brief (X)

More from: AI Safety

11/06

First Key Update of International AI Safety Report Released

10/29

OpenAI Releases Open-Weight Safety Reasoning Models for Developers

10/26

OpenAI Backs Valthos in $30 Million Biosecurity Funding Round

10/20

ESMO Issues First Guidelines for Safe Use of AI Language Models in Oncology

10/20

NeuralTrust Reports First Signs of Self-Fixing AI Behavior

Subscribe to AI Policy Brief

Weekly report on AI regulations, safety standards, government policies, and compliance requirements worldwide.

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Categories

Companies

Resources

NewsGuard Launches FAILSafe to Protect AI from Foreign Disinformation

We hope you enjoyed this article.

More from: AI Safety

Subscribe to AI Policy Brief

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

You May Also Like

BrainFreeze Introduces AI Safety Guardrails for K-12 Education

Advanced Fraud Solutions Develops AI That Learns from Fraudsters’ Tactics

First Key Update of International AI Safety Report Released

OpenAI and Microsoft Join State-Led AI Safety Task Force

Bolster AI Launches Signals for Real-Time Brand and Cyber Risk Intelligence

Center for Frontier AI Security Launches to Advance AI in National Defense

NeuralTrust Reports First Signs of Self-Fixing AI Behavior

AISLE Emerges from Stealth with AI-Native Cyber Reasoning System

UpGuard Report Finds 68% of Security Leaders Use Unauthorized AI Tools

OpenAI Releases Open-Weight Safety Reasoning Models for Developers

ACC Intelligence Launches Definity Synthetic Audience Platform for Marketing Pre-Testing

Leidos and VML Unveil Imperium AI Platform for U.S. Information Operations