OpenAI's Red-Teaming Challenge for GPT-OSS-20B

August 07, 2025
OpenAI has launched a red-teaming challenge on Kaggle to identify vulnerabilities in its GPT-OSS-20B model. Participants are tasked with finding and reporting up to five distinct issues in the model.

OpenAI has initiated a red-teaming challenge on Kaggle to uncover vulnerabilities in its newly released GPT-OSS-20B model. Participants are encouraged to identify and report up to five distinct issues, focusing on areas such as reward hacking, deception, and data exfiltration. The challenge aims to enhance the safety and reliability of AI models by leveraging diverse perspectives and innovative probing techniques.

The competition, which started two days ago, will run for 20 days. Participants are required to submit a detailed report of their findings, including prompts, expected outputs, and automated tests that demonstrate the identified vulnerabilities. The challenge emphasizes creativity and innovation, allowing participants to use various methods to probe the model without altering its weights.

The judging panel, comprising experts from multiple labs, will evaluate submissions based on criteria such as severity, breadth, novelty, and reproducibility. The goal is to advance red-teaming methods and improve AI safety research, with the hope of hosting similar challenges in the future.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Cybersecurity AI Weekly or Daily AI Brief.

Also, consider following us on social media:

Subscribe to Cybersecurity AI Weekly

Weekly newsletter about AI in Cybersecurity.

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Read more