Darwin Gödel Machine: A Self-Improving AI for Code Evolution

Researchers from Sakana AI, the University of British Columbia, and the Vector Institute have introduced the Darwin Gödel Machine, a self-modifying AI system designed to autonomously evolve its code using foundation models and real-world benchmarks.

Sakana AI, in collaboration with the University of British Columbia and the Vector Institute, has introduced the Darwin Gödel Machine (DGM), a novel AI system capable of self-improvement through code evolution. Unlike traditional AI systems that remain static post-deployment, DGM continuously modifies its own code, guided by performance metrics from coding benchmarks such as SWE-bench and Polyglot.

The DGM employs frozen foundation models to facilitate code execution and generation, beginning with a base coding agent that iteratively modifies itself to produce new variants. These variants are evaluated and retained if they demonstrate successful compilation and self-improvement, mimicking biological evolution by preserving diversity and enabling breakthroughs from previously suboptimal designs.

In testing, DGM improved its performance on SWE-bench from 20.0% to 50.0% and on Polyglot from 14.2% to 30.7%. These results underscore DGM's ability to autonomously refine its architecture and reasoning strategies without human intervention, outperforming hand-tuned systems in multiple scenarios.

The Darwin Gödel Machine represents a practical reinterpretation of the Gödel Machine concept, shifting from logical proof to evidence-driven iteration. While still computationally intensive, it offers a scalable path toward open-ended AI evolution, potentially expanding beyond code generation to broader domains in the future.

We hope you enjoyed this article.

Consider subscribing to one of several newsletters we publish. For example, in the Daily AI Brief you can read the most up to date AI news round-up 6 days per week.

Also, consider following us on social media:

Subscribe to AI Programming Weekly

Weekly news about AI tools for software engineers, AI enabled IDE's and much more.

Market report

2025 Generative AI in Professional Services Report

Thomson Reuters

This report by Thomson Reuters explores the integration and impact of generative AI technologies, such as ChatGPT and Microsoft Copilot, within the professional services sector. It highlights the growing adoption of GenAI tools across industries like legal, tax, accounting, and government, and discusses the challenges and opportunities these technologies present. The report also examines professionals' perceptions of GenAI and the need for strategic integration to maximize its value.

Read more