OpenAI has introduced Aardvark, a GPT-5-powered autonomous agent that scans, analyzes, and patches code like a human security researcher. “By integrating directly into the development process, Aardvark aims to turn security from a post-development concern into an ongoing safeguard that evolves with the software itself,” reports InfoWorld. From the report: What makes Aardvark unique, OpenAI noted, is its combination of reasoning, automation and verification. Instead of simply highlighting potential vulnerabilities, the agent promises a multi-stage analysis, starting by mapping an entire repository and building a contextual threat model around it. From there, it continually monitors new commits, checking whether each change introduces risks or violates existing security patterns.
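The pipeline InfoWorld describes, one full-repo pass to build a threat model and then per-commit checks against it, can be made concrete with a short sketch. Everything below is hypothetical: the `ThreatModel` structure and function names are illustrative and do not reflect Aardvark's actual internals or any OpenAI API.

```python
# Illustrative sketch of the two-stage loop described above. All names here
# are invented for illustration, not taken from Aardvark.
from dataclasses import dataclass, field

@dataclass
class ThreatModel:
    """Contextual security assumptions derived from a full-repo pass."""
    trust_boundaries: list[str] = field(default_factory=list)
    sensitive_paths: list[str] = field(default_factory=list)

def build_threat_model(repo_path: str) -> ThreatModel:
    # Stage 1: map the repository once and record what matters for security.
    # A real agent would use an LLM pass here; this stub just seeds defaults.
    return ThreatModel(trust_boundaries=["api/"], sensitive_paths=["auth/"])

def review_commit(model: ThreatModel, changed_files: list[str]) -> list[str]:
    # Stage 2: on every new commit, flag changes that touch sensitive areas
    # or cross a recorded trust boundary for deeper analysis.
    watched = model.sensitive_paths + model.trust_boundaries
    return [f for f in changed_files if any(f.startswith(p) for p in watched)]

model = build_threat_model(".")
print(review_commit(model, ["auth/session.py", "docs/README.md"]))
# -> ['auth/session.py']
```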
Additionally, when it identifies a potential issue, Aardvark attempts to validate the exploitability of the finding in an isolated environment before flagging it. This validation step could prove transformative. Traditional static analysis tools often overwhelm developers with false alarms: problems that may look risky but are not actually exploitable. “The biggest advantage is that it will significantly reduce false positives,” Jain said. “It’s useful in open source codes and as part of the development process.”
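The validation idea is easy to illustrate: a finding is reported only if a candidate proof-of-concept actually reproduces in isolation. In this hedged sketch the “sandbox” is just a subprocess with a timeout, the exit-code convention is an assumption, and a production system would need far stronger isolation (containers or VMs).

```python
# Sketch of exploit validation as a false-positive filter. Names and the
# exit-code convention are assumptions, not Aardvark's actual mechanism.
import os
import subprocess
import sys
import tempfile

def validate_finding(poc_source: str, timeout: float = 5.0) -> bool:
    """Return True only if the proof-of-concept demonstrably triggers the bug."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(poc_source)
        path = f.name
    try:
        # Run the PoC in a separate process with a hard timeout; by the
        # convention used here, a nonzero exit means the exploit reproduced.
        result = subprocess.run([sys.executable, path],
                                capture_output=True, timeout=timeout)
        return result.returncode != 0
    except subprocess.TimeoutExpired:
        return False  # inconclusive runs are not reported as findings
    finally:
        os.unlink(path)

# A PoC for an unchecked index: it crashes, so the finding validates.
print(validate_finding("data = [1, 2]\nprint(data[10])"))  # True
```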
Once a vulnerability is confirmed, Aardvark integrates with Codex to propose a patch and then reanalyzes the fix to ensure it doesn’t introduce new issues. OpenAI claims that in benchmark testing, the system identified 92 percent of known and synthetically introduced vulnerabilities in test repositories, a promising indication that AI could soon take on some of the burden of modern code auditing.
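The propose-and-reverify step can be sketched the same way. Here `propose_patch` stands in for a Codex-style generator, and the reproduction and test checks are stubs; the point is the acceptance logic, in which a fix is kept only if it closes the hole without breaking anything else.

```python
# Minimal sketch of the patch-then-reverify loop. All functions are
# hypothetical stubs, not OpenAI's actual interfaces.
from typing import Optional

def propose_patch(finding: str) -> str:
    # In the real system an LLM drafts a diff; here we return a placeholder.
    return f"--- illustrative diff addressing: {finding}"

def poc_reproduces(patched_tree: str) -> bool:
    return False  # stub: rerun the validated PoC against the patched tree

def tests_pass(patched_tree: str) -> bool:
    return True   # stub: e.g. run the project's existing test suite

def fix_and_reverify(finding: str, tree: str) -> Optional[str]:
    patch = propose_patch(finding)
    patched = tree + "\n" + patch  # stand-in for actually applying the diff
    # Accept the patch only if the exploit no longer reproduces and nothing
    # else broke, mirroring the "reanalyze the fix" step described above.
    if not poc_reproduces(patched) and tests_pass(patched):
        return patch
    return None

print(fix_and_reverify("unchecked index in auth/session.py", "repo-tree"))
```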
