Skip to content

AI Aids in Vulnerability Detection, but Bogus Reports Pose Challenges

AI is revolutionizing vulnerability detection, but it's also causing headaches for maintainers due to an influx of bogus reports.

In this picture we can see a web page, in the web page we can find some text and a machine.
In this picture we can see a web page, in the web page we can find some text and a machine.

AI Aids in Vulnerability Detection, but Bogus Reports Pose Challenges

AI tools, when used by experienced humans, can significantly aid in detecting software vulnerabilities and bugs. This has been demonstrated in various open-source projects, including curl. However, the misuse of AI in generating bogus reports has also posed challenges to maintainers.

Security researcher Joshua Rogers, using AI-based scanner tools, uncovered hundreds of real security vulnerabilities and bugs in open-source software projects like curl. Rogers tested several AI vulnerability scanning tools, such as Almanax, Corgea, ZeroPath, Gecko, and Amplify, and found them capable of identifying genuine issues in complex code.

Over the past two years, the curl project has grappled with a deluge of invalid bug reports generated by AI models. Daniel Stenberg, the curl project maintainer, has had to publish blog posts discouraging such submissions. However, Google's OSS-Fuzz project has also found LLM-assisted bug hunting to be effective.

Rogers' reports, described by Stenberg as 'truly awesome findings', led to approximately 50 bugfixes being merged into the curl project. Rogers praised ZeroPath for its role in identifying hundreds of real vulnerabilities and bugs in critical software, including curl. Unfortunately, shoddy AI-generated bug reports have affected not only curl but also the Python community, Open Collective, and the Mesa Project.

While AI tools can greatly assist in finding vulnerabilities, their misuse can also cause significant issues for maintainers. As AI continues to evolve, it's crucial for users to understand its limitations and for projects to implement measures to filter out invalid reports.

Read also:

Latest