By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
Online Tech Guru
  • News
  • PC/Windows
  • Mobile
  • Apps
  • Gadgets
  • More
    • Gaming
    • Accessories
    • Editor’s Choice
    • Press Release
Reading: AI Agents Are Getting Better at Writing Code—and Hacking It as Well
Best Deal
Font ResizerAa
Online Tech GuruOnline Tech Guru
  • News
  • Mobile
  • PC/Windows
  • Gaming
  • Apps
  • Gadgets
  • Accessories
Search
  • News
  • PC/Windows
  • Mobile
  • Apps
  • Gadgets
  • More
    • Gaming
    • Accessories
    • Editor’s Choice
    • Press Release

Nothing Phone 3 With Snapdragon 8s Gen 4 SoC Surfaces on Geekbench Ahead of Launch

News Room News Room 26 June 2025
FacebookLike
InstagramFollow
YoutubeSubscribe
TiktokFollow
  • Subscribe
  • Privacy Policy
  • Contact
  • Terms of Use
© Foxiz News Network. Ruby Design Company. All Rights Reserved.
Online Tech Guru > News > AI Agents Are Getting Better at Writing Code—and Hacking It as Well
News

AI Agents Are Getting Better at Writing Code—and Hacking It as Well

News Room
Last updated: 25 June 2025 18:49
By News Room 4 Min Read
Share
SHARE

The latest artificial intelligence models are not only remarkably good at software engineering—new research shows they are getting ever-better at finding bugs in software, too.

AI researchers at UC Berkeley tested how well the latest AI models and agents could find vulnerabilities in 188 large open source codebases. Using a new benchmark called CyberGym, the AI models identified 17 new bugs including 15 previously unknown, or “zero-day,” ones. “Many of these vulnerabilities are critical,” says Dawn Song, a professor at UC Berkeley who led the work.

Many experts expect AI models to become formidable cybersecurity weapons. An AI tool from startup Xbow currently has crept up the ranks of HackerOne’s leaderboard for bug hunting and currently sits in top place. The company recently announced $75 million in new funding.

Song says that the coding skills of the latest AI models combined with improving reasoning abilities are starting to change the cybersecurity landscape. “This is a pivotal moment,” she says. “It actually exceeded our general expectations.”

As the models continue to improve they will automate the process of both discovering and exploiting security flaws. This could help companies keep their software safe but may also aid hackers in breaking into systems. “We didn’t even try that hard,” Song says. “If we ramped up on the budget, allowed the agents to run for longer, they could do even better.”

The UC Berkeley team tested conventional frontier AI models from OpenAI, Google, and Anthropic, as well as open source offerings from Meta, DeepSeek, and Alibaba combined with several agents for finding bugs, including OpenHands, Cybench, and EnIGMA.

The researchers used descriptions of known software vulnerabilities from the 188 software projects. They then fed the descriptions to the cybersecurity agents powered by frontier AI models to see if they could identify the same flaws for themselves by analyzing new codebases, running tests, and crafting proof-of-concept exploits. The team also asked the agents to hunt for new vulnerabilities in the codebases by themselves.

Through the process, the AI tools generated hundreds of proof-of-concept exploits, and of these exploits the researchers identified 15 previously unseen vulnerabilities and two vulnerabilities that had previously been disclosed and patched. The work adds to growing evidence that AI can automate the discovery of zero-day vulnerabilities, which are potentially dangerous (and valuable) because they may provide a way to hack live systems.

AI seems destined to become an important part of the cybersecurity industry nonetheless. Security expert Sean Heelan recently discovered a zero-day flaw in the widely used Linux kernel with help from OpenAI’s reasoning model o3. Last November, Google announced that it had discovered a previously unknown software vulnerability using AI through a program called Project Zero.

Like other parts of the software industry, many cybersecurity firms are enamored with the potential of AI. The new work indeed shows that AI can routinely find new flaws, but it also highlights remaining limitations with the technology. The AI systems were unable to find most flaws and were stumped by especially complex ones.

Share This Article
Facebook Twitter Copy Link
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Sony Announces PlayStation Plus Monthly Games for July 2025

News Room News Room 26 June 2025
FacebookLike
InstagramFollow
YoutubeSubscribe
TiktokFollow

Trending

I tested 12 Nintendo Switch 2 cases and these are the best

You might find it kind of sad to put a hard-earned gadget into a protective…

26 June 2025

The Trump Phone no longer promises it’s made in America

When the Trump Organization launched the Trump Mobile wireless carrier, it also launched a flagship…

26 June 2025

Resident Evil’s Raccoon City: A Complete History – From Sleepy Town to Zombie Nightmare

Survival horror is finally coming home. Raccoon City is the setting for the classic Resident…

26 June 2025
Apps

WhatsApp Introduces Meta AI-Powered Message Summaries to Catch Up on Unread Messages

WhatsApp on Wednesday rolled out a new artificial intelligence (AI)-backed feature which can summarise messages for you. Dubbed Message Summaries, it leverages Meta AI to help users catch up on…

News Room 26 June 2025

Your may also like!

Mobile

Upcoming Smartphones in July 2025: Samsung Galaxy Z Fold 7, Nothing Phone 3, OnePlus Nord 5 and More

News Room 26 June 2025
News

Snake Venom, Urine, and a Quest to Live Forever: Inside a Biohacking Conference Emboldened by MAHA

News Room 26 June 2025
News

Meta’s AI copyright win comes with a warning about fair use

News Room 26 June 2025
Gaming

Coke Zero 12-Pack, Loads of Doritos and New Game Preorders

News Room 26 June 2025

Our website stores cookies on your computer. They allow us to remember you and help personalize your experience with our site.

Read our privacy policy for more information.

Quick Links

  • Subscribe
  • Privacy Policy
  • Contact
  • Terms of Use
Advertise with us

Socials

Follow US
Welcome Back!

Sign in to your account

Lost your password?