BTC $71,807
2026 Bull Run Is Building Start trading with 5% OFF all fees
Sign Up Now
BTC $71,807
Bull Run 2026 | 5% Off Fees Open your Binance account today
Sign Up

OpenAI Agents Better at Hacking Than Fixing Code

OpenAI launches EVMbench to test AI agents on smart contract security tasks.

  • OpenAI and Paradigm released EVMbench, a new tool to test AI agents on smart contract security tasks.
  • Research shows AI agents are significantly better at exploiting smart contract flaws than finding or fixing them, with GPT-5.3-Codex excelling.
  • The tool’s release follows a recent incident where an AI-generated bug cost Moonwell users nearly $2.7 million.

OpenAI and crypto venture firm Paradigm launched a new benchmarking tool on Wednesday that rigorously evaluates how AI agents handle smart contract security vulnerabilities. This release arrives just days after a costly bug in AI-generated code led to significant user losses.

- Advertisement -

The tool, called EVMbench, is built from 120 vulnerabilities identified in over 40 prior audits. Consequently, it provides a standardized way to measure AI performance on detection, patching, and exploitation tasks.

Results from the tool reveal a stark capability gap among current AI models. OpenAI’s latest model, GPT-5.3-Codex, more than doubled its predecessor’s effectiveness at exploiting flaws to drain funds.

However, its success in finding and fixing vulnerabilities “remain below full coverage,” according to the company’s news release. The agents sometimes stop after finding one issue or struggle to maintain functionality while patching.

In benchmark comparisons, Anthropic’s Claude Opus 4.6 scored highest for detecting vulnerabilities. Meanwhile, GPT-5.3-Codex achieved top results in both patching and exploiting smart contracts.

- Advertisement -

OpenAI cautioned that EVMbench has limitations due to its finite sample of vulnerabilities. The tool also cannot reliably determine if agent-found vulnerabilities are false positives.

Testing such tools is critical as smart contract hacks continue to plague the industry. According to data, protocols have suffered over $108 million in exploits so far in 2026.

✅ Follow BITNEWSBOT on Telegram, Facebook, LinkedIn, X.com, and Google News for instant updates.

Previous Articles:

- Advertisement -
Ad
Altseason Is Loading. Don't watch from the sidelines.
SOL $90.51
DOGE $0.0963
LINK $9.02
SUI $1.00
5% off fees when you sign up
Start Trading
Ad
Pay Less on Every Trade. For Life.
$10K/mo volume Save $60/yr
$50K/mo volume Save $300/yr
$100K/mo volume Save $600/yr
5% off all trading fees when you sign up
Claim Your Discount

Latest News

Lotus Wiper Targets Venezuela’s Energy Infrastructure

Lotus Wiper, a new data-destroying malware, has been used in targeted attacks against Venezuela's...

Sun Sues Trump-Linked Crypto Project

Tron founder Justin Sun is suing leadership at the World Liberty Financial project, accusing...

UK Sets 2026 Start for Crypto Licensing, Stresses Compliance

UK crypto firms must transition from Money Laundering Regulations registration to full Financial Services...

Bitcoin Hits $78K, Fueling $418M in Liquidations

Bitcoin surged to $78,000 on Wednesday, triggering over $418 million in leveraged trading liquidations.Altcoins...

Faraday Future Expands AI Amid Nasdaq Pressure

Faraday Future stock dropped over 10% premarket, surrendering part of an 86% rally from...

Must Read

Top 5 Best Crypto Faucets To Earn Free Crypto This Year

QUICK LINKSWhat Are Crypto Faucets and How Do They Work?How Do Crypto Faucets Make Money?What to Expect: Realistic EarningsThe Best Crypto Faucets of 2025:...
Ad
Altseason Is Loading. These 4 coins are trending right now.
SOL $92.12
DOGE $0.0950
LINK $9.02
SUI $1.02
5% off spot fees when you sign up
Start Trading