
OpenAI and Paradigm Introduce EVMbench Ethereum Security Tool

EVMbench tests AI agents on detecting, patching, and exploiting Ethereum contract flaws.

  • OpenAI and Paradigm have launched EVMbench, a new tool to test AI agents on real Ethereum smart contract vulnerabilities.
  • The benchmark evaluates three critical modes: detecting, patching, and exploiting high-severity security flaws.
  • In exploit tests, the latest GPT-5.3-Codex scored 72.2%, more than doubling the performance of its predecessor.
  • The tool uses 120 curated vulnerabilities from 40 audits, many sourced from open competitions like Code4rena, plus scenarios from the security process for Tempo.

On February 18, 2026, OpenAI, in collaboration with investment firm Paradigm, introduced EVMbench, a benchmark designed to evaluate AI agents on Ethereum smart contract security. The Ethereum Virtual Machine (EVM) benchmark focuses on high-severity vulnerabilities at a time when smart contract deployment is reaching record highs. It draws on 120 real-world vulnerabilities curated from 40 audits, including scenarios from the security process for Tempo.

Stripe launched the public testnet for Tempo, its purpose-built blockchain, in December with input from Visa, Shopify, and OpenAI. Including Tempo scenarios is meant to ground testing in economically meaningful code, especially as AI-driven stablecoin payments expand. EVMbench tests agents across three distinct modes: detect, patch, and exploit. In exploit mode, agents attempt end-to-end fund-draining attacks within a sandboxed environment.

According to an OpenAI blog post, GPT-5.3-Codex achieved a 72.2% success rate in exploit mode. That is more than double the 31.9% scored by GPT-5, released six months earlier. Performance was weaker in the detect and patch tasks, however, where agents sometimes failed to audit exhaustively.

Researchers cautioned that EVMbench does not fully capture real-world security complexity. Meanwhile, the weekly number of smart contracts deployed on Ethereum reached 669,500, according to Token Terminal. Measuring AI performance in such environments is now critical as models become powerful tools for both attackers and defenders.

