OpenAI AI Benchmark Tests Crypto Contract Exploit Skills

New AI benchmark tests smart contract security; Claude leads as experts foresee AI-driven crypto future.

  • OpenAI, with Paradigm and OtterSec, launched a new benchmark to test AI agents on smart contract security vulnerabilities.
  • Anthropic’s Claude Opus 4.6 outperformed competitors, finding the most potential value in exploitable flaws.
  • The benchmark arrives as crypto thefts grow, and top executives foresee AI agents driving future crypto transactions and security.
  • Crypto venture capitalist Haseeb Qureshi argues smart contracts need AI intermediaries, or “self-driving wallets,” to achieve mainstream use.

On Wednesday, OpenAI unveiled a major new benchmark evaluating AI models on their ability to find and exploit security weaknesses in crypto smart contracts. The research, conducted in collaboration with Paradigm and OtterSec and released in a new paper, tested AI agents against 120 curated vulnerabilities to measure their performance in an economically critical domain.

- Advertisement -

Anthropic’s Claude Opus 4.6 led the pack with an average “detect award” of $37,824. OpenAI’s own model and Google’s Gemini 3 Pro followed with $31,623 and $25,112, respectively, according to the published data. OpenAI stated it’s vital to test AI in meaningful environments, noting smart contracts secure billions and AI will transform both attack and defense.

Consequently, the need for such tools is underscored by the $3.4 billion in crypto stolen by attackers just last year. This benchmark aims to track AI progress in mitigating these costly vulnerabilities at scale, a growing priority for the ecosystem.

Meanwhile, industry leaders are predicting a future where AI agents dominate crypto transactions. Circle CEO Jeremy Allaire recently forecast billions of AI agents using stablecoins within five years, a sentiment echoed by former Binance chief Changpeng Zhao.

However, a core challenge remains user experience and security. Dragonfly’s Haseeb Qureshi argued on X that smart contracts were never designed for human intuition, making large transactions feel “terrifying” compared to traditional bank transfers. He proposed the solution is AI-intermediated, self-driving wallets that manage complex operations securely.

- Advertisement -

✅ Follow BITNEWSBOT on Telegram, Facebook, LinkedIn, X.com, and Google News for instant updates.

Previous Articles:

- Advertisement -

Latest News

Tesla: FSD Cars Drive 5.3M Miles Per Major Crash

EV giant Tesla Inc. announced its Full Self-Driving (FSD) technology has surpassed 8 billion...

OpenAI Agents Better at Hacking Than Fixing Code

OpenAI and Paradigm released EVMbench, a new tool to test AI agents on smart...

Downturn Trims Hype, Sharpens Builder Focus at ETH Denver

ETH Denver founder John Paller asserts that the current market downturn has filtered out...

Goldman Sachs CEO David Solomon Holds Bitcoin

Goldman Sachs CEO David Solomon has revealed he personally owns a "very, very limited"...

Bitcoin Holder Stress Hits 2018 Lows, Hinting at Bottom

Bitcoin’s “short-term holder stress” metric has reached its lowest level since the 2018 bear...

Must Read

12 Hosting Providers To Buy VPS With Bitcoin: An Expert Guide for 2026

You need a VPS. You want to pay with Bitcoin. Simple enough, right?Not quite. The market for crypto VPS = VPS hosting that accepts...
🔥 #AD Get 20% OFF any new 12 month hosting plan from Hostinger. Click here!