OpenAI pits AI agents against each other in Red Team sharp contracts

Featured in:
abcd

OpenAI has launched a recent benchmark that assesses how well various AI models detect, patch, and even exploit vulnerabilities discovered in crypto sharp contracts.

OpenAI released on Wednesday, working with investment firm Paradigm and cryptocurrency security firm OtterSec to assess how much artificial intelligence agents could theoretically exploit of 120 sharp contract vulnerabilities.

sadasda

Anthropic’s Claude Opus 4.6 won with an average “detection reward” of $37,824, followed by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Pro with $31,623 and $25,112, respectively.

Detect rewards earned by AI agents. Source: OpenAI

As AI agents become more and more productive at performing basic tasks, OpenAI he said it is becoming increasingly vital to evaluate their performance in “economically significant environments.”

“Smart contracts secure billions of dollars in assets, and AI agents are likely to be transformative for both attackers and defenders.”

“We expect to see an increase in agent payments in stablecoins and help establish them in a domain of new practical importance,” OpenAI added.

Circle CEO Jeremy Allaire predicted on January 22 that within five years, billions of AI agents will be transacting using stablecoins for everyday payments on behalf of users, while former Binance CEO Changpeng “CZ” Zhao also recently tipped that cryptocurrencies will become the “native currency of AI agents.”

The need to test AI agent performance to detect vulnerabilities comes as attackers stole $3.4 billion in crypto assets in 2025, a marginal augment from 2024.

Related: China’s AI leader will shape the future of cryptocurrencies

EVMbench exploited 120 selected vulnerabilities from 40 sharp contract audits, with most of them coming from open source auditing competitions. OpenAI said it hopes the benchmark will assist track AI’s progress in detecting and mitigating sharp contract vulnerabilities at scale.

Smart contracts weren’t made for humans: Dragonfly

Dragonfly managing partner Haseeb Qureshi wrote in a post to X on Wednesday he said Cryptocurrency’s promise to replace property rights and legal contracts has never materialized, not because the technology failed, but because it was never designed with human intuition in mind.

Qureshi said signing vast deals still seems “scary,” especially since wallet drains and other risks are always present, while bank transfers rarely pose the same concern.

Instead, Qureshi believes that the future of cryptocurrency transactions will be facilitated by autonomous AI-powered wallets that will take care of these threats and manage intricate operations on behalf of users:

“Technology often gets implemented when a complement finally comes along. GPS had to wait for the smartphone, TCP/IP had to wait for the browser. In the case of cryptocurrencies, we could simply find it in AI agents.”

Warehouse: IronClaw competes with OpenClaw, Olas launches bots for Polymarket – AI Eye

Cointelegraph is committed to independent and see-through journalism. This news article has been produced in accordance with Cointelegraph’s Editorial Policy and is intended to provide true and up-to-date information. Readers are encouraged to verify the information themselves. Read our Editorial Policy https://cointelegraph.com/editorial-policy
abcd
sadasda

Find us on

Latest articles

Related articles

See more articles

Ethereum price is facing resistance and breakout hopes are...

Ethereum price found support near $1,922 and recovered some of the losses. ETH is currently consolidating and...

XRP funding level drops to extremely negative levels, what...

XRP derivatives markets include: still showing signs of bearish pressure, with funding rates on...

SOL’s path of least resistance is trending toward $50,...

SOL's price looks bearish on multiple charts, prompting analysts to set a short-term target of $50. Will...

The cup and handle pattern puts the XRP price...

Cryptocurrency analyst CryptoBull highlighted the bullish pattern it could send XRP price up to...

SocGen’s FORGE extends euro stablecoin to XRP Ledger in...

The digital assets arm of French banking group Societe Generale, SG-FORGE, has launched its euro-denominated stablecoin, EUR...

Bitcoin could benefit if AI job losses cause bank...

Arthur Hayes created a raw market warning: Sees the growing divide between his preferred risk measure, Bitcoin,...