OpenAI has launched EVMbench. It is a new device that means that you can measure the efficiency of synthetic intelligence brokers in a sequence of checks (benchmarks) by discovering, modifying, and leveraging good contracts on Ethereum. The corporate developed EVMbench in collaboration with Paradigm, an organization that funds cryptocurrency community tasks.
OpenAI AI bots on Ethereum can consider three essential points of the protocol.
starting, Precise vulnerability detected in Ethereum open supply Makes use of information from public audits. We then assess the chance of the vulnerability and audit the rewards for the remainder of the builders within the ecosystem.
Quantity 2, Recommend patches to repair these vulnerabilities with out compromising the operation of the protocol. “Brokers should modify weak contracts to eradicate exploitability whereas preserving meant performance. That is verified via automated testing and exploit checks,” OpenAI stated.
Third, Simulate assaults that exploit these flaws to exfiltrate funds Managed simulation setting (sandbox) Protected. Nevertheless, OpenAI clarifies that EVMbench “doesn’t signify the complete complexity of real-world good contract safety.”
EVMbench makes use of 120 real-world vulnerabilities extracted from 40 public competency audits, together with Code4rena. Efficiency checks carried out thus far present that the GPT-5.3-Codex agent mannequin achieved a 72.2% success fee in exploiting the flaw. The earlier mannequin's GPT-5 success fee was solely 31.9%.
Nevertheless, vulnerability detection and patching The outcomes from the mannequin are usually not very encouraging.
Efficiency is degraded for discovery and patching duties. Throughout the discovery section, the agent might cease after figuring out a single difficulty reasonably than completely auditing the codebase. Throughout the patching section, sustaining full performance whereas eliminating delicate vulnerabilities stays a problem.
OpenAI, a synthetic intelligence firm.
Why is EVMbench essential?
In response to the corporate, the significance of this agent audit device lies in the truth that good contracts routinely assure $100 billion in crypto property inside open supply protocols.
“As AI brokers enhance their abilities in studying, writing, and executing code, it turns into more and more essential to measure their capabilities in economically applicable environments and encourage the usage of AI methods defensively to audit and implement deployed contracts,” the corporate stated in a press release on the product presentation.
OpenAI brokers on Ethereum come at a time when autonomous brokers throughout the crypto asset ecosystem are quickly advancing. As reported by CriptoNoticias, these are already capable of work together with advanced environments such because the Lightning Community. On this second layerbrokers can handle liquidity channels and financial interactions with different AIs.
Coinbase has launched Agentic Pockets, a pockets that permits AI brokers to function on the Base community with out paying charges. And Phantom, Solana's hottest pockets, has activated an MCP server for AI brokers to handle balances and function autonomously.
In response to information from Token Terminal, the variety of weekly transactions on the Ethereum community reached 17.3 million. Explosion of transactions on the community Happens after the launch of ERC-8004 in August 2025. It’s a customary that permits “discovering, deciding on, and interacting with brokers throughout organizational boundaries with out the necessity for pre-existing belief” within the open agent financial system.
(Tag translation) Good contract

