First AI programmer index released: Cursor narrowly beats Codex to the top spot with Opus 4.7


CoinWorld News reports that an artificial intelligence analysis platform has released the first comprehensive benchmark index for coding agents. The index combines three test areas (code generation, terminal operations, and technical Q&A) to evaluate the real-world engineering performance of AI programmers. In the first round of evaluation, Cursor CLI paired with the Opus 4.7 model scored 61 points to take the top spot, beating OpenAI's Codex (paired with GPT-5.5) and Anthropic's Claude Code (also paired with Opus 4.7) by one point. Running the same Opus 4.7 model, Cursor CLI edged out the official Claude Code, but at the cost of a longer average task time (7.8 minutes vs. 5.8 minutes) and higher API costs per task ($1.47 vs. $1.24). The most cost-effective option was Cursor's built-in Composer 2, at just $0.07 per task. DeepSeek V4 Pro and Kimi K2.6 followed closely, though these Chinese models took noticeably longer to run.
