Open-source GLM-5.2 is ridiculously good value—its cost is less than one quarter of Opus 4.8, yet it’s only 90 points behind. After reading this review, I couldn’t help but think, “So worth it.”

View Original
CoinNetwork
AA-Briefcase发布:Claude Fable 5夺冠,GLM-5.2挤进前三
Artificial Analysis launches its first long-term knowledge work evaluation benchmark for large model intelligent agents, covering four scenarios: data science, product management, banking operations, and heavy industry strategy, with 91 tasks developed by experts from Google, McKinsey, and Boston Consulting. The results show that Claude Fable 5 took the top spot, followed by Opus 4.8, with GLM-5.2 ranking in the top three; however, under the all-correct standard for individual items, Fable 5's perfect rate is only 3%. The open-source GLM-5.2 has a composite score only 90 points lower than Opus 4.8, but costs less than a quarter of it.
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned