Wu learned that Coinbase CEO Brian Armstrong stated that the company has nearly halved its enterprise AI spending by optimizing infrastructure, while AI token usage continues to grow exponentially. Key cost reduction measures include: adopting open-source models such as GLM 5.2 and Kimi 2.7 as the default options for the internal LLM gateway, using intelligent routing to match tasks with the most cost-effective models, and significantly increasing the cache hit rate of tools like LibreChat from 5% to 60%. Armstrong emphasized that the goal of managing AI costs is not to limit usage, but to make exponential growth sustainable by reducing waste.

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned