Brian Armstrong: Coinbase's AI spending has nearly halved, while token usage continues to grow.

robot
Abstract generation in progress
Wu said it learned that Coinbase CEO Brian Armstrong shared practical experience in keeping AI spending stable amid exponential growth in token usage. Armstrong stated that the company did not adopt restrictive measures like setting usage caps, but instead achieved cost reduction and efficiency improvement by optimizing default models, intelligent routing, and caching strategies: in terms of default models, introducing open-weight models such as GLM 5.2 and Kimi 2.7 to replace expensive general-purpose models; matching models based on task requirements through a routing mechanism; using cache pre-processing and task session management to reduce token waste (cache hit rate increased from 5% to 60%). Thanks to this series of optimizations, Coinbase's AI spending has nearly halved, while token usage continues to grow.
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • 5
  • 1
  • Share
Comment
Add a comment
Add a comment
GateUser-d2b4d9c6
· 2h ago
Unlimited + Smart Routing, this approach is worth copying.
View OriginalReply0
HotAirBalloonCrossingMountains
· 2h ago
Token rises, spending falls, Armstrong's move is textbook.
View OriginalReply0
GateUser-673fb6fa
· 2h ago
GLM 5.2's cost-effectiveness is indeed impressive, and we are also cutting in.
View OriginalReply0
BerryColdWallet
· 2h ago
Cache hit rate from 5% to 60% is crazy, this is real cost reduction.
View OriginalReply0
GateUser-94818fd0
· 2h ago
Open-weight models are now really good, closed-source big companies are under pressure.
View OriginalReply0
  • Pinned