Wu Says reports that Coinbase CEO Brian Armstrong said the company has cut its enterprise AI spending by nearly half through infrastructure optimization, even as AI token usage continues to grow exponentially. The main cost-cutting measures are: making open-source models such as GLM 5.2 and Kimi 2.7 the defaults in the internal LLM gateway, routing each task to the most cost-effective model, and raising the cache hit rate of tools like LibreChat from 5% to 60%. Armstrong emphasized that the goal of managing AI costs is not to limit usage but to make exponential growth sustainable by cutting waste.
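
For readers wondering what "routing to the most cost-effective model" and a "cache hit rate" might look like in practice, here is a minimal Python sketch. It is not Coinbase's implementation: the report gives no technical details, so the `Model` and `Gateway` classes, the capability tiers, the prices, and the exact-match response cache are all assumptions made for illustration. The report also does not say what kind of cache is involved; a production gateway may well rely on provider-side prompt caching instead.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass(frozen=True)
class Model:
    """Hypothetical model entry; names, prices, and tiers are made up."""
    name: str
    cost_per_1k_tokens: float  # illustrative price, not a real quote
    capability: int            # illustrative quality tier (higher = stronger)


@dataclass
class Gateway:
    """Toy LLM gateway: cheapest-adequate routing plus an exact-match cache."""
    models: list[Model]
    cache: dict[str, str] = field(default_factory=dict)
    hits: int = 0
    requests: int = 0

    def route(self, required_capability: int) -> Model:
        # Keep only models strong enough for the task, then take the cheapest.
        candidates = [m for m in self.models if m.capability >= required_capability]
        if not candidates:
            raise ValueError("no model meets the required capability")
        return min(candidates, key=lambda m: m.cost_per_1k_tokens)

    def complete(
        self,
        prompt: str,
        required_capability: int,
        call_model: Callable[[Model, str], str],
    ) -> str:
        # call_model is a caller-supplied function that actually queries a model.
        self.requests += 1
        key = f"{required_capability}:{prompt}"
        if key in self.cache:
            # Cache hit: no model call, so no token cost for this request.
            self.hits += 1
            return self.cache[key]
        answer = call_model(self.route(required_capability), prompt)
        self.cache[key] = answer
        return answer

    def hit_rate(self) -> float:
        # Fraction of requests served from the cache.
        return self.hits / self.requests if self.requests else 0.0
```

In this sketch, every cache hit avoids a model call entirely, which is why a higher hit rate lowers spending without limiting how often the tools are used.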

Comment
MechanicalHummingbird
· 5h ago
With GLM 5.2 as the default, this wave of domestic models going overseas has been validated, right?
ElevatorMeme
· 11h ago
Exponential growth + halved costs, Coinbase's AI infrastructure team can start selling courses.
GateUser-2bbf8435
· 11h ago
Brian's thinking is truly unique: instead of cutting budgets, he cuts waste. AI usage doubled, yet costs went down. This is real cost reduction and efficiency improvement.
GateUser-ffe7bee5
· 12h ago
From 5% to 60%: this cache optimization probably comes down to mastering prompt engineering.
GateUser-870b5e71
· 12h ago
Is there an open-source solution for LibreChat caching? I want to copy it.
FragmentedSilverStarMap
· 12h ago
A cache hit rate that climbed from 5% to 60%: this optimization makes me envious. Internally, we're still stuck at 20%.
SunshineCollector
· 12h ago
Open-source models plus intelligent routing is indeed a strong move. With GLM and Kimi set directly as the defaults, the money saved can pay for a few more rounds of training.