Futures
Access hundreds of perpetual contracts
CFD
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Pre-IPOs
Unlock full access to global stock IPOs
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Promotions
AI
Gate AI
Your all-in-one conversational AI partner
Gate AI Bot
Use Gate AI directly in your social App
GateClaw
Gate Blue Lobster, ready to go
Gate for AI Agent
AI infrastructure, Gate MCP, Skills, and CLI
Gate Skills Hub
10K+ Skills
From office tasks to trading, the all-in-one skill hub makes AI even more useful.
GateRouter
Smartly choose from 40+ AI models, with 0% extra fees
Google Gemini API Sky-High "Ghost Billing" Vulnerability: Deleting Cache Still Charges, Zero Output Also Billed
According to “Beating Monitoring,” recently, multiple emergency requests for help regarding the Gemini API billing system going out of control were reported on the Google AI Developers Forum. During normal use, multiple developers faced massive abnormal deductions due to vulnerabilities in the underlying system. For example, one person was charged nearly 27,000 yuan RMB in just 12 hours. At present, Google’s billing team and technical team are still trading blame for this matter, and no official fix statement or quick refund channel has been released.
After investigation, the two main core bugs that led to developers receiving sky-high bills are as follows: First is the “ghost cache” vulnerability. When the context cache created via the API expires or is deleted, the front-end management list is emptied, but Google’s backend billing continues to “spin idle,” deducting fees at a rate of thousands of yuan per hour. Second is the “thinking dead loop” trap. When tools such as enabled network search are turned on, the model’s “thinking budget limit” fails, causing the model to get stuck in infinite reasoning while handling simple tasks. After burning through up to 64,000 tokens, it times out and crashes. Even if the final result is “zero output” (no useful answers are returned), Google still charges the full amount, and the thinking cost skyrockets by 1500 times.
Because Google Cloud’s billing system has a severe delay of 32 to 72 hours and lacks a quota-based automatic circuit-breaker mechanism, developers had already been charged large sums before they received any alerts. Due to official customer service deflecting responsibility and no one giving a direct response in the forum, some affected developers have announced that, to avoid financial risk, they will completely abandon Gemini’s context cache and reasoning models in production environments.