Futures
Access hundreds of perpetual contracts
TradFi
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Pre-IPOs
Unlock full access to global stock IPOs
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Promotions
AI
Gate AI
Your all-in-one conversational AI partner
Gate AI Bot
Use Gate AI directly in your social App
GateClaw
Gate Blue Lobster, ready to go
Gate for AI Agent
AI infrastructure, Gate MCP, Skills, and CLI
Gate Skills Hub
10K+ Skills
From office tasks to trading, the all-in-one skill hub makes AI even more useful.
GateRouter
Smartly choose from 40+ AI models, with 0% extra fees
OpenRouter Launch Response Caching: Same requests with zero charges, latency reduced from seconds to milliseconds
CoinWorld News, OpenRouter has launched a response caching feature. Developers can enable it by adding x-openrouter-cache: true to the request header. The first call will be billed normally by the provider, and subsequent identical requests will directly return cached results without incurring token costs. After a cache hit, response times range from 80 to 300 milliseconds, with an average query time of 4 milliseconds. When not cached, Gemini 2.5 Flash averages about 1.3 seconds, Kimi K2.6 about 4.6 seconds, and GPT-5.5 approximately 9.1 seconds. This feature differs from the provider’s prompt caching; response caching completely bypasses the provider and returns the full response directly from OpenRouter’s edge cache. Text, images, audio, documents, and tool calls can all be cached, covering four endpoints. Cache isolation is based on API keys, with a default TTL of 5 minutes, configurable from 1 second to 24 hours.