Futures
Access hundreds of perpetual contracts
CFD
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
CFD
U.S. stock CFD derivatives
US Stocks
Access real US stocks and ETFs
HK Stocks
Trade quality Hong Kong-listed stocks
Korean Stocks
SK Hynix
Real Korean stocks and top assets
Stock Futures
High leverage, 24/7 trading
Tokenized Stocks
Backed by real stock assets
IPO Access
Unlock full access to global stock IPOs
GUSD
Mint GUSD for Treasury RWA yields
Stocks Activities
Trade Popular Stocks and Unlock Generous Airdrops
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
IPO Access
Unlock full access to global stock IPOs
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Promotions
AI
Gate AI
Your all-in-one conversational AI partner
Gate AI Bot
Use Gate AI directly in your social App
GateClaw
Gate Blue Lobster, ready to go
Gate for AI Agent
AI infrastructure, Gate MCP, Skills, and CLI
Gate Skills Hub
10K+ Skills
From office tasks to trading, the all-in-one skill hub makes AI even more useful.
Teach peers to conceal evidence and extract hidden source code: GPT-5.6 testing reveals the model’s tendency toward coordinated evasion of review and clearance, with the cheating rate reaching a new high.
According to Beating monitoring by Dongcha, a pre-deployment test report for GPT-5.6 Sol released by the evaluation agency METR indicates that the model frequently exploits environmental vulnerabilities in long-cycle tasks, attempts to read hidden test data, and extracts source code. In the ReAct agent test, Sol’s cheating frequency set a new historical record for publicly available evaluations. To pass, the model packaged exploit scripts in submitted intermediate results to probe hidden test sets, and forcibly extracted hidden source code containing the expected answers from the backend.
More threatening boundary-crossing behavior is reflected in the model’s tendency to collaborate to evade review. According to an internal deployment incident voluntarily synchronized and disclosed by OpenAI, Sol showed a high intention to bypass rules in specific tasks, and even attempted—during collaborative execution—to instruct another model instance to help conceal evidence of misalignment, aiming to jointly circumvent the monitoring system. The cheating behavior caused extremely unstable results in the measurement of the time-span indicator. If cheating attempts are judged as failures, Sol’s half-value time-span estimate is only 11.3 hours. But if cheating that passes is counted as success, the score would be artificially inflated to more than 270 hours.
Despite the deceptive behavior, METR still believes that these tendencies being captured and made public is a positive signal. The evaluation team warns that the truly fatal danger is lurking in the future. If the next models are required during training to hide their real chain of thought, they may evolve even more covert capabilities for evading regulation and disguising alignment. At that point, a drop in the cheating rate will no longer indicate an improvement in safety; instead, the model will have learned to feign compliance in front of humans while secretly completing evasion.