Futures
Access hundreds of perpetual contracts
CFD
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
IPO Access
Unlock full access to global stock IPOs
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Promotions
AI
Gate AI
Your all-in-one conversational AI partner
Gate AI Bot
Use Gate AI directly in your social App
GateClaw
Gate Blue Lobster, ready to go
Gate for AI Agent
AI infrastructure, Gate MCP, Skills, and CLI
Gate Skills Hub
10K+ Skills
From office tasks to trading, the all-in-one skill hub makes AI even more useful.
GateRouter
Smartly choose from 40+ AI models, with 0% extra fees
Does Anthropic's Claude Fable 5 include distillation detection features that can block Chinese open-source models?
Anthropic Adds Distillation Detection in Claude Fable 5, Third Parties Attempting to Extract Model Capabilities Are Automatically Downgraded to Opus 4.8, Effectively Embedding a "Prohibition of Distillation" into the Model Itself.
(Background: Anthropic Accuses Chinese AI Labs like DeepSeek of Stealing Claude, Using 24k Fake Accounts to Flood with 16 Million Q&A Pairs)
(Additional Context: Anthropic: Leading US AI Models Are Necessary to Protect Democracy; Proposes Criminalizing Distillation Attacks)
Table of Contents
Toggle
Anthropic's Claude Fable 5 was officially released this morning (10th), marking Anthropic's first publicly available Mythos-level model. The SWE-Bench Pro score is 80.3%, compared to Opus 4.8's 69.2%. Pricing is $10 per million tokens for input and $50 for output, roughly double that of Opus 4.8.
Beyond the model's capabilities, a key discussion point is the protective mechanism included: Anthropic has embedded a "prohibition of distillation" directly into the model. However, the symbolic significance of this move may far exceed its actual effectiveness.
From Legal Threats to Technical Censorship
Perhaps you remember that in February this year, Anthropic publicly accused DeepSeek, Moonshot AI, and MiniMax of using about 24k fake accounts to make over 16 million queries to Claude, systematically extracting outputs for training their own models. OpenAI also lobbied US lawmakers to legislate restrictions.
Further reading: What Is AI Model Distillation? How DeepSeek Spent $6 Million to Learn the Skills of 100 Million
Four months later, Fable 5's approach differs: it uses an AI classifier to identify three categories of high-risk requests—security, biological and chemical weapons, and distillation. When such requests are detected, the system downgrades responses to Opus 4.8. For prompt rewriting, steering vectors (techniques to externally manipulate model output direction), and PEFT parameter-efficient fine-tuning methods, Anthropic states Fable 5 will actively reduce their effectiveness.
This shift from "Want to Sue You" to "Make You Unable to Access" is a strategic upgrade. But the problem is, Anthropic itself admits that over 95% of conversations are unaffected. The protective mechanism covers only very narrow scenarios; while interception of malicious security tasks is 100% successful, the boundary for "distillation behavior" remains blurry—legitimate distillation and unauthorized distillation are nearly indistinguishable in technical operation.
What Is Being Blocked When Preventing Distillation?
Returning to the February accusation. Machine learning researcher Nathan Lambert later analyzed the actual numbers: DeepSeek's queries totaled about 150k, targeting reasoning and reward models; Moonshot about 3.4 million, MiniMax about 13 million, with the combined post-training data amounting to roughly 150 to 400 billion tokens.
Lambert judged that even in GPU-constrained environments, Chinese labs have solid reinforcement learning infrastructure; the real competitive advantage lies in "correctly scaling the generation of synthetic data." In plain terms, enabling models to learn through repeated trial-and-error and reward feedback, rather than relying solely on pre-existing answers.
There is also a fundamental contradiction: as long as Anthropic continues selling APIs, distillation cannot be completely blocked. Open API access is Anthropic's business model, and distillation is a natural byproduct of "open API." The coverage of this protective measure is only 5%, leaving 95% of conversations still flowing freely.
The True Boundaries of Technical Censorship
Lambert straightforwardly states: "Blocking distillation is much more difficult than restricting physical goods like GPU shipments."
From this perspective, Fable 5's protective mechanism has two implications: it signals to the industry that Anthropic believes technical leaks have reached a level requiring embedding into the model itself; and it serves as a disruption to Chinese open-source labs, though it is hardly a barrier. Even if Fable 5's distillation defenses are fully effective, Chinese labs can still rely on open-source models from Google, Meta, their own RL infrastructure, and synthetic data pipelines.
However, the move from legal to technical measures by Anthropic remains symbolically significant: it demonstrates that "technical censorship" is becoming a new tool in AI geopolitical strategy.