Former Byte engineer: The AI gap between China and the US is widening, with the lack of distillation shortcuts and feedback flywheels being the main reasons.

ME News message: On April 24 (UTC+8), according to Beating monitoring, Zhang Chi made a direct assessment of the AI gap between China and the United States in the same interview: “I even don’t agree with the claim that China is catching up. I think we are still far behind. The gap is widening, which is very regrettable.” His colleagues and students around him generally agree, but he also admits that the leadership of listed companies such as Zhipu and MiniMax would not agree with this assessment. He attributes the reasons to three areas. First, “distillation shortcuts”: he believes that many Chinese companies directly use the outputs of Claude, GPT, or Gemini as training data. “Claude recently said it detected a large number of distillation attempts, and I guess that’s how some companies take shortcuts.” However, he also acknowledges that DeepSeek has shown genuine architectural innovation in V3 and R1. Second, the missing user feedback flywheel: U.S. models are useful, so they have more users, and user feedback then makes the models better; Chinese models did not start out well, have fewer users, and can’t obtain data, creating a vicious cycle. Third, the infrastructure gap: when he interned at Google, he felt the infrastructure was “so good, the code ran extremely smoothly,” and the gap with ByteDance was huge. (Source: BlockBeats)
DEEPSEEK-3.76%
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned