20 times cheaper performance with only an 8% decrease! Gemini 3.2 Flash I/O release next week, directly competing with GPT-5.5

robot
Abstract generation in progress

AIMPACT News, May 14 (UTC+8), according to Beating Monitoring, Google plans to release the next-generation lightweight model Gemini 3.2 Flash at the I/O conference on May 20. The overall performance of the model is roughly on par with GPT-5.5, but it is clearly not as good as Anthropic’s Mythos. Abacus.AI CEO Bindu Reddy revealed rumors that Gemini 3.2 Flash has achieved 92% of GPT-5.5 in coding and reasoning tasks, but its inference cost is only one fifteenth to one twentieth of the latter, with most query latencies below 200 milliseconds. She believes Google’s distillation and sparsification techniques are playing a significant role, essentially compressing a cutting-edge model into a Flash-level, without the usual performance cliff. Gemini 3.2 Flash had previously shown signs of leakage. In early May, traces of the model were found in iOS application build packages and AI Studio metadata, and it later appeared anonymously in evaluations on LM Arena. Early testers reported that the model performs outstandingly in creative coding tasks, even surpassing Gemini 3.1 Pro on some benchmarks. (Source: BlockBeats)

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned