Gemini 3.1 Flash-Lite officially released: the input price is only one quarter of Claude 4.5 Haiku, and GPQA is nearly 14 percentage points higher

Crypto News, Google Gemini 3.1 Flash-Lite officially released, becoming the cheapest and fastest model in the Gemini 3 series, now in high-concurrency production environment. The model supports four levels of inference strength control (minimal, low, medium, high), allowing users to adjust speed and quality based on scenarios. Pricing remains at preview levels: $0.25 per million tokens for input, $1.50 per million tokens for output, with input price at one-quarter of Claude 4.5 Haiku, and output price less than one-third. In terms of performance, GPQA Diamond score is 86.9%, surpassing Claude 4.5 Haiku’s 73.0% and GPT-5 Mini’s 82.3%, with MMMU-PRO scoring 76.8%. Output speed is 363 tokens/sec, 45% faster than 2.5 Flash.

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pin