Price Per TokenPrice Per Token
Google
Google
vs
Qwen
Qwen

Gemini 2.5 Flash vs Qwen3 Max

A detailed comparison of pricing, benchmarks, and capabilities

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

114 out of our 303 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

Gemini 2.5 Flash wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Larger context window
  • Faster response time
  • Supports vision
  • Has reasoning mode

Qwen3 Max wins:

  • Higher intelligence benchmark
  • Better at coding
  • Better at math
Price Advantage
Gemini 2.5 Flash
Benchmark Advantage
Qwen3 Max
Context Window
Gemini 2.5 Flash
Speed
Gemini 2.5 Flash

Pricing Comparison

Price Comparison

MetricGemini 2.5 FlashQwen3 MaxWinner
Input (per 1M tokens)$0.30$1.20 Gemini 2.5 Flash
Output (per 1M tokens)$2.50$6.00 Gemini 2.5 Flash
Cache Read (per 1M)$0.03$0.24 Gemini 2.5 Flash
Cache Write (per 1M)$0.08N/A Gemini 2.5 Flash
Using a 3:1 input/output ratio, Gemini 2.5 Flash is 65% cheaper overall.

Gemini 2.5 Flash Providers

Vercel $0.30 (Cheapest)
Google AI Studio $0.30 (Cheapest)
Google $0.30 (Cheapest)

Qwen3 Max Providers

Alibaba $1.20 (Cheapest)

Benchmark Comparison

8
Benchmarks Compared
0
Gemini 2.5 Flash Wins
6
Qwen3 Max Wins

Benchmark Scores

BenchmarkGemini 2.5 FlashQwen3 MaxWinner
Intelligence Index
Overall intelligence score
21.131.4
Coding Index
Code generation & understanding
17.826.4
Math Index
Mathematical reasoning
60.380.7
MMLU Pro
Academic knowledge
80.984.1
GPQA
Graduate-level science
68.376.4
LiveCodeBench
Competitive programming
49.576.7
Aider
Real-world code editing
55.1--
AIME
Competition math
50.0--
Qwen3 Max significantly outperforms in coding benchmarks.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
Gemini 2.5 Flash
Other models

Context & Performance

Context Window

Gemini 2.5 Flash
1,048,576
tokens
Max output: 65,535 tokens
Qwen3 Max
262,144
tokens
Max output: 32,768 tokens
Gemini 2.5 Flash has a 75% larger context window.

Speed Performance

MetricGemini 2.5 FlashQwen3 MaxWinner
Tokens/second246.2 tok/s27.3 tok/s
Time to First Token0.47s2.29s
Gemini 2.5 Flash responds 801% faster on average.

Capabilities

Feature Comparison

FeatureGemini 2.5 FlashQwen3 Max
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyGemini 2.5 FlashQwen3 Max
LicenseProprietaryOpen Source
AuthorGoogleQwen
ReleasedJun 2025Sep 2025

Gemini 2.5 Flash Modalities

Input
fileimagetextaudiovideo
Output
text

Qwen3 Max Modalities

Input
text
Output
text

Related Comparisons

Compare Gemini 2.5 Flash with:

Compare Qwen3 Max with:

Frequently Asked Questions

Gemini 2.5 Flash has cheaper input pricing at $0.30/M tokens. Gemini 2.5 Flash has cheaper output pricing at $2.50/M tokens.
Qwen3 Max scores higher on coding benchmarks with a score of 26.4, compared to Gemini 2.5 Flash's score of 17.8.
Gemini 2.5 Flash has a 1,048,576 token context window, while Qwen3 Max has a 262,144 token context window.
Gemini 2.5 Flash supports vision. Qwen3 Max does not support vision.