Price Per TokenPrice Per Token
Google
Google
vs
Qwen
Qwen

Gemini 2.5 Flash vs Qwen3 8B

A detailed comparison of pricing, benchmarks, and capabilities

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

114 out of our 303 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

Gemini 2.5 Flash wins:

  • Larger context window
  • Faster response time
  • Higher intelligence benchmark
  • Better at coding
  • Better at math
  • Supports vision
  • Has reasoning mode

Qwen3 8B wins:

  • Cheaper input tokens
  • Cheaper output tokens
Price Advantage
Qwen3 8B
Benchmark Advantage
Gemini 2.5 Flash
Context Window
Gemini 2.5 Flash
Speed
Gemini 2.5 Flash

Pricing Comparison

Price Comparison

MetricGemini 2.5 FlashQwen3 8BWinner
Input (per 1M tokens)$0.30$0.05 Qwen3 8B
Output (per 1M tokens)$2.50$0.40 Qwen3 8B
Cache Read (per 1M)$0.03$0.05 Gemini 2.5 Flash
Cache Write (per 1M)$0.08N/A Gemini 2.5 Flash
Using a 3:1 input/output ratio, Qwen3 8B is 84% cheaper overall.

Gemini 2.5 Flash Providers

Vercel $0.30 (Cheapest)
Google AI Studio $0.30 (Cheapest)
Google $0.30 (Cheapest)

Qwen3 8B Providers

AtlasCloud $0.05 (Cheapest)
Alibaba $0.12
Fireworks $0.20

Benchmark Comparison

8
Benchmarks Compared
7
Gemini 2.5 Flash Wins
0
Qwen3 8B Wins

Benchmark Scores

BenchmarkGemini 2.5 FlashQwen3 8BWinner
Intelligence Index
Overall intelligence score
21.113.2
Coding Index
Code generation & understanding
17.87.1
Math Index
Mathematical reasoning
60.324.3
MMLU Pro
Academic knowledge
80.964.3
GPQA
Graduate-level science
68.345.2
LiveCodeBench
Competitive programming
49.520.2
Aider
Real-world code editing
55.1--
AIME
Competition math
50.024.3
Gemini 2.5 Flash significantly outperforms in coding benchmarks.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
Gemini 2.5 Flash
Other models

Context & Performance

Context Window

Gemini 2.5 Flash
1,048,576
tokens
Max output: 65,535 tokens
Qwen3 8B
40,960
tokens
Max output: 8,192 tokens
Gemini 2.5 Flash has a 96% larger context window.

Speed Performance

MetricGemini 2.5 FlashQwen3 8BWinner
Tokens/second246.2 tok/s50.8 tok/s
Time to First Token0.47s0.99s
Gemini 2.5 Flash responds 385% faster on average.

Capabilities

Feature Comparison

FeatureGemini 2.5 FlashQwen3 8B
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyGemini 2.5 FlashQwen3 8B
LicenseProprietaryOpen Source
AuthorGoogleQwen
ReleasedJun 2025Apr 2025

Gemini 2.5 Flash Modalities

Input
fileimagetextaudiovideo
Output
text

Qwen3 8B Modalities

Input
text
Output
text

Related Comparisons

Compare Gemini 2.5 Flash with:

Compare Qwen3 8B with:

Frequently Asked Questions

Qwen3 8B has cheaper input pricing at $0.05/M tokens. Qwen3 8B has cheaper output pricing at $0.40/M tokens.
Gemini 2.5 Flash scores higher on coding benchmarks with a score of 17.8, compared to Qwen3 8B's score of 7.1.
Gemini 2.5 Flash has a 1,048,576 token context window, while Qwen3 8B has a 40,960 token context window.
Gemini 2.5 Flash supports vision. Qwen3 8B does not support vision.