Price Per TokenPrice Per Token
Google
Google
vs
Z-ai

Gemini 2.5 Flash vs GLM 4.5

A detailed comparison of pricing, benchmarks, and capabilities

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

114 out of our 303 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

Gemini 2.5 Flash wins:

  • Cheaper input tokens
  • Larger context window
  • Faster response time
  • Higher intelligence benchmark
  • Better at coding
  • Better at math
  • Supports vision
  • Has reasoning mode

GLM 4.5 wins:

  • Cheaper output tokens
Price Advantage
Gemini 2.5 Flash
Benchmark Advantage
Gemini 2.5 Flash
Context Window
Gemini 2.5 Flash
Speed
Gemini 2.5 Flash

Pricing Comparison

Price Comparison

MetricGemini 2.5 FlashGLM 4.5Winner
Input (per 1M tokens)$0.30$0.55 Gemini 2.5 Flash
Output (per 1M tokens)$2.50$2.00 GLM 4.5
Cache Read (per 1M)$0.03N/A Gemini 2.5 Flash
Cache Write (per 1M)$0.08N/A Gemini 2.5 Flash
Using a 3:1 input/output ratio, Gemini 2.5 Flash is 7% cheaper overall.

Gemini 2.5 Flash Providers

Vercel $0.30 (Cheapest)
Google AI Studio $0.30 (Cheapest)
Google $0.30 (Cheapest)

GLM 4.5 Providers

WandB $0.55 (Cheapest)
Z.AI $0.60
Nebius $0.60
Novita $0.60

Benchmark Comparison

8
Benchmarks Compared
0
Gemini 2.5 Flash Wins
0
GLM 4.5 Wins

Benchmark Scores

BenchmarkGemini 2.5 FlashGLM 4.5Winner
Intelligence Index
Overall intelligence score
21.1--
Coding Index
Code generation & understanding
17.8--
Math Index
Mathematical reasoning
60.3--
MMLU Pro
Academic knowledge
80.9--
GPQA
Graduate-level science
68.3--
LiveCodeBench
Competitive programming
49.5--
Aider
Real-world code editing
55.1--
AIME
Competition math
50.0--
Gemini 2.5 Flash significantly outperforms in coding benchmarks.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
Gemini 2.5 Flash
Other models

Context & Performance

Context Window

Gemini 2.5 Flash
1,048,576
tokens
Max output: 65,535 tokens
GLM 4.5
131,000
tokens
Max output: 131,000 tokens
Gemini 2.5 Flash has a 88% larger context window.

Speed Performance

MetricGemini 2.5 FlashGLM 4.5Winner
Tokens/second246.2 tok/sN/A
Time to First Token0.47sN/A

Capabilities

Feature Comparison

FeatureGemini 2.5 FlashGLM 4.5
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyGemini 2.5 FlashGLM 4.5
LicenseProprietaryProprietary
AuthorGoogleZ-ai
ReleasedJun 2025Jul 2025

Gemini 2.5 Flash Modalities

Input
fileimagetextaudiovideo
Output
text

GLM 4.5 Modalities

Input
text
Output
text

Related Comparisons

Compare Gemini 2.5 Flash with:

Compare GLM 4.5 with:

Frequently Asked Questions

Gemini 2.5 Flash has cheaper input pricing at $0.30/M tokens. GLM 4.5 has cheaper output pricing at $2.00/M tokens.
Gemini 2.5 Flash scores higher on coding benchmarks with a score of 17.8, compared to GLM 4.5's score of N/A.
Gemini 2.5 Flash has a 1,048,576 token context window, while GLM 4.5 has a 131,000 token context window.
Gemini 2.5 Flash supports vision. GLM 4.5 does not support vision.