Price Per Token
Z-ai
vs
Z-ai

GLM 4.7 vs GLM-4.7-Flash

A detailed comparison of pricing, benchmarks, and capabilities

100 out of our 304 tracked models have had a price change in February.

Key Takeaways

GLM 4.7 wins:

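  • Larger context window (202,752 vs 200,000 tokens)
  • Faster response time (125.3 tok/s; no GLM-4.7-Flash speed data reported)
  • Higher intelligence benchmark (34.1; no GLM-4.7-Flash score reported)
  • Better at coding (Coding Index 32.0; no GLM-4.7-Flash score reported)
  • Better at math (Math Index 48.0; no GLM-4.7-Flash score reported)
  • Supports tool calls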

GLM-4.7-Flash wins:

  • Cheaper input tokens
  • Cheaper output tokens

Price Advantage: GLM-4.7-Flash
Benchmark Advantage: GLM 4.7
Context Window: GLM 4.7
Speed: GLM 4.7

Pricing Comparison

Price Comparison

Metric | GLM 4.7 | GLM-4.7-Flash | Winner
Input (per 1M tokens) | $0.40 | $0.07 | GLM-4.7-Flash
Output (per 1M tokens) | $1.50 | $0.40 | GLM-4.7-Flash
Cache Read (per 1M) | N/A | $10000.00 | GLM-4.7-Flash

Using a 3:1 input/output ratio, GLM-4.7-Flash is 77% cheaper overall.
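
The 77% figure can be reproduced from the listed prices. The sketch below assumes "3:1 input/output ratio" means three input tokens for every output token and that the blended price is a simple weighted average; the function name `blended_price` is illustrative, not something taken from the site.

```python
def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Listed prices in USD per 1M tokens (input, output).
glm_47 = blended_price(0.40, 1.50)
glm_47_flash = blended_price(0.07, 0.40)

# Relative saving of GLM-4.7-Flash over GLM 4.7, as a percentage.
savings_pct = (1 - glm_47_flash / glm_47) * 100

print(f"GLM 4.7 blended:       ${glm_47:.4f} per 1M tokens")
print(f"GLM-4.7-Flash blended: ${glm_47_flash:.4f} per 1M tokens")
print(f"GLM-4.7-Flash is about {savings_pct:.0f}% cheaper")
```

With a different traffic mix (for example, output-heavy generation), pass other values for `input_parts` and `output_parts`; the saving will shift accordingly.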

GLM 4.7 Providers

Nebius $0.40 (Cheapest)
Chutes $0.40 (Cheapest)
DeepInfra $0.40 (Cheapest)
SiliconFlow $0.42
Parasail $0.45

GLM-4.7-Flash Providers

Z.AI $0.07 (Cheapest)
Novita $0.07 (Cheapest)
Phala $0.10

Benchmark Comparison

Benchmarks Compared: 6
GLM 4.7 Wins: 0
GLM-4.7-Flash Wins: 0

Benchmark Scores

Benchmark | Description | GLM 4.7 | GLM-4.7-Flash | Winner
Intelligence Index | Overall intelligence score | 34.1 | - | -
Coding Index | Code generation & understanding | 32.0 | - | -
Math Index | Mathematical reasoning | 48.0 | - | -
MMLU Pro | Academic knowledge | 79.4 | - | -
GPQA | Graduate-level science | 66.4 | - | -
LiveCodeBench | Competitive programming | 56.2 | - | -

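GLM 4.7 scores 32.0 on the Coding Index and 56.2 on LiveCodeBench. No scores are reported for GLM-4.7-Flash on any of these benchmarks, so no head-to-head winner can be declared.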

Cost vs Quality

[Interactive chart: cost vs quality, with GLM 4.7 plotted against other models]

Context & Performance

Context Window

GLM 4.7: 202,752 tokens (max output: 65,535 tokens)
GLM-4.7-Flash: 200,000 tokens (max output: 131,072 tokens)

GLM 4.7 has a roughly 1% larger context window (2,752 more tokens), while GLM-4.7-Flash allows about twice the maximum output.
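
The size of the context-window gap follows directly from the listed limits. This is a minimal sketch using only the token counts shown above; the helper name `percent_larger` is made up for illustration.

```python
def percent_larger(a, b):
    """Return how much larger a is than b, as a percentage of b."""
    return (a - b) / b * 100


# Context window: GLM 4.7 (202,752) vs GLM-4.7-Flash (200,000).
context_gap = percent_larger(202_752, 200_000)

# Max output: GLM-4.7-Flash (131,072) vs GLM 4.7 (65,535).
output_gap = percent_larger(131_072, 65_535)

print(f"GLM 4.7 context window is {context_gap:.1f}% larger")
print(f"GLM-4.7-Flash max output is {output_gap:.0f}% larger")
```

The two limits point in opposite directions: GLM 4.7 accepts slightly more context, while GLM-4.7-Flash can emit a much longer single response.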

Speed Performance

Metric | GLM 4.7 | GLM-4.7-Flash | Winner
Tokens/second | 125.3 tok/s | N/A | -
Time to First Token | 0.61s | N/A | -
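
To make the speed figures concrete, a rough end-to-end latency can be estimated as time to first token plus generation time. This is a back-of-the-envelope sketch that assumes throughput stays constant for the whole response and ignores network and queueing overhead; the function name and the 1,000-token response length are illustrative assumptions.

```python
def estimated_response_seconds(output_tokens, ttft_seconds, tokens_per_second):
    """Rough latency estimate: wait for the first token, then stream the rest."""
    return ttft_seconds + output_tokens / tokens_per_second


# GLM 4.7 figures from the table above; a 1,000-token response is assumed.
glm_47_latency = estimated_response_seconds(
    output_tokens=1_000,
    ttft_seconds=0.61,
    tokens_per_second=125.3,
)

print(f"Estimated time for 1,000 output tokens: {glm_47_latency:.1f}s")
```

No equivalent estimate is possible for GLM-4.7-Flash, since its speed metrics are listed as N/A.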

Capabilities

Feature Comparison

Feature | GLM 4.7 | GLM-4.7-Flash
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

Property | GLM 4.7 | GLM-4.7-Flash
License | Proprietary | Proprietary
Author | Z-ai | Z-ai
Released | Dec 2025 | Jan 2026

GLM 4.7 Modalities

Input: text
Output: text

GLM-4.7-Flash Modalities

Input: text
Output: text

Frequently Asked Questions

GLM-4.7-Flash is cheaper on both input ($0.07 vs $0.40 per 1M tokens) and output ($0.40 vs $1.50 per 1M tokens).
GLM 4.7 scores 32.0 on the Coding Index; no coding score is reported for GLM-4.7-Flash.
GLM 4.7 has a 202,752 token context window, while GLM-4.7-Flash has a 200,000 token context window.
Neither GLM 4.7 nor GLM-4.7-Flash supports vision.