Key Takeaways
GLM-4.7-Flash wins:
- Cheaper input tokens
- Cheaper output tokens
- Faster response time
- Supports vision
GLM 5 wins:
- Higher intelligence benchmark
- Better at coding
Price Advantage: GLM-4.7-Flash
Benchmark Advantage: GLM 5
Context Window: Tie (202,752 tokens each)
Speed: GLM-4.7-Flash
Pricing Comparison
| Metric | GLM-4.7-Flash | GLM 5 | Winner |
|---|---|---|---|
| Input (per 1M tokens) | $0.06 | $0.72 | GLM-4.7-Flash |
| Output (per 1M tokens) | $0.40 | $2.30 | GLM-4.7-Flash |
| Cache Read (per 1M) | $0.01 | $0.19 | GLM-4.7-Flash |
Using a 3:1 input/output ratio, GLM-4.7-Flash is 87% cheaper overall.
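The 87% figure can be reproduced from the table. The sketch below assumes "3:1 input/output ratio" means three input tokens for every output token and that cache reads are ignored; the function and variable names are illustrative, not part of any provider API.

```python
# Prices in USD per 1M tokens, copied from the pricing table.
PRICES = {
    "GLM-4.7-Flash": {"input": 0.06, "output": 0.40},
    "GLM 5": {"input": 0.72, "output": 2.30},
}


def blended_price(model, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumption: the mix is expressed as parts of input to parts of output,
    and cached reads are not counted.
    """
    price = PRICES[model]
    total_parts = input_parts + output_parts
    weighted = price["input"] * input_parts + price["output"] * output_parts
    return weighted / total_parts


flash_blended = blended_price("GLM-4.7-Flash")
glm5_blended = blended_price("GLM 5")

# Relative saving of GLM-4.7-Flash versus GLM 5, as a percentage.
savings_pct = 100 * (1 - flash_blended / glm5_blended)

print(f"GLM-4.7-Flash blended: ${flash_blended:.3f} per 1M tokens")
print(f"GLM 5 blended:         ${glm5_blended:.3f} per 1M tokens")
print(f"GLM-4.7-Flash is about {savings_pct:.0f}% cheaper")
```

Changing `input_parts` and `output_parts` shows how the saving shifts for workloads that are more output-heavy than the 3:1 mix used here.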
GLM-4.7-Flash Providers
No provider data available
GLM 5 Providers
No provider data available
Benchmark Comparison
Benchmarks Compared: 3
GLM-4.7-Flash Wins: 0
GLM 5 Wins: 3
Benchmark Scores
| Benchmark | GLM-4.7-Flash | GLM 5 | Winner |
|---|---|---|---|
| Intelligence Index (overall intelligence score) | 22.1 | 40.6 | GLM 5 |
| Coding Index (code generation & understanding) | 11.0 | 39.0 | GLM 5 |
| GPQA (graduate-level science) | 45.2 | 66.6 | GLM 5 |
GLM 5 leads on all three benchmarks, with the widest gap on the Coding Index (39.0 vs. 11.0).
[Chart: Cost vs Quality]
Context & Performance
Context Window
GLM-4.7-Flash: 202,752 tokens
GLM 5: 202,752 tokens
Speed Performance
| Metric | GLM-4.7-Flash | GLM 5 | Winner |
|---|---|---|---|
| Tokens/second | 87.8 tok/s | 65.3 tok/s | GLM-4.7-Flash |
| Time to First Token | 0.71s | 1.03s | GLM-4.7-Flash |
GLM-4.7-Flash generates output about 34% faster (87.8 vs. 65.3 tok/s) and reaches its first token about 31% sooner (0.71s vs. 1.03s).
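To see what these speed figures mean for a whole response, the sketch below combines time to first token with throughput. It assumes a simple model in which total latency is time to first token plus output tokens divided by tokens per second; real latency also depends on provider load and network conditions, and the 500-token response length is a hypothetical example.

```python
# Speed figures copied from the speed table.
SPEED = {
    "GLM-4.7-Flash": {"tokens_per_second": 87.8, "ttft_seconds": 0.71},
    "GLM 5": {"tokens_per_second": 65.3, "ttft_seconds": 1.03},
}


def estimated_response_seconds(model, output_tokens):
    """Rough end-to-end latency: time to first token plus generation time.

    Assumption: throughput is constant over the whole response.
    """
    speed = SPEED[model]
    return speed["ttft_seconds"] + output_tokens / speed["tokens_per_second"]


# Relative throughput advantage of GLM-4.7-Flash, as a percentage.
throughput_gain_pct = 100 * (
    SPEED["GLM-4.7-Flash"]["tokens_per_second"]
    / SPEED["GLM 5"]["tokens_per_second"]
    - 1
)

# Relative reduction in time to first token, as a percentage.
ttft_reduction_pct = 100 * (
    1 - SPEED["GLM-4.7-Flash"]["ttft_seconds"] / SPEED["GLM 5"]["ttft_seconds"]
)

print(f"Throughput gain: {throughput_gain_pct:.0f}%")
print(f"TTFT reduction:  {ttft_reduction_pct:.0f}%")

output_tokens = 500  # hypothetical response length
for model in SPEED:
    seconds = estimated_response_seconds(model, output_tokens)
    print(f"{model}: ~{seconds:.1f}s for {output_tokens} output tokens")
```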
Capabilities
Feature Comparison
| Feature | GLM-4.7-Flash | GLM 5 |
|---|---|---|
| Vision (Image Input) | | |
| Tool/Function Calls | | |
| Reasoning Mode | | |
| Audio Input | | |
| Audio Output | | |
| PDF Input | | |
| Prompt Caching | | |
| Web Search | | |
License & Release
| Property | GLM-4.7-Flash | GLM 5 |
|---|---|---|
| License | Open Source | Open Source |
| Author | Z-ai | Z-ai |
| Released | Jan 2026 | Feb 2026 |
GLM-4.7-Flash Modalities
Input: text
Output: text
GLM 5 Modalities
Input: text
Output: text