Key Takeaways
Kimi K2 Thinking wins:
- Cheaper input tokens
- Larger context window
GLM 4.5 wins:
- Supports tool calls
- Price Advantage: Kimi K2 Thinking
- Benchmark Advantage: Kimi K2 Thinking
- Context Window: Kimi K2 Thinking
- Speed: N/A
Pricing Comparison
Price Comparison
| Metric | Kimi K2 Thinking | GLM 4.5 | Winner |
|---|---|---|---|
| Input (per 1M tokens) | $0.47 | $0.55 | Kimi K2 Thinking |
| Output (per 1M tokens) | $2.00 | $2.00 | Tie |
| Cache Read (per 1M) | $0.14 | N/A | Kimi K2 Thinking |
Using a 3:1 input/output ratio, Kimi K2 Thinking is 7% cheaper overall.
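The 7% figure can be reproduced from the table prices. The sketch below assumes the "3:1 input/output ratio" means a weighted average of three parts input price to one part output price, with cache reads ignored. The page does not state its exact formula, so the function names and the weighting are illustrative assumptions.

```python
# Prices in USD per 1M tokens, copied from the pricing table.
PRICES = {
    "Kimi K2 Thinking": {"input": 0.47, "output": 2.00},
    "GLM 4.5": {"input": 0.55, "output": 2.00},
}


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumption: the mix is a plain weighted average; cache-read pricing is ignored.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


def percent_cheaper(cheaper, pricier):
    """Savings of `cheaper` relative to `pricier`, as a percentage."""
    return (pricier - cheaper) / pricier * 100


kimi_blended = blended_price(PRICES["Kimi K2 Thinking"]["input"],
                             PRICES["Kimi K2 Thinking"]["output"])
glm_blended = blended_price(PRICES["GLM 4.5"]["input"],
                            PRICES["GLM 4.5"]["output"])
savings = percent_cheaper(kimi_blended, glm_blended)

print(f"Kimi K2 Thinking blended: ${kimi_blended:.4f} per 1M tokens")
print(f"GLM 4.5 blended:          ${glm_blended:.4f} per 1M tokens")
print(f"Savings: {savings:.1f}%")
```

Under these assumptions the blended prices are $0.8525 and $0.9125 per 1M tokens, a saving of about 6.6%, which rounds to the 7% quoted. Because output prices are tied, the gap shrinks as the workload becomes more output-heavy.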
Kimi K2 Thinking Providers
- DeepInfra: $0.47 (cheapest)
- Parasail: $0.50
- SiliconFlow: $0.55
- AtlasCloud: $0.60
- Nebius: $0.60
GLM 4.5 Providers
- WandB: $0.55 (cheapest)
- Z.AI: $0.60
- Nebius: $0.60
- Novita: $0.60
Context & Performance
Context Window
- Kimi K2 Thinking: 131,072 tokens
- GLM 4.5: 131,000 tokens (max output: 131,000 tokens)
Kimi K2 Thinking's context window is 72 tokens larger (about 0.05%), so the two are effectively equal.
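The size of the context-window gap follows directly from the two listed limits. This is a minimal check using only the figures on this page; the variable names are illustrative.

```python
# Context window sizes in tokens, as listed on this page.
kimi_context = 131_072  # equals 2**17
glm_context = 131_000

# Absolute and relative difference, measured against the smaller window.
difference = kimi_context - glm_context
relative_percent = difference / glm_context * 100

print(f"Difference: {difference} tokens")
print(f"Relative:   {relative_percent:.3f}%")
```

The gap is 72 tokens, roughly 0.055% of the window, which is why a figure rounded to whole percent shows as 0%.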
Speed Performance
Speed benchmarks not available for these models
Capabilities
Feature Comparison
| Feature | Kimi K2 Thinking | GLM 4.5 |
|---|---|---|
| Vision (Image Input) | ||
| Tool/Function Calls | ||
| Reasoning Mode | ||
| Audio Input | ||
| Audio Output | ||
| PDF Input | ||
| Prompt Caching | ||
| Web Search | ||
License & Release
| Property | Kimi K2 Thinking | GLM 4.5 |
|---|---|---|
| License | Modified MIT | MIT |
| Author | Moonshot AI | Z.ai |
| Released | Nov 2025 | Jul 2025 |
Kimi K2 Thinking Modalities
- Input: text
- Output: text
GLM 4.5 Modalities
- Input: text
- Output: text
