Price Per TokenPrice Per Token
Moonshotai
Moonshotai
vs
Z-ai

Kimi K2 Thinking vs GLM 5

A detailed comparison of pricing, benchmarks, and capabilities

OpenClaw

Best LLMs for OpenClaw Vote for which model works best with OpenClaw

112 out of our 301 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

Kimi K2 Thinking wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Larger context window
  • Faster response time
  • Higher intelligence benchmark
  • Better at coding
  • Better at math
  • Has reasoning mode
  • Supports tool calls

GLM 5 wins:

  • No clear advantages in compared metrics
Price Advantage
Kimi K2 Thinking
Benchmark Advantage
Kimi K2 Thinking
Context Window
Kimi K2 Thinking
Speed
Kimi K2 Thinking

Pricing Comparison

Price Comparison

MetricKimi K2 ThinkingGLM 5Winner
Input (per 1M tokens)$0.40$0.80 Kimi K2 Thinking
Output (per 1M tokens)$1.75$2.56 Kimi K2 Thinking
Cache Read (per 1M)$200000.00$160000.00 GLM 5
Using a 3:1 input/output ratio, Kimi K2 Thinking is 41% cheaper overall.

Kimi K2 Thinking Providers

Chutes $0.40 (Cheapest)
DeepInfra $0.47
Parasail $0.50
SiliconFlow $0.55
AtlasCloud $0.60

GLM 5 Providers

AtlasCloud $0.80 (Cheapest)
Parasail $1.00
Z.AI $1.00
Friendli $1.00
Together $1.00

Benchmark Comparison

6
Benchmarks Compared
0
Kimi K2 Thinking Wins
0
GLM 5 Wins

Benchmark Scores

BenchmarkKimi K2 ThinkingGLM 5Winner
Intelligence Index
Overall intelligence score
40.7--
Coding Index
Code generation & understanding
34.8--
Math Index
Mathematical reasoning
94.7--
MMLU Pro
Academic knowledge
84.8--
GPQA
Graduate-level science
83.8--
LiveCodeBench
Competitive programming
85.3--
Kimi K2 Thinking significantly outperforms in coding benchmarks.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
Kimi K2 Thinking
Other models

Context & Performance

Context Window

Kimi K2 Thinking
262,144
tokens
Max output: 65,535 tokens
GLM 5
202,752
tokens
Max output: 131,072 tokens
Kimi K2 Thinking has a 23% larger context window.

Speed Performance

MetricKimi K2 ThinkingGLM 5Winner
Tokens/second87.6 tok/sN/A
Time to First Token0.62sN/A

Capabilities

Feature Comparison

FeatureKimi K2 ThinkingGLM 5
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyKimi K2 ThinkingGLM 5
LicenseProprietaryProprietary
AuthorMoonshotaiZ-ai
ReleasedNov 2025Feb 2026

Kimi K2 Thinking Modalities

Input
text
Output
text

GLM 5 Modalities

Input
text
Output
text

Related Comparisons

Compare Kimi K2 Thinking with:

Compare GLM 5 with:

Frequently Asked Questions

Kimi K2 Thinking has cheaper input pricing at $0.40/M tokens. Kimi K2 Thinking has cheaper output pricing at $1.75/M tokens.
Kimi K2 Thinking scores higher on coding benchmarks with a score of 34.8, compared to GLM 5's score of N/A.
Kimi K2 Thinking has a 262,144 token context window, while GLM 5 has a 202,752 token context window.
Kimi K2 Thinking does not support vision. GLM 5 does not support vision.