Price Per Token
Moonshotai vs Z-ai

Kimi K2 Thinking vs GLM 4.5

A detailed comparison of pricing, benchmarks, and capabilities

Key Takeaways

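Kimi K2 Thinking wins:

  • Larger context window (262,144 vs 131,072 tokens)
  • Has reasoning mode

Speed and benchmark figures (intelligence, coding, math) are listed only for Kimi K2 Thinking. GLM 4.5 has no values recorded in those categories, so they cannot be compared head to head.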

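GLM 4.5 wins:

  • Cheaper input tokens ($0.35 vs $0.40 per 1M)
  • Cheaper output tokens ($1.55 vs $1.75 per 1M)

Price advantage: GLM 4.5
Benchmark advantage: Kimi K2 Thinking (no GLM 4.5 scores recorded)
Context window: Kimi K2 Thinking
Speed: Kimi K2 Thinking (no GLM 4.5 measurements recorded)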

Pricing Comparison

Price Comparison

Metric | Kimi K2 Thinking | GLM 4.5 | Winner
Input (per 1M tokens) | $0.40 | $0.35 | GLM 4.5
Output (per 1M tokens) | $1.75 | $1.55 | GLM 4.5
Cache Read (per 1M tokens) | $0.20 | $0.175 | GLM 4.5

Using a 3:1 input/output ratio, GLM 4.5 is 12% cheaper overall.
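
The 12% figure can be reproduced from the input and output prices in the table. The following is a minimal sketch, assuming "3:1 input/output ratio" means three input tokens for every output token; the function name `blended_price` is made up for this illustration.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Prices in USD per 1M tokens, taken from the pricing table.
kimi = blended_price(0.40, 1.75)
glm = blended_price(0.35, 1.55)

# Relative saving of GLM 4.5 compared with Kimi K2 Thinking.
savings = (kimi - glm) / kimi

print(f"Kimi K2 Thinking: ${kimi:.4f} per 1M tokens")  # $0.7375
print(f"GLM 4.5:          ${glm:.4f} per 1M tokens")   # $0.6500
print(f"GLM 4.5 is {savings:.0%} cheaper")             # 12%
```

A different workload mix changes the result only slightly here, because GLM 4.5 is cheaper on both input and output by a similar proportion.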

Kimi K2 Thinking Providers

Chutes $0.40 (Cheapest)
DeepInfra $0.47
Parasail $0.50
SiliconFlow $0.55
AtlasCloud $0.60

GLM 4.5 Providers

Chutes $0.35 (Cheapest)
WandB $0.55
Z.AI $0.60
Nebius $0.60
Novita $0.60
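
To compare providers programmatically, the two lists above can be treated as simple lookup tables. This is a small sketch that assumes the listed figures are input prices in USD per 1M tokens (the cheapest entries match the input prices in the pricing table); the helper names are hypothetical.

```python
# Provider prices copied from the lists above.
# Assumption: USD per 1M input tokens.
KIMI_K2_THINKING = {
    "Chutes": 0.40, "DeepInfra": 0.47, "Parasail": 0.50,
    "SiliconFlow": 0.55, "AtlasCloud": 0.60,
}
GLM_4_5 = {
    "Chutes": 0.35, "WandB": 0.55, "Z.AI": 0.60,
    "Nebius": 0.60, "Novita": 0.60,
}


def cheapest(providers: dict[str, float]) -> tuple[str, float]:
    """Return the (provider, price) pair with the lowest price."""
    name = min(providers, key=providers.get)
    return name, providers[name]


def premium_over_cheapest(providers: dict[str, float]) -> dict[str, float]:
    """Percent markup of each provider relative to the cheapest one."""
    _, floor = cheapest(providers)
    return {name: (price - floor) / floor * 100 for name, price in providers.items()}


for model, providers in [("Kimi K2 Thinking", KIMI_K2_THINKING), ("GLM 4.5", GLM_4_5)]:
    name, price = cheapest(providers)
    print(f"{model}: cheapest is {name} at ${price:.2f}")
    for provider, pct in premium_over_cheapest(providers).items():
        print(f"  {provider}: +{pct:.0f}%")
```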

Benchmark Comparison

Benchmarks compared: 6
Kimi K2 Thinking wins: 0
GLM 4.5 wins: 0

Benchmark Scores

Benchmark | Description | Kimi K2 Thinking | GLM 4.5 | Winner
Intelligence Index | Overall intelligence score | 40.7 | - | -
Coding Index | Code generation & understanding | 34.8 | - | -
Math Index | Mathematical reasoning | 94.7 | - | -
MMLU Pro | Academic knowledge | 84.8 | - | -
GPQA | Graduate-level science | 83.8 | - | -
LiveCodeBench | Competitive programming | 85.3 | - | -

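Kimi K2 Thinking scores 34.8 on the Coding Index and 85.3 on LiveCodeBench. GLM 4.5 has no scores recorded for any of the six benchmarks, so neither model is counted as a winner.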
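
The summary counts (6 benchmarks, 0 wins on each side) are what a tally produces when one model has no scores at all. Below is a minimal sketch of such a tally, assuming a benchmark counts as a win only when both scores are present; the site's actual scoring logic is not documented here, and the function name `tally_wins` is hypothetical.

```python
from typing import Optional

# (Kimi K2 Thinking, GLM 4.5) scores from the table; None marks a missing score.
SCORES: dict[str, tuple[Optional[float], Optional[float]]] = {
    "Intelligence Index": (40.7, None),
    "Coding Index": (34.8, None),
    "Math Index": (94.7, None),
    "MMLU Pro": (84.8, None),
    "GPQA": (83.8, None),
    "LiveCodeBench": (85.3, None),
}


def tally_wins(scores: dict[str, tuple[Optional[float], Optional[float]]]) -> dict[str, int]:
    """Count wins per model, skipping benchmarks where either score is missing."""
    result = {"kimi": 0, "glm": 0, "not_comparable": 0}
    for kimi_score, glm_score in scores.values():
        if kimi_score is None or glm_score is None:
            result["not_comparable"] += 1
        elif kimi_score > glm_score:
            result["kimi"] += 1
        elif glm_score > kimi_score:
            result["glm"] += 1
    return result


print(tally_wins(SCORES))
# {'kimi': 0, 'glm': 0, 'not_comparable': 6}
```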

Cost vs Quality

[Chart: cost vs quality, Kimi K2 Thinking compared with other models]

Context & Performance

Context Window

Kimi K2 Thinking: 262,144 tokens (max output: 65,535 tokens)
GLM 4.5: 131,072 tokens (max output: 65,536 tokens)
Kimi K2 Thinking's context window is twice the size of GLM 4.5's (100% larger).
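
The relative size follows directly from the two token counts, and the same numbers can be used to check whether a request fits. This sketch assumes the context window is shared between the prompt and the generated output, which is common but not confirmed on this page for either model; the function names are made up for the example.

```python
KIMI_CONTEXT = 262_144  # tokens
GLM_CONTEXT = 131_072   # tokens


def percent_larger(a: int, b: int) -> float:
    """How much larger a is than b, as a percentage of b."""
    return (a - b) / b * 100


def fits(prompt_tokens: int, max_new_tokens: int, context_window: int) -> bool:
    """True if prompt plus requested output stays within the context window.

    Assumption: prompt and output tokens draw on the same window.
    """
    return prompt_tokens + max_new_tokens <= context_window


print(percent_larger(KIMI_CONTEXT, GLM_CONTEXT))  # 100.0
print(fits(150_000, 8_000, GLM_CONTEXT))          # False
print(fits(150_000, 8_000, KIMI_CONTEXT))         # True
```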

Speed Performance

Metric | Kimi K2 Thinking | GLM 4.5 | Winner
Tokens/second | 99.5 tok/s | N/A | -
Time to First Token | 0.57s | N/A | -
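
The two speed metrics combine into a rough estimate of total response time: time to first token plus the number of output tokens divided by throughput. This is a simplified sketch that assumes throughput stays constant for the whole response, which real deployments do not guarantee; `estimated_response_seconds` is a hypothetical helper.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float,
                               time_to_first_token: float) -> float:
    """Rough latency estimate: TTFT plus steady-state generation time."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return time_to_first_token + output_tokens / tokens_per_second


# Kimi K2 Thinking figures from the table (GLM 4.5 has no measurements).
KIMI_TOKENS_PER_SECOND = 99.5
KIMI_TTFT_SECONDS = 0.57

for n in (100, 1_000, 10_000):
    seconds = estimated_response_seconds(n, KIMI_TOKENS_PER_SECOND, KIMI_TTFT_SECONDS)
    print(f"{n:>6} output tokens: about {seconds:.1f}s")
```

For a reasoning model, the output token count would normally include any thinking tokens generated before the final answer, so real response times can be considerably longer than the visible answer length suggests.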

Capabilities

Feature Comparison

Feature | Kimi K2 Thinking | GLM 4.5
Vision (Image Input) | No | No
Tool/Function Calls | |
Reasoning Mode | Yes |
Audio Input | No | No
Audio Output | No | No
PDF Input | |
Prompt Caching | |
Web Search | |

License & Release

Property | Kimi K2 Thinking | GLM 4.5
License | Modified MIT (open weights) | MIT (open weights)
Author | Moonshot AI | Z.ai
Released | Nov 2025 | Jul 2025

Kimi K2 Thinking Modalities

Input: text
Output: text

GLM 4.5 Modalities

Input: text
Output: text

Frequently Asked Questions

GLM 4.5 has cheaper input pricing at $0.35/M tokens. GLM 4.5 has cheaper output pricing at $1.55/M tokens.
Kimi K2 Thinking scores 34.8 on the Coding Index. GLM 4.5 has no coding score recorded, so the two cannot be compared directly on coding benchmarks.
Kimi K2 Thinking has a 262,144 token context window, while GLM 4.5 has a 131,072 token context window.
Kimi K2 Thinking does not support vision. GLM 4.5 does not support vision.