Price Per TokenPrice Per Token
Z

Z-ai API Pricing (Updated 2026)

Compare pricing for all 11 Z-ai models. Prices shown per 1M tokens with historical trends.

Get our weekly newsletter on pricing changes, new releases, and tools.

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

8 Ways to Use Fewer Tokens
Last updated: April 1, 2026 at 08:39 AM

Overview

11
Total Models
$0.06
Cheapest Input (per 1M)
205K
Max Context Length

Model Release Timeline

15 releases
Jul 2025
$0.60 / $2.20
Jul 2025
$0.60 / $2.20
Aug 2025
$0.60 / $1.80
Aug 2025
$0.60 / $1.80
Sep 2025
$0.39 / $1.70
Sep 2025
$0.39 / $1.70
Dec 2025
$0.30 / $0.90
Dec 2025
$0.30 / $0.90
Dec 2025
$0.39 / $1.75
Dec 2025
$0.39 / $1.75
Jan 2026
$0.06 / $0.40
Jan 2026
$0.06 / $0.40
Feb 2026
$0.72 / $2.30
Feb 2026
$0.72 / $2.30
Mar 2026
$1.20 / $4.00
Spacing represents time between releases
Jul 2025Mar 2026

All Z-ai Models

Compare pricing for all 11 Z-ai models.

Model
Context
Input
Output
Coding
MMLU
GPQA
Cache Read
Cache Write
Actions
-
$0.000
$0.000
-
-
Try
203K
$0.060
$0.400
45.2
$0.01
-
Try
GLM-4.7-Flash Thinking
203K
$0.060
$0.400
58.1
$0.01
-
Try
128K
$0.100
$0.100
-
-
Try
131K
$0.130
$0.850
68.4
81.5
73.3
$0.025
-
Try
131K
$0.300
$0.900
41.1
75.2
56.6
$0.05
-
Try
GLM 4.6V Thinking
131K
$0.300
$0.900
16.0
79.9
71.9
$0.05
-
Try
203K
$0.390
$1.750
56.2
79.4
66.4
$0.08
-
Try
GLM 4.7 Thinking
203K
$0.390
$1.750
89.4
85.6
85.9
$0.08
-
Try
205K
$0.390
$1.700
56.1
78.4
63.2
$0.08
-
Try
GLM 4.6 Thinking
205K
$0.390
$1.700
69.5
82.9
78.0
$0.08
-
Try
66K
$0.600
$1.800
35.2
75.1
57.3
$0.11
-
Try
GLM 4.5V Thinking
66K
$0.600
$1.800
60.4
78.8
68.4
$0.11
-
Try
131K
$0.600
$2.200
$0.11
-
Try
GLM 4.5 Thinking
131K
$0.600
$2.200
73.8
83.5
78.2
$0.11
-
Try
203K
$0.720
$2.300
66.6
$0.19
-
Try
GLM 5 Thinking
203K
$0.720
$2.300
82.0
$0.19
-
Try
203K
$1.200
$4.000
84.7
$0.24
-
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts under 200k tokens.

Showing lowest price across all providers. Benchmarks from Artificial Analysis and HuggingFace Open LLM Leaderboard.

Cost vs. Quality

Compare Z-ai models by benchmark performance and cost.

X-axis:
Y-axis:
Loading chart...
Z-ai Models

Community Ratings

Rate how well Z-ai models perform across popular AI coding tools.

Historical Pricing Trends

Price Changes Over Time

No historical pricing data available for this provider.

Try Z-ai Models

Cost Calculator

Estimate your costs for any Z-ai model based on expected token usage.

Open Cost Calculator

Try in Playground

Test Z-ai models directly in your browser.

Open Playground

Frequently Asked Questions

Currently, GLM-4.7-Flash is the most affordable at $0.06 per 1M input tokens.
Count your input tokens (prompt) and expected output tokens (response). Use the formula: (input_tokens/1M × input_price) + (output_tokens/1M × output_price). Our pricing calculator can help automate this.
Z-ai pricing varies by model. Input costs range from $0.06 to $1.20 per 1M tokens. See the table above for specific model pricing.