Price Per Token

Z-ai API Pricing (Updated 2026)

Compare pricing for all 13 Z-ai models. Prices shown per 1M tokens with historical trends.

Last updated: May 13, 2026 at 08:24 AM

Overview

Total Models: 13
Cheapest Input (per 1M): $0.06
Max Context Length: 205K

Model Release Timeline

15 releases, Aug 2025 to Apr 2026 (input / output price per 1M tokens):

Aug 2025: $0.60 / $1.80
Aug 2025: $0.60 / $1.80
Sep 2025: $0.39 / $1.74
Sep 2025: $0.39 / $1.74
Dec 2025: $0.30 / $0.90
Dec 2025: $0.30 / $0.90
Dec 2025: $0.40 / $1.75
Dec 2025: $0.40 / $1.75
Jan 2026: $0.06 / $0.40
Jan 2026: $0.06 / $0.40
Feb 2026: $0.60 / $1.92
Feb 2026: $0.60 / $1.92
Mar 2026: $1.20 / $4.00
Apr 2026: $1.20 / $4.00
Apr 2026: $0.98 / $3.08

All Z-ai Models

Compare pricing for all 13 Z-ai models.

Model | Context | Input | Output | Benchmarks (Coding / MMLU / GPQA) | Cache Read | Cache Write
(name missing) | - | $0.000 | $0.000 | - | - | -
(name missing) | 203K | $0.060 | $0.400 | 45.2 | $0.01 | -
GLM-4.7-Flash Thinking | 203K | $0.060 | $0.400 | 58.1 | $0.01 | -
(name missing) | 128K | $0.100 | $0.100 | - | - | -
(name missing) | 131K | $0.130 | $0.850 | 68.4 / 81.5 / 73.3 | $0.025 | -
(name missing) | 131K | $0.300 | $0.900 | 41.1 / 75.2 / 56.6 | $0.05 | -
GLM 4.6V Thinking | 131K | $0.300 | $0.900 | 16.0 / 79.9 / 71.9 | $0.05 | -
(name missing) | 205K | $0.390 | $1.740 | 56.1 / 78.4 / 63.2 | $0.08 | -
GLM 4.6 Thinking | 205K | $0.390 | $1.740 | 69.5 / 82.9 / 78.0 | $0.08 | -
(name missing) | 203K | $0.400 | $1.750 | 56.2 / 79.4 / 66.4 | $0.08 | -
GLM 4.7 Thinking | 203K | $0.400 | $1.750 | 89.4 / 85.6 / 85.9 | $0.08 | -
(name missing) | 203K | $0.600 | $1.920 | 66.6 | $0.12 | -
GLM 5 Thinking | 203K | $0.600 | $1.920 | 82.0 | $0.12 | -
(name missing) | 66K | $0.600 | $1.800 | 35.2 / 75.1 / 57.3 | $0.11 | -
GLM 4.5V Thinking | 66K | $0.600 | $1.800 | 60.4 / 78.8 / 68.4 | $0.11 | -
(name missing) | 131K | $0.600 | $2.200 | - | $0.11 | -
GLM 4.5 Thinking | 131K | $0.600 | $2.200 | 73.8 / 83.5 / 78.2 | $0.11 | -
(name missing) | 203K | $0.980 | $3.080 | 83.9 | $0.182 | -
(name missing) | 203K | $1.200 | $4.000 | 84.7 | $0.24 | -
(name missing) | 203K | $1.200 | $4.000 | 80.9 | $0.24 | -

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts under 200k tokens.

Showing lowest price across all providers. Benchmarks from Artificial Analysis and HuggingFace Open LLM Leaderboard.
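
To illustrate the tiered-pricing footnote, here is a minimal sketch of how a per-prompt rate might be selected. The 200k threshold comes from the footnote; the long-prompt rate, the function name, and the assumption that the whole prompt is billed at a single tier's rate are all hypothetical, since the page does not list prices for the higher tier.

```python
# Threshold from the footnote: displayed prices apply to prompts under 200k tokens.
TIER_THRESHOLD_TOKENS = 200_000

# Placeholder only: the page does not publish the rate for longer prompts.
HYPOTHETICAL_LONG_PROMPT_PRICE = 1.20


def input_price_for_prompt(prompt_tokens: int,
                           base_price: float,
                           long_prompt_price: float) -> float:
    """Return the input price per 1M tokens for a prompt of the given length.

    Assumes (hypothetically) that the entire prompt is billed at one tier's rate.
    """
    if prompt_tokens < TIER_THRESHOLD_TOKENS:
        return base_price
    return long_prompt_price


# A 150k-token prompt stays on the displayed $0.60 rate.
short_rate = input_price_for_prompt(150_000, 0.60, HYPOTHETICAL_LONG_PROMPT_PRICE)
# A 200k-token prompt is no longer "under 200k", so it moves to the placeholder rate.
long_rate = input_price_for_prompt(200_000, 0.60, HYPOTHETICAL_LONG_PROMPT_PRICE)
print(short_rate, long_rate)
```

Check the provider's own pricing page for the actual higher-tier rates before relying on a calculation like this.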

Cost vs. Quality

Compare Z-ai models by benchmark performance and cost.


Community Ratings

Rate how well Z-ai models perform across popular AI coding tools.

Historical Pricing Trends

Price Changes Over Time

No historical pricing data available for this provider.

Try Z-ai Models

Cost Calculator

Estimate your costs for any Z-ai model based on expected token usage.


Try in Playground

Test Z-ai models directly in your browser.


Frequently Asked Questions

Currently, GLM-4.7-Flash is the most affordable at $0.06 per 1M input tokens.
Count your input tokens (the prompt) and your expected output tokens (the response), then apply the formula: cost = (input_tokens / 1M × input_price) + (output_tokens / 1M × output_price), where both prices are quoted per 1M tokens. Our pricing calculator can automate this.
Z-ai pricing varies by model. Input costs range from $0.06 to $1.20 per 1M tokens. See the table above for specific model pricing.
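
The cost formula in the answers above can be sketched in a few lines of Python. The function and parameter names are illustrative, not part of any Z-ai API, and the example uses the $0.06 input / $0.40 output rates listed in the table; cache pricing and tiered rates are ignored.

```python
def estimate_cost(input_tokens: int,
                  output_tokens: int,
                  input_price_per_m: float,
                  output_price_per_m: float) -> float:
    """Estimate the USD cost of one request from per-1M-token prices."""
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# 50,000 prompt tokens and 2,000 response tokens at $0.06 / $0.40 per 1M tokens.
cost = estimate_cost(50_000, 2_000, input_price_per_m=0.06, output_price_per_m=0.40)
print(f"${cost:.4f}")  # $0.0038
```

For monthly budgeting, multiply the per-request estimate by the expected number of requests.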