Price Per Token

AGIEval Chinese Leaderboard

AGIEval Chinese — reasoning tasks from Chinese standardized exams (Gaokao, civil service).

Data from LayerLens

Models: 32
Best Score: 90.1
Average: 76.5
Std Dev: 11.4

Categories: Reasoning and Logic, Multilingual
Input ($/M tokens)   Output ($/M tokens)   AGIEval Chinese
$0.270               $0.410                90.1
$0.455               $1.820                89.4
$0.455               $1.820                89.4
$0.280               $1.100                89.0
$0.350               $1.710                88.2
$0.350               $1.710                88.2
$0.700               $2.500                87.8
$0.150               $0.400                87.3
$0.080               $0.240                86.7
$0.080               $0.240                86.7
$0.270               $0.410                85.8
$0.080               $0.280                85.5
$0.080               $0.280                85.5
$0.071               $0.100                84.4
$0.120               $0.390                78.4
$0.300               $0.500                77.4
$0.320               $0.890                75.8
$1.100               $4.400                74.8
$0.400               $2.000                74.2
$0.400               $2.000                74.2
$2.500               $10.000               73.6
$0.100               $0.400                73.3
$3.000               $15.000               71.4
$2.000               $8.000                69.3
$2.500               $10.000               68.3
$0.800               $3.200                64.8
$2.000               $6.000                63.1
$0.060               $0.140                60.8
$0.040               $0.150                60.3
$0.100               $0.300                59.8
$0.035               $0.140                56.2
$2.500               $10.000               49.7

Pricing from OpenRouter. Benchmarks from Artificial Analysis.
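The summary figures at the top of the page (32 models, best 90.1, average 76.5, std dev 11.4) can be recomputed from the score column. The sketch below is an illustration, not the site's own method: the page does not say which standard deviation formula it uses, so both the population and the sample form are computed, and the variable names are hypothetical.

```python
from statistics import mean, pstdev, stdev

# AGIEval Chinese scores copied from the leaderboard, highest to lowest.
scores = [
    90.1, 89.4, 89.4, 89.0, 88.2, 88.2, 87.8, 87.3,
    86.7, 86.7, 85.8, 85.5, 85.5, 84.4, 78.4, 77.4,
    75.8, 74.8, 74.2, 74.2, 73.6, 73.3, 71.4, 69.3,
    68.3, 64.8, 63.1, 60.8, 60.3, 59.8, 56.2, 49.7,
]

summary = {
    "models": len(scores),
    "best": max(scores),
    "average": round(mean(scores), 1),
    # Population form divides by n; sample form divides by n - 1.
    "std_dev_population": round(pstdev(scores), 1),
    "std_dev_sample": round(stdev(scores), 1),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

With these 32 scores, the population form appears to be the one that matches the 11.4 shown on the page.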

About AGIEval Chinese

This leaderboard shows all models with AGIEval Chinese benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
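One way to compare performance against cost is to divide each score by a blended token price. The sketch below uses a handful of rows from the leaderboard and assumes a 3:1 input-to-output token mix. That ratio, the `Row` type, and the `blended_price` helper are illustrative assumptions, not something the page defines, so the ranking will shift with a different workload mix.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    input_price: float   # USD per million input tokens
    output_price: float  # USD per million output tokens
    score: float         # AGIEval Chinese score


def blended_price(row: Row, input_share: float = 0.75) -> float:
    """Weighted average price, assuming 75% of tokens are input (hypothetical mix)."""
    return input_share * row.input_price + (1 - input_share) * row.output_price


def score_per_dollar(row: Row) -> float:
    """Benchmark points per blended dollar per million tokens."""
    return row.score / blended_price(row)


# A few rows copied from the leaderboard (model names are not available in the listing).
rows = [
    Row(0.270, 0.410, 90.1),
    Row(0.071, 0.100, 84.4),
    Row(0.035, 0.140, 56.2),
    Row(3.000, 15.000, 71.4),
    Row(2.500, 10.000, 49.7),
]

# Sort from most to least score per blended dollar.
ranked = sorted(rows, key=score_per_dollar, reverse=True)

for row in ranked:
    print(f"score={row.score:5.1f}  blended=${blended_price(row):.3f}/M  "
          f"points per $ = {score_per_dollar(row):.1f}")
```

A ratio like this favors very cheap models, so it is probably best read alongside the raw score rather than in place of it.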