Price Per TokenPrice Per Token

MMLU Leaderboard

Massive Multitask Language Understanding — tests knowledge across 57 subjects.

Data from LayerLens

Models

29

Best Score

90.5

Average

77.0

Std Dev

18.4

Categories
General Knowledge
Provider
Model
Input $/M
Output $/M
MMLU
Actions
$0.450
$2.150
90.5
$0.300
$0.500
89.2
$1.100
$4.400
88.9
$0.550
$2.200
88.3
$0.039
$0.190
87.6
$0.150
$0.400
85.9
$0.300
$2.500
85.7
$0.300
$2.500
85.7
$0.150
$0.600
85.5
$3.000
$15.000
85.3
$3.000
$15.000
85.3
$0.080
$0.280
85.3
$0.080
$0.280
85.3
$0.100
$0.400
84.8
$2.000
$8.000
84.6
$0.280
$1.100
84.5
$2.500
$10.000
84.1
$0.320
$0.890
83.4
$0.400
$2.000
82.6
$0.800
$3.200
78.3
$0.060
$0.140
77.6
$2.000
$6.000
77.2
$0.060
$0.180
76.0
$0.100
$0.300
73.5
$0.800
$4.000
72.8
$0.035
$0.140
68.9
$4.000
$4.000
37.1
$0.080
$0.300
24.6
$0.051
$0.340
15.3

Pricing from OpenRouter. Benchmarks from Artificial Analysis.

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

93 out of our 301 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

About MMLU

Massive Multitask Language Understanding — tests knowledge across 57 subjects.

This leaderboard shows all models with MMLU benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.

Advertise with us