Price Per TokenPrice Per Token

WMDP Leaderboard

Weapons of Mass Destruction Proxy — benchmark testing knowledge safety boundaries.

Data from LayerLens

Models

18

Best Score

86.8

Average

68.3

Std Dev

16.8

Categories
Reasoning and Logic
Provider
Model
Input $/M
Output $/M
WMDP
Actions
$0.500
$3.000
86.8
$0.500
$3.000
86.8
$1.100
$4.400
80.5
$3.000
$15.000
78.2
$3.000
$15.000
78.2
$3.000
$15.000
74.3
$0.100
$0.400
72.0
$0.320
$0.890
71.8
$2.000
$8.000
71.4
$2.500
$10.000
69.8
$2.500
$10.000
69.6
$4.000
$4.000
67.7
$0.800
$3.200
67.0
$2.000
$6.000
65.3
$0.800
$4.000
64.3
$0.100
$0.300
61.0
$0.060
$0.140
58.0
$0.051
$0.340
6.7

Pricing from OpenRouter. Benchmarks from Artificial Analysis.

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

93 out of our 301 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

About WMDP

Weapons of Mass Destruction Proxy — benchmark testing knowledge safety boundaries.

This leaderboard shows all models with WMDP benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.

Advertise with us