Price Per Token

DarkBench Leaderboard

DarkBench is a benchmark that tests model safety and resistance to adversarial attacks.

Data from LayerLens

Models: 13
Best Score: 51.1
Average: 44.6
Std Dev: 7.1

Categories: Reasoning and Logic
Input $/M    Output $/M    DarkBench
$3.000       $15.000       51.1
$0.060       $0.140        49.5
$0.080       $0.240        49.2
$0.080       $0.240        49.2
$0.500       $3.000        49.1
$0.500       $3.000        49.1
$0.550       $2.200        48.6
$0.080       $0.300        47.7
$1.100       $4.400        42.8
$0.800       $3.200        42.7
$2.500       $10.000       41.8
$3.000       $15.000       31.5
$3.000       $15.000       27.7

Pricing from OpenRouter. Benchmarks from Artificial Analysis.
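
The summary figures at the top of the page (13 models, best score 51.1, average 44.6, std dev 7.1) can be reproduced from the DarkBench score column. The sketch below is only an illustration: the page does not say which standard deviation formula it uses, so the use of the population form (`pstdev`) is an assumption, chosen because it matches the published 7.1 after rounding.

```python
from statistics import mean, pstdev

# DarkBench scores copied from the leaderboard rows, highest to lowest.
scores = [51.1, 49.5, 49.2, 49.2, 49.1, 49.1, 48.6,
          47.7, 42.8, 42.7, 41.8, 31.5, 27.7]

summary = {
    "models": len(scores),
    "best": max(scores),
    "average": round(mean(scores), 1),
    # Assumption: population standard deviation (divide by n, not n - 1).
    "std_dev": round(pstdev(scores), 1),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

With only 13 models, the two lowest scores (31.5 and 27.7) account for most of the spread, so the average sits well below the cluster of scores near 49.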

About DarkBench

This leaderboard shows all models with DarkBench benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
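
One way to compare performance against cost is to divide each score by a blended price per million tokens. The sketch below is a rough illustration, not the site's method: the helper `blended_price` and its 75% input / 25% output token mix are hypothetical assumptions, and rows are identified by their position in the leaderboard because provider and model names are not part of the data used here.

```python
# (input $/M tokens, output $/M tokens, DarkBench score), in leaderboard order.
rows = [
    (3.00, 15.00, 51.1),
    (0.06, 0.14, 49.5),
    (0.08, 0.24, 49.2),
    (0.08, 0.24, 49.2),
    (0.50, 3.00, 49.1),
    (0.50, 3.00, 49.1),
    (0.55, 2.20, 48.6),
    (0.08, 0.30, 47.7),
    (1.10, 4.40, 42.8),
    (0.80, 3.20, 42.7),
    (2.50, 10.00, 41.8),
    (3.00, 15.00, 31.5),
    (3.00, 15.00, 27.7),
]


def blended_price(input_price, output_price, input_share=0.75):
    """Hypothetical blended $/M tokens; the 75/25 token mix is an assumption."""
    return input_share * input_price + (1 - input_share) * output_price


# Build (position, score, blended price) and sort by score per blended dollar.
ranked = sorted(
    (
        (position, score, blended_price(inp, out))
        for position, (inp, out, score) in enumerate(rows, start=1)
    ),
    key=lambda item: item[1] / item[2],
    reverse=True,
)

for position, score, price in ranked[:3]:
    print(f"row {position}: score {score}, blended ${price:.3f}/M, "
          f"{score / price:.1f} points per dollar")
```

Under this assumed mix, the top-scoring row is also among the most expensive, while the second row scores only 1.6 points lower at a small fraction of the price. A different token mix would shift the exact ratios, so treat the ordering as indicative only.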