Price Per TokenPrice Per Token

Knights and Knaves Leaderboard

Logic puzzle benchmark based on knights (truth-tellers) and knaves (liars) puzzles.

Data from LayerLens

Models

26

Best Score

99.7

Average

52.2

Std Dev

29.5

Categories
Reasoning and Logic
Provider
Model
Input $/M
Output $/M
Knights and Knaves
Actions
$1.100
$4.400
99.7
$1.100
$4.400
99.7
$0.450
$2.150
97.9
$0.700
$2.500
97.3
$3.000
$15.000
94.0
$0.030
$0.140
94.0
$2.000
$8.000
77.1
$0.400
$2.000
60.7
$0.320
$0.890
60.3
$0.150
$0.600
59.4
$0.040
$0.150
57.6
$0.060
$0.180
55.6
$0.100
$0.400
52.9
$0.020
$0.040
40.6
$2.500
$10.000
39.1
$0.060
$0.140
38.3
$4.000
$4.000
33.9
$2.500
$10.000
32.0
$2.000
$6.000
31.4
$0.080
$0.300
30.7
$0.800
$3.200
28.3
$0.100
$0.300
24.0
$0.035
$0.140
19.3
$0.800
$4.000
15.0
$2.500
$10.000
12.5
$0.051
$0.340
6.1

Pricing from OpenRouter. Benchmarks from Artificial Analysis.

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

93 out of our 301 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

About Knights and Knaves

Logic puzzle benchmark based on knights (truth-tellers) and knaves (liars) puzzles.

This leaderboard shows all models with Knights and Knaves benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.

Advertise with us