Price Per Token

Formal Logic Extended Leaderboard

Extended formal logic benchmark testing deductive and propositional reasoning.

Data from LayerLens

As of April 18, 2026, the top-scoring model on Formal Logic Extended is o3 Mini at 99.8%, followed by R1 0528 at 99.2% and R1 at 98.4%. In total, 19 models have been evaluated on this benchmark.

Last updated: April 18, 2026

Models: 19
Best Score: 99.8
Average: 64.4
Std Dev: 34.4
Categories: Reasoning and Logic

Model     Input $/M  Output $/M  Formal Logic Extended
o3 Mini   $0.550     $2.200      99.8
R1 0528   $0.500     $2.150      99.2
R1        $0.550     $2.000      98.4
-         $3.000     $15.000     95.6
-         $2.000     $8.000      90.6
-         $0.150     $0.580      89.5
-         $0.150     $0.580      89.5
-         $3.000     $15.000     87.4
-         $0.014     $0.028      83.6
-         $2.500     $10.000     80.7
-         $3.000     $15.000     71.9
-         $0.800     $4.000      60.3
-         $0.900     $0.900      58.2
-         $0.030     $0.050      52.2
-         $0.070     $0.280      45.1
-         $0.150     $0.600      10.8
-         $0.065     $0.140      9.7
-         $0.080     $0.300      0.1
-         $2.500     $10.000     -

Pricing from OpenRouter. Benchmarks from Artificial Analysis.

About Formal Logic Extended

Extended formal logic benchmark testing deductive and propositional reasoning.

This leaderboard shows all models with Formal Logic Extended benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
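
One way to compare performance against cost is to divide each score by a blended price per million tokens. The following is a minimal sketch, not the site's method: the 75/25 input-to-output token split, the `blended_price` and `score_per_dollar` helpers, and the "row N" labels for rows whose model names are not shown are all assumptions made for illustration. The prices and scores are copied from a few rows of the leaderboard.

```python
from typing import NamedTuple, Optional


class Row(NamedTuple):
    label: str              # model name, or a hypothetical "row N" label
    input_per_m: float      # input price, $ per million tokens
    output_per_m: float     # output price, $ per million tokens
    score: Optional[float]  # Formal Logic Extended score, None if not shown


# A sample of leaderboard rows (values taken from the table).
ROWS = [
    Row("o3 Mini", 0.550, 2.200, 99.8),
    Row("R1 0528", 0.500, 2.150, 99.2),
    Row("R1", 0.550, 2.000, 98.4),
    Row("row 9", 0.014, 0.028, 83.6),
    Row("row 19", 2.500, 10.000, None),
]


def blended_price(row: Row, input_share: float = 0.75) -> float:
    """Weighted $/M price; input_share is an assumed workload mix."""
    return input_share * row.input_per_m + (1 - input_share) * row.output_per_m


def score_per_dollar(row: Row, input_share: float = 0.75) -> Optional[float]:
    """Benchmark points per blended dollar, or None for unscored rows."""
    if row.score is None:
        return None
    return row.score / blended_price(row, input_share)


# Rank only the rows that have a score, best value first.
ranked = sorted(
    (r for r in ROWS if r.score is not None),
    key=score_per_dollar,
    reverse=True,
)

for r in ranked:
    print(f"{r.label:8s} {r.score:5.1f} pts  {score_per_dollar(r):8.1f} pts per $")
```

A ratio like this favors very cheap models, so it is usually worth pairing it with a minimum acceptable score rather than using it alone.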

Frequently Asked Questions

Formal Logic Extended is an extended formal logic benchmark that tests deductive and propositional reasoning.

As of April 18, 2026, o3 Mini leads the Formal Logic Extended leaderboard with a score of 99.8. Rankings change as new models are released and evaluated.

Currently, 19 models have been evaluated on Formal Logic Extended, with an average score of 64.4 and a standard deviation of 34.4.

Benchmark scores are updated when new evaluations are published by our data sources (Artificial Analysis and LayerLens). Pricing data is refreshed daily from OpenRouter.
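
For readers who want to recompute summary statistics like the ones quoted above, here is a minimal sketch using Python's standard `statistics` module. The page does not say whether its standard deviation is the population or sample form, or how a row without a score is treated, so both choices here are assumptions: the hypothetical `summarize` helper skips unscored rows and exposes the population/sample choice as a parameter.

```python
import statistics
from typing import Iterable, Optional


def summarize(scores: Iterable[Optional[float]], population: bool = True) -> dict:
    """Summary statistics over the rows that have a numeric score."""
    values = [s for s in scores if s is not None]
    # pstdev divides by n (population); stdev divides by n - 1 (sample).
    spread = statistics.pstdev(values) if population else statistics.stdev(values)
    return {
        "models": len(values),
        "best": max(values),
        "average": statistics.fmean(values),
        "std_dev": spread,
    }


# Scores as displayed on the leaderboard; None marks the row shown as "-".
VISIBLE_SCORES = [
    99.8, 99.2, 98.4, 95.6, 90.6, 89.5, 89.5, 87.4, 83.6, 80.7,
    71.9, 60.3, 58.2, 52.2, 45.1, 10.8, 9.7, 0.1, None,
]

summary = summarize(VISIBLE_SCORES)
print(summary)
```

Because the displayed scores are rounded and one row has no visible score, the output of this sketch need not match the published average and standard deviation.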