Price Per TokenPrice Per Token

SWE-bench Lite Leaderboard

Software Engineering benchmark testing ability to resolve real GitHub issues.

Data from LayerLens

Models

48

Best Score

62.7

Average

23.7

Std Dev

19.3

Categories
Multi-turn
Provider
Model
Input $/M
Output $/M
SWE-bench Lite
Actions
$5.000
$25.000
62.7
$5.000
$25.000
62.7
$1.250
$10.000
54.3
$1.000
$5.000
54.3
$1.000
$5.000
54.3
$5.000
$25.000
49.3
$5.000
$25.000
49.3
$0.220
$1.000
44.7
$0.550
$2.200
42.0
$0.350
$1.710
42.0
$0.350
$1.710
42.0
$1.250
$10.000
40.0
$0.255
$1.000
39.0
$0.250
$2.000
38.3
$0.400
$2.000
36.5
$0.071
$0.100
36.3
$1.250
$10.000
36.3
$0.500
$1.500
33.3
$0.320
$0.890
29.1
$0.800
$4.000
27.7
$0.300
$2.500
26.1
$0.300
$2.500
26.1
$0.300
$0.900
26.0
$0.300
$0.900
26.0
$0.550
$3.500
20.0
$0.080
$0.240
16.3
$0.080
$0.240
16.3
$0.150
$0.750
14.3
$0.150
$0.750
14.3
$0.500
$3.000
12.7
$0.500
$3.000
12.7
$0.039
$0.190
9.0
$0.150
$0.600
8.0
$3.000
$15.000
7.7
$0.550
$2.000
7.7
$0.200
$0.500
7.0
$0.060
$0.180
5.7
$0.080
$0.300
4.0
$0.800
$3.200
2.7
$0.200
$0.500
0.7
$0.200
$0.500
0.7
$0.060
$0.140
-
$0.400
$2.200
-
$0.270
$0.410
-
$0.270
$0.410
-
$0.450
$2.200
-
$0.450
$2.200
-
$0.400
$2.400
-

Pricing from OpenRouter. Benchmarks from Artificial Analysis.

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

93 out of our 301 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

About SWE-bench Lite

Software Engineering benchmark testing ability to resolve real GitHub issues.

This leaderboard shows all models with SWE-bench Lite benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.

Advertise with us