BIRD-CRITIC — multi-turn benchmark testing SQL generation and database interaction.
Data from LayerLens
Models
27
Best Score
34.0
Average
27.9
Std Dev
3.6
Provider | Model | Input $/M | Output $/M | BIRD-CRITIC | Actions |
|---|---|---|---|---|---|
$5.000 | $25.000 | 34.0 | |||
$5.000 | $25.000 | 34.0 | |||
$0.300 | $1.400 | 33.0 | |||
$0.300 | $1.400 | 33.0 | |||
$0.450 | $2.200 | 31.3 | |||
$0.450 | $2.200 | 31.3 | |||
$0.220 | $1.000 | 31.0 | |||
$0.071 | $0.100 | 30.9 | |||
$2.500 | $10.000 | 30.6 | |||
$3.000 | $15.000 | 29.7 | |||
$3.000 | $15.000 | 29.7 | |||
$0.300 | $2.500 | 29.7 | |||
$0.300 | $2.500 | 29.7 | |||
$0.039 | $0.190 | 25.8 | |||
$0.040 | $0.150 | 25.8 | |||
$0.270 | $0.950 | 25.7 | |||
$0.100 | $0.400 | 25.3 | |||
$0.100 | $0.400 | 25.3 | |||
$0.300 | $0.900 | 25.3 | |||
$0.300 | $0.900 | 25.3 | |||
$0.500 | $1.500 | 25.0 | |||
$0.060 | $0.400 | 25.0 | |||
$0.060 | $0.400 | 25.0 | |||
$0.200 | $1.100 | 24.7 | |||
$0.050 | $0.200 | 22.7 | |||
$0.050 | $0.200 | 22.7 | |||
$0.280 | $1.100 | 21.7 |
Pricing from OpenRouter. Benchmarks from Artificial Analysis.

Deploy OpenClaw in Under 1 Minute— We handle hosting, scaling, and maintenance
93 out of our 301 tracked models have had a price change in March.
Get our weekly newsletter on pricing changes, new releases, and tools.
BIRD-CRITIC — multi-turn benchmark testing SQL generation and database interaction.
This leaderboard shows all models with BIRD-CRITIC benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
Built by @aellman
2026 68 Ventures, LLC. All rights reserved.