Track the evolution of AI models over time. Compare benchmark scores, pricing trends, and the open source vs closed source frontier.
How does model performance correlate with API pricing? Higher benchmark scores with lower prices indicate better value.
Track the frontier of AI capabilities as new models are released each month.
Provider frontier comparison - MMLU-Pro benchmark (includes reasoning variants)
Provider frontier comparison - Graduate-level science (includes reasoning variants)
Provider frontier comparison - Real-world coding (includes reasoning variants)
Provider frontier comparison - Competitive programming (includes reasoning variants)
Provider frontier comparison - Competition math (includes reasoning variants)
Provider frontier comparison - Maximum context window
Compare the frontier capabilities between open source and proprietary models over time.
Comparing the best open source and closed source models each month
Graduate-level science benchmark comparison
Real-world coding benchmark comparison
Competitive programming benchmark comparison
Competition math benchmark comparison
Maximum context window comparison
Data includes 296 models from 2023-05 to 2025-12
Benchmark data sourced from Artificial Analysis. Pricing data updated daily.