Compare LLM performance for retrieval-augmented generation. Models are ranked by MMLU-Pro (knowledge breadth) with reasoning and instruction-following scores.

102 out of our 300 tracked models have had a price change in February.
This leaderboard ranks AI models by their MMLU-Pro benchmark score, helping you find the best LLM for RAG.
All models shown have benchmark data available. Pricing is shown per million tokens from OpenRouter.
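
As a rough illustration of how a ranking by MMLU-Pro and per-million-token pricing fit together, here is a minimal Python sketch. The model names, scores, and prices are hypothetical placeholders, not leaderboard data, and the `ModelEntry`, `rank_by_mmlu_pro`, and `request_cost` names are invented for this example.

```python
from dataclasses import dataclass


@dataclass
class ModelEntry:
    name: str
    mmlu_pro: float      # MMLU-Pro score, in percent
    input_price: float   # USD per million input tokens
    output_price: float  # USD per million output tokens


def rank_by_mmlu_pro(models):
    """Return the models sorted from highest to lowest MMLU-Pro score."""
    return sorted(models, key=lambda m: m.mmlu_pro, reverse=True)


def request_cost(model, input_tokens, output_tokens):
    """Estimate the USD cost of one request from per-million-token prices."""
    total = input_tokens * model.input_price + output_tokens * model.output_price
    return total / 1_000_000


# Hypothetical entries; every value below is a placeholder.
models = [
    ModelEntry("model-a", mmlu_pro=70.0, input_price=1.00, output_price=4.00),
    ModelEntry("model-b", mmlu_pro=80.0, input_price=3.00, output_price=12.00),
    ModelEntry("model-c", mmlu_pro=60.0, input_price=0.25, output_price=1.00),
]

ranked = rank_by_mmlu_pro(models)
for position, model in enumerate(ranked, start=1):
    # A RAG request is typically input-heavy: retrieved context plus a short answer.
    cost = request_cost(model, input_tokens=8_000, output_tokens=500)
    print(f"{position}. {model.name}  MMLU-Pro {model.mmlu_pro:.1f}  ~${cost:.4f} per request")
```

Because retrieved passages usually dominate the prompt in a RAG request, the input price tends to matter more than the output price when comparing models this way.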
Built by @aellman
© 2026 68 Ventures, LLC. All rights reserved.