vs
DeepInfra vs Groq
Compare pricing across 8 shared models. DeepInfra offers 67 models, Groq offers 15.
8 Ways to Use Fewer Tokens
Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.
8
Shared Models
7
DeepInfra Cheaper
0
Groq Cheaper
1
Same Price
Price Comparison — Shared Models
| Model ↑ | DeepInfra Input | Groq Input | DeepInfra Output | Groq Output | Cheaper |
|---|---|---|---|---|---|
| GPT-OSS-120b | $0.150 | $0.150 | $0.600 | $0.600 | Same |
| GPT-OSS-20b | $0.030 | $0.075 | $0.140 | $0.300 | DeepInfra |
| Llama 3.1 8B Instruct | $0.020 | $0.050 | $0.030 | $0.080 | DeepInfra |
| Llama 3.3 70B Instruct | $0.100 | $0.590 | $0.320 | $0.790 | DeepInfra |
| Llama 4 Scout | $0.100 | $0.110 | $0.300 | $0.340 | DeepInfra |
| meta-llama-llama-guard-4-12b | $0.180 | $0.200 | $0.180 | $0.200 | DeepInfra |
| Qwen3 32B | $0.080 | $0.290 | $0.280 | $0.590 | DeepInfra |
| R1 Distill Llama 70B | $0.700 | $0.750 | $0.800 | $0.990 | DeepInfra |
Model Coverage
Only on DeepInfra(59)DeepSeek V3$0.320/M
DeepSeek V3 0324$0.200/M
DeepSeek V3.1$0.210/M
DeepSeek V3.1 Terminus$0.270/M
DeepSeek V3.2$0.260/M
DeepSeek V4 Pro$1.30/M
Gemma 3 12B$0.050/M
Gemma 3 27B$0.080/M
Gemma 3 4B$0.050/M
Gemma 4 26B A4B Instruct$0.070/M
Gemma 4 31B Instruct$0.130/M
GLM 4.6$0.430/M
GLM 4.7$0.400/M
GLM 5$0.600/M
GLM 5.1$1.05/M
GLM-4.7-Flash$0.060/M
Hermes 3 405B Instruct$1.00/M
Hermes 3 70B Instruct$0.700/M
Kimi K2.5$0.450/M
Kimi K2.6$0.750/M
Llama 3 8B Lunaris$0.040/M
Llama 3.1 70B Instruct$0.400/M
Llama 3.1 Euryale 70B v2.2$0.850/M
Llama 3.2 11B Vision Instruct$0.345/M
Llama 4 Maverick$0.150/M
MiMo v2.5$0.400/M
MiMo v2.5 Pro$1.00/M
MiniMax M2.5$0.150/M
MiniMax M2.7$0.300/M
Mistral Nemo$0.020/M
Mistral Small 24B Instruct 2501$0.050/M
Mistral Small 3.2 24B$0.075/M
MythoMax 13B$0.400/M
Nemotron 3 Nano 30B A3B$0.050/M
Nemotron 3 Super 120B A12B$0.100/M
Nemotron Nano 9B V2$0.040/M
Nemotron-3 Super 120B A12B$0.100/M
Phi 4$0.070/M
Qwen2.5 72B Instruct$0.360/M
Qwen3 14B$0.120/M
Qwen3 235B A22B Instruct 2507$0.090/M
Qwen3 235B A22B Thinking 2507$0.230/M
Qwen3 30B A3B$0.120/M
Qwen3 Coder 480B A35B (exacto)$0.300/M
Qwen3 Max$1.20/M
Qwen3 Max Thinking$1.20/M
Qwen3 Next 80B A3B Instruct$0.090/M
Qwen3 VL 235B A22B Instruct$0.200/M
Qwen3 VL 30B A3B Instruct$0.150/M
Qwen3.5 397B A17B$0.450/M
Qwen3.5 9B$0.100/M
Qwen3.5-122B-A10B$0.290/M
Qwen3.5-27B$0.260/M
Qwen3.5-35B-A3B$0.140/M
Qwen3.6 35B A3B$0.150/M
R1 0528$0.500/M
Step 3.5 Flash$0.090/M
Shared(8)
8
models available on both
DeepInfraGroq
67 total15 total
Only on Groq(7)Gemma 7B Instruct$0.070/M
Kimi K2 0711$1.00/M
Kimi K2 0905 (exacto)$1.00/M
Llama 3 8B Instruct$0.050/M
meta-llama-llama-guard-3-8b$0.200/M
Mixtral 8x7B$0.240/M
openai-gpt-oss-safeguard-20b$0.075/M
Full Provider Pricing
Frequently Asked Questions
Built by @aellman
Tools
Directories
Models & Pricing
Endpoints
Rankings
- All Rankings
- All Benchmarks
- Best LLM for Coding
- Best LLM for Math
- Best LLM for Writing
- Best LLM for RAG
- Best Local LLM
- Best LLM for OpenClaw
- Best LLM for Cursor
- Best LLM for Windsurf
- Best LLM for Cline
- Best LLM for Aider
- Best LLM for GitHub Copilot
- Best LLM for Bolt
- Best LLM for Continue.dev
- MMLU-Pro
- GPQA
- LiveCodeBench
- Aider
- AIME
- MATH (Hard)
- Big-Bench Hard
2026 68 Ventures, LLC. All rights reserved.