vs
DeepInfra vs Fireworks AI
Compare pricing across 38 shared models. DeepInfra offers 67 models, Fireworks AI offers 201.
8 Ways to Use Fewer Tokens
Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.
38
Shared Models
30
DeepInfra Cheaper
7
Fireworks AI Cheaper
1
Same Price
Price Comparison — Shared Models
| Model ↑ | DeepInfra Input | Fireworks AI Input | DeepInfra Output | Fireworks AI Output | Cheaper |
|---|---|---|---|---|---|
| DeepSeek V4 Pro | $1.30 | $1.74 | $2.60 | $3.48 | DeepInfra |
| Gemma 3 12B | $0.050 | $0.200 | $0.150 | $0.200 | DeepInfra |
| Gemma 3 27B | $0.080 | $0.900 | $0.160 | $0.900 | DeepInfra |
| Gemma 3 4B | $0.050 | $0.200 | $0.100 | $0.200 | DeepInfra |
| Gemma 4 26B A4B Instruct | $0.070 | $0.900 | $0.340 | $0.900 | DeepInfra |
| Gemma 4 31B Instruct | $0.130 | $0.900 | $0.380 | $0.900 | DeepInfra |
| GLM 5.1 | $1.05 | $1.40 | $3.50 | $4.40 | DeepInfra |
| GPT-OSS-120b | $0.150 | $0.100 | $0.600 | $0.100 | Fireworks AI |
| GPT-OSS-20b | $0.030 | $0.070 | $0.140 | $0.300 | DeepInfra |
| Kimi K2.5 | $0.450 | $0.600 | $2.25 | $3.00 | DeepInfra |
| Kimi K2.6 | $0.750 | $0.950 | $3.50 | $4.00 | DeepInfra |
| Llama 3.1 70B Instruct | $0.400 | $0.900 | $0.400 | $0.900 | DeepInfra |
| Llama 3.1 8B Instruct | $0.020 | $0.200 | $0.030 | $0.200 | DeepInfra |
| Llama 3.2 11B Vision Instruct | $0.345 | $0.200 | $0.345 | $0.200 | Fireworks AI |
| Llama 3.3 70B Instruct | $0.100 | $0.900 | $0.320 | $0.900 | DeepInfra |
| MiniMax M2.7 | $0.300 | $0.300 | $1.20 | $1.20 | Same |
| Mistral Small 24B Instruct 2501 | $0.050 | $0.900 | $0.080 | $0.900 | DeepInfra |
| MythoMax 13B | $0.400 | $0.200 | $0.400 | $0.200 | Fireworks AI |
| Nemotron 3 Nano 30B A3B | $0.050 | $0.900 | $0.200 | $0.900 | DeepInfra |
| Nemotron 3 Super 120B A12B | $0.100 | $0.900 | $0.500 | $0.900 | DeepInfra |
| Nemotron Nano 9B V2 | $0.040 | $0.200 | $0.160 | $0.200 | DeepInfra |
| Qwen2.5 72B Instruct | $0.360 | $0.900 | $0.400 | $0.900 | DeepInfra |
| Qwen3 14B | $0.120 | $0.200 | $0.240 | $0.200 | DeepInfra |
| Qwen3 235B A22B Instruct 2507 | $0.090 | $0.900 | $0.100 | $0.900 | DeepInfra |
| Qwen3 235B A22B Thinking 2507 | $0.230 | $0.900 | $2.30 | $0.900 | Fireworks AI |
| Qwen3 30B A3B | $0.120 | $0.900 | $0.500 | $0.900 | DeepInfra |
| Qwen3 32B | $0.080 | $0.900 | $0.280 | $0.900 | DeepInfra |
| Qwen3 Coder 480B A35B (exacto) | $0.300 | $0.900 | $1.00 | $0.900 | DeepInfra |
| Qwen3 Next 80B A3B Instruct | $0.090 | $0.900 | $1.10 | $0.900 | DeepInfra |
| Qwen3 VL 235B A22B Instruct | $0.200 | $0.900 | $0.880 | $0.900 | DeepInfra |
| Qwen3 VL 30B A3B Instruct | $0.150 | $0.900 | $0.600 | $0.900 | DeepInfra |
| Qwen3.5 397B A17B | $0.450 | $0.900 | $3.00 | $0.900 | Fireworks AI |
| Qwen3.5 9B | $0.100 | $0.200 | $0.150 | $0.200 | DeepInfra |
| Qwen3.5-122B-A10B | $0.290 | $0.900 | $2.40 | $0.900 | Fireworks AI |
| Qwen3.5-27B | $0.260 | $0.900 | $2.60 | $0.900 | Fireworks AI |
| Qwen3.5-35B-A3B | $0.140 | $0.900 | $1.00 | $0.900 | DeepInfra |
| Qwen3.6 35B A3B | $0.150 | $0.900 | $0.950 | $0.900 | DeepInfra |
| R1 Distill Llama 70B | $0.700 | $0.900 | $0.800 | $0.900 | DeepInfra |
Model Coverage
Only on DeepInfra(29)DeepSeek V3$0.320/M
DeepSeek V3 0324$0.200/M
DeepSeek V3.1$0.210/M
DeepSeek V3.1 Terminus$0.270/M
DeepSeek V3.2$0.260/M
GLM 4.6$0.430/M
GLM 4.7$0.400/M
GLM 5$0.600/M
GLM-4.7-Flash$0.060/M
Hermes 3 405B Instruct$1.00/M
Hermes 3 70B Instruct$0.700/M
Llama 3 8B Lunaris$0.040/M
Llama 3.1 Euryale 70B v2.2$0.850/M
Llama 4 Maverick$0.150/M
Llama 4 Scout$0.100/M
meta-llama-llama-guard-4-12b$0.180/M
MiMo v2.5$0.400/M
MiMo v2.5 Pro$1.00/M
MiniMax M2.5$0.150/M
Mistral Nemo$0.020/M
Mistral Small 3.2 24B$0.075/M
Nemotron-3 Super 120B A12B$0.100/M
Phi 4$0.070/M
Qwen3 Max$1.20/M
Qwen3 Max Thinking$1.20/M
R1 0528$0.500/M
Step 3.5 Flash$0.090/M
Shared(38)
38
models available on both
DeepInfraFireworks AI
67 total201 total
Only on Fireworks AI(163)Chronos Hermes 13B v2$0.200/M
Code Llama 13B$0.200/M
Code Llama 13B Instruct$0.200/M
Code Llama 13B Python$0.200/M
Code Llama 34B$0.900/M
Code Llama 34B Instruct$0.900/M
Code Llama 34B Python$0.900/M
Code Llama 70B$0.900/M
Code Llama 70B Instruct$0.900/M
Code Llama 70B Python$0.900/M
Code Llama 7B$0.200/M
Code Llama 7B Instruct$0.200/M
CodeGemma 2B$0.100/M
CodeGemma 7B$0.200/M
CodeQwen 1.5 7B$0.200/M
Cogito v1 Preview Llama 3B$0.100/M
Cogito v1 Preview Llama 70B$0.900/M
Cogito v1 Preview Llama 8B$0.200/M
Cogito v1 Preview Qwen 14B$0.200/M
Cogito v1 Preview Qwen 32B$0.900/M
Cogito v2.1 671B$0.900/M
DeepSeek Coder 1.3B Base$0.100/M
DeepSeek Coder 33B Instruct$0.900/M
DeepSeek Coder 7B Base$0.200/M
DeepSeek Coder 7B Base v1.5$0.200/M
DeepSeek Coder 7B Instruct v1.5$0.200/M
DeepSeek R1 0528 Qwen3 8B$0.200/M
Devstral 2 2512$0.900/M
Dolphin 2.6 Mixtral 8x7B$0.500/M
Dolphin 2.9.2 Qwen2 72B$0.900/M
ERNIE 4.5 21B A3B$0.900/M
ERNIE 4.5 300B A47B$0.900/M
FARE 20B$0.900/M
Gemma 2 9B$0.200/M
Gemma 2B$0.100/M
Gemma 3 1B$0.100/M
Gemma 4 E4B IT$0.200/M
Gemma 7B$0.200/M
Gemma 7B Instruct$0.200/M
Hermes 2 Pro Mistral 7B$0.200/M
InternVL3 38B$0.900/M
InternVL3 78B$0.900/M
InternVL3 8B$0.200/M
KAT Dev 32B$0.900/M
KAT Dev 72B Exp$0.900/M
Kimi K2 Thinking$0.600/M
Llama 2 13B$0.200/M
Llama 2 13B Chat$0.200/M
Llama 2 70B$0.900/M
Llama 2 7B$0.200/M
Llama 2 7B Chat$0.200/M
Llama 3 70B Instruct$0.900/M
Llama 3 70B Instruct (HF)$0.900/M
Llama 3 8B$0.200/M
Llama 3 8B Instruct$0.200/M
Llama 3 8B Instruct (HF)$0.200/M
Llama 3.1 405B Instruct$0.900/M
Llama 3.1 405B Instruct Long$0.900/M
Llama 3.1 70B Instruct 1B$0.900/M
Llama 3.1 Nemotron 70B Instruct$0.900/M
Llama 3.2 1B$0.100/M
Llama 3.2 1B Instruct$0.100/M
Llama 3.2 3B$0.100/M
Llama 3.2 3B Instruct$0.100/M
Llama 3.2 90B Vision Instruct$0.900/M
MedGemma 27B$0.900/M
meta-llama-llama-guard-2-8b$0.200/M
meta-llama-llama-guard-3-1b$0.100/M
meta-llama-llama-guard-3-8b$0.200/M
meta-llama-llamaguard-7b$0.200/M
Ministral 3 14B 2512$0.200/M
Ministral 3 3B 2512$0.100/M
Ministral 3 8B 2512$0.200/M
Mistral 7B$0.200/M
Mistral 7B Instruct v0.2$0.200/M
Mistral 7B Instruct v0.3$0.200/M
Mistral 7B OpenOrca$0.200/M
Mistral 7B v0.2$0.200/M
Mixtral 8x22B$1.20/M
Mixtral 8x22B Instruct$1.20/M
Mixtral 8x7B$0.500/M
Mixtral 8x7B Instruct$0.500/M
Mixtral 8x7B Instruct (HF)$0.500/M
Molmo 2 4B$0.200/M
Molmo 2 8B$0.200/M
Nemotron Nano 12B 2 VL$0.200/M
Nemotron Nano 12B V2$0.200/M
Nous Capybara 7B v1.9$0.200/M
Nous Hermes 2 Mixtral 8x7B DPO$0.500/M
Nous Hermes Llama 2 13B$0.200/M
Nous Hermes Llama 2 70B$0.900/M
Nous Hermes Llama 2 7B$0.200/M
openai-gpt-oss-safeguard-120b$0.900/M
openai-gpt-oss-safeguard-20b$0.900/M
OpenChat 3.5 0106$0.200/M
OpenHermes 2 Mistral 7B$0.200/M
OpenHermes 2.5 Mistral 7B$0.200/M
Phind CodeLlama 34B Python v1$0.900/M
Phind CodeLlama 34B v1$0.900/M
Phind CodeLlama 34B v2$0.900/M
Pythia 12B$0.200/M
Qwen 1.5 72B Chat$0.900/M
Qwen2 72B Instruct$0.900/M
Qwen2 7B Instruct$0.200/M
Qwen2 VL 2B Instruct$0.100/M
Qwen2 VL 72B Instruct$0.900/M
Qwen2 VL 7B Instruct$0.200/M
Qwen2.5 0.5B Instruct$0.200/M
Qwen2.5 1.5B Instruct$0.200/M
Qwen2.5 14B$0.200/M
Qwen2.5 14B Instruct$0.200/M
Qwen2.5 32B$0.900/M
Qwen2.5 32B Instruct$0.900/M
Qwen2.5 72B$0.900/M
Qwen2.5 7B$0.200/M
Qwen2.5 7B Instruct$0.200/M
Qwen2.5 Coder 0.5B$0.200/M
Qwen2.5 Coder 0.5B Instruct$0.200/M
Qwen2.5 Coder 1.5B$0.200/M
Qwen2.5 Coder 1.5B Instruct$0.200/M
Qwen2.5 Coder 14B$0.200/M
Qwen2.5 Coder 14B Instruct$0.200/M
Qwen2.5 Coder 32B$0.900/M
Qwen2.5 Coder 32B Instruct$0.900/M
Qwen2.5 Coder 32B Instruct 128K$0.900/M
Qwen2.5 Coder 32B Instruct 64K$0.900/M
Qwen2.5 Coder 3B$0.100/M
Qwen2.5 Coder 3B Instruct$0.100/M
Qwen2.5 Coder 7B$0.200/M
Qwen2.5 Coder 7B Instruct$0.200/M
Qwen2.5 Math 72B Instruct$0.900/M
Qwen2.5 VL 32B Instruct$0.900/M
Qwen2.5 VL 3B Instruct$0.100/M
Qwen2.5 VL 72B Instruct$0.900/M
Qwen2.5-VL 7B Instruct$0.200/M
Qwen3 0.6B$0.200/M
Qwen3 1.7B$0.200/M
Qwen3 235B A22B$0.900/M
Qwen3 30B A3B Instruct 2507$0.900/M
Qwen3 30B A3B Thinking 2507$0.900/M
Qwen3 4B$0.200/M
Qwen3 4B Instruct 2507$0.200/M
Qwen3 8B$0.200/M
Qwen3 Coder 30B A3B Instruct$0.900/M
Qwen3 Coder 480B Instruct BF16$0.900/M
Qwen3 Next 80B A3B Thinking$0.900/M
Qwen3 Omni 30B A3B Instruct$0.900/M
Qwen3 VL 235B A22B Thinking$0.900/M
Qwen3 VL 30B A3B Thinking$0.900/M
Qwen3 VL 32B Instruct$0.900/M
Qwen3 VL 8B Instruct$0.200/M
QwQ 32B$0.900/M
QwQ 32B Preview$0.900/M
R1 Distill Llama 8B$0.200/M
R1 Distill Qwen 1.5B$0.200/M
R1 Distill Qwen 14B$0.200/M
R1 Distill Qwen 32B$0.900/M
R1 Distill Qwen 7B$0.200/M
Seed OSS 36B Instruct$0.900/M
Snorkel Mistral PairRM DPO$0.200/M
Toppy M 7B$0.200/M
Zephyr 7B Beta$0.200/M
Full Provider Pricing
Frequently Asked Questions
Built by @aellman
Tools
Directories
Models & Pricing
Endpoints
Rankings
- All Rankings
- All Benchmarks
- Best LLM for Coding
- Best LLM for Math
- Best LLM for Writing
- Best LLM for RAG
- Best Local LLM
- Best LLM for OpenClaw
- Best LLM for Cursor
- Best LLM for Windsurf
- Best LLM for Cline
- Best LLM for Aider
- Best LLM for GitHub Copilot
- Best LLM for Bolt
- Best LLM for Continue.dev
- MMLU-Pro
- GPQA
- LiveCodeBench
- Aider
- AIME
- MATH (Hard)
- Big-Bench Hard
2026 68 Ventures, LLC. All rights reserved.