Price Per TokenPrice Per Token
DeepInfravsFireworks AI

DeepInfra vs Fireworks AI

Compare pricing across 44 shared models. DeepInfra offers 60 models, Fireworks AI offers 200.

8 Ways to Use Fewer Tokens

Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.

44
Shared Models
36
DeepInfra Cheaper
7
Fireworks AI Cheaper
1
Same Price

Price Comparison — Shared Models

Model ↑DeepInfra Input Fireworks AI Input DeepInfra Output Fireworks AI Output Cheaper
DeepSeek V3 $0.320 $0.560 $0.890 $1.68DeepInfra
DeepSeek V3 0324 $0.200 $0.560 $0.770 $1.68DeepInfra
DeepSeek V3.1 $0.210 $0.560 $0.790 $1.68DeepInfra
DeepSeek V3.1 Terminus $0.210 $0.560 $0.790 $1.68DeepInfra
DeepSeek V3.2 $0.260 $0.560 $0.380 $1.68DeepInfra
Gemma 3 12B $0.040 $0.200 $0.130 $0.200DeepInfra
Gemma 3 27B $0.080 $0.900 $0.160 $0.900DeepInfra
Gemma 3 4B $0.040 $0.200 $0.080 $0.200DeepInfra
GLM 4.7 $0.400 $0.600 $1.75 $2.20DeepInfra
GLM 5 $0.800 $1.00 $2.56 $3.20DeepInfra
GLM-4.7-Flash $0.060 $0.600 $0.400 $2.20DeepInfra
GPT-OSS-120b $0.150 $0.100 $0.600 $0.100Fireworks AI
GPT-OSS-20b $0.030 $0.100 $0.140 $0.100DeepInfra
Kimi K2 0711 $0.550 $0.600 $2.20 $2.50DeepInfra
Kimi K2 0905 (exacto) $0.400 $0.600 $2.00 $2.50DeepInfra
Kimi K2 Thinking $0.470 $0.600 $2.00 $2.50DeepInfra
Kimi K2.5 $0.450 $0.600 $2.25 $3.00DeepInfra
Llama 3 8B Instruct $0.030 $0.200 $0.040 $0.200DeepInfra
Llama 3.1 70B Instruct $0.400 $0.900 $0.400 $0.900DeepInfra
Llama 3.1 8B Instruct $0.020 $0.200 $0.050 $0.200DeepInfra
Llama 3.1 Nemotron 70B Instruct $1.20 $0.900 $1.20 $0.900Fireworks AI
Llama 3.2 11B Vision Instruct $0.049 $0.200 $0.049 $0.200DeepInfra
Llama 3.3 70B Instruct $0.100 $0.900 $0.320 $0.900DeepInfra
MiniMax M2 $0.254 $0.300 $1.02 $1.20DeepInfra
MiniMax M2.1 $0.270 $0.300 $0.950 $1.20DeepInfra
MiniMax M2.5 $0.270 $0.300 $0.950 $1.20DeepInfra
Mistral Small 24B Instruct 2501 $0.050 $0.900 $0.080 $0.900DeepInfra
Mixtral 8x7B Instruct $0.540 $0.500 $0.540 $0.500Fireworks AI
MythoMax 13B $0.400 $0.200 $0.400 $0.200Fireworks AI
Nemotron 3 Nano 30B A3B $0.050 $0.900 $0.200 $0.900DeepInfra
Nemotron Nano 12B 2 VL $0.200 $0.200 $0.600 $0.200Fireworks AI
Nemotron Nano 9B V2 $0.040 $0.200 $0.160 $0.200DeepInfra
Qwen2.5 72B Instruct $0.120 $0.900 $0.390 $0.900DeepInfra
Qwen2.5 VL 32B Instruct $0.200 $0.900 $0.600 $0.900DeepInfra
Qwen3 14B $0.120 $0.200 $0.240 $0.200DeepInfra
Qwen3 235B A22B Instruct 2507 $0.071 $0.900 $0.100 $0.900DeepInfra
Qwen3 235B A22B Thinking 2507 $0.230 $0.900 $2.30 $0.900Fireworks AI
Qwen3 30B A3B $0.080 $0.900 $0.280 $0.900DeepInfra
Qwen3 32B $0.080 $0.900 $0.280 $0.900DeepInfra
Qwen3 Coder 480B A35B (exacto) $0.400 $0.900 $1.60 $0.900Fireworks AI
Qwen3 Next 80B A3B Instruct $0.090 $0.900 $1.10 $0.900DeepInfra
Qwen3 VL 235B A22B Instruct $0.200 $0.900 $0.880 $0.900DeepInfra
Qwen3 VL 30B A3B Instruct $0.150 $0.150 $0.600 $0.600Same
R1 Distill Llama 70B $0.700 $0.900 $0.800 $0.900DeepInfra

Model Coverage

Shared(44)
44
models available on both
DeepInfraFireworks AI
60 total200 total
Fireworks AIOnly on Fireworks AI(156)
CodeGemma 2B$0.100/M
CodeGemma 7B$0.200/M
FARE 20B$0.900/M
Gemma 2 9B$0.200/M
Gemma 2B$0.100/M
Gemma 7B$0.200/M
InternVL3 8B$0.200/M
KAT Dev 32B$0.900/M
Llama 2 13B$0.200/M
Llama 2 70B$0.900/M
Llama 2 7B$0.200/M
Llama 3 8B$0.200/M
Llama 3.2 1B$0.100/M
Llama 3.2 3B$0.100/M
MedGemma 27B$0.900/M
Mistral 7B$0.200/M
Mixtral 8x7B$0.500/M
Molmo 2 4B$0.200/M
Molmo 2 8B$0.200/M
Pythia 12B$0.200/M
Qwen2.5 14B$0.200/M
Qwen2.5 32B$0.900/M
Qwen2.5 72B$0.900/M
Qwen2.5 7B$0.200/M
Qwen3 0.6B$0.200/M
Qwen3 1.7B$0.200/M
Qwen3 4B$0.200/M
Qwen3 8B$0.200/M
QwQ 32B$0.900/M
Toppy M 7B$0.200/M

Full Provider Pricing

Frequently Asked Questions

DeepInfra is cheaper on 36 out of 44 shared models. Fireworks AI is cheaper on 7 models. 1 models have the same price.
DeepInfra and Fireworks AI share 44 models. DeepInfra has 16 exclusive models, while Fireworks AI has 156 exclusive models.
Fireworks AI offers 200 models compared to DeepInfra's 60. However, model count alone doesn't determine the better provider — consider pricing, latency, and which specific models you need.