Price Per Token

DeepInfra vs Fireworks AI

Compare pricing across the 38 models that both providers offer. DeepInfra lists 67 models in total; Fireworks AI lists 201.

Shared Models: 38

DeepInfra Cheaper: 30

Fireworks AI Cheaper: 7

Same Price: 1

Price Comparison — Shared Models

Model	DeepInfra Input	Fireworks AI Input	DeepInfra Output	Fireworks AI Output	Cheaper
DeepSeek V4 Pro	$1.30	$1.74	$2.60	$3.48	DeepInfra
Gemma 3 12B	$0.050	$0.200	$0.150	$0.200	DeepInfra
Gemma 3 27B	$0.080	$0.900	$0.160	$0.900	DeepInfra
Gemma 3 4B	$0.050	$0.200	$0.100	$0.200	DeepInfra
Gemma 4 26B A4B Instruct	$0.070	$0.900	$0.340	$0.900	DeepInfra
Gemma 4 31B Instruct	$0.130	$0.900	$0.380	$0.900	DeepInfra
GLM 5.1	$1.05	$1.40	$3.50	$4.40	DeepInfra
GPT-OSS-120b	$0.150	$0.100	$0.600	$0.100	Fireworks AI
GPT-OSS-20b	$0.030	$0.070	$0.140	$0.300	DeepInfra
Kimi K2.5	$0.450	$0.600	$2.25	$3.00	DeepInfra
Kimi K2.6	$0.750	$0.950	$3.50	$4.00	DeepInfra
Llama 3.1 70B Instruct	$0.400	$0.900	$0.400	$0.900	DeepInfra
Llama 3.1 8B Instruct	$0.020	$0.200	$0.030	$0.200	DeepInfra
Llama 3.2 11B Vision Instruct	$0.345	$0.200	$0.345	$0.200	Fireworks AI
Llama 3.3 70B Instruct	$0.100	$0.900	$0.320	$0.900	DeepInfra
MiniMax M2.7	$0.300	$0.300	$1.20	$1.20	Same
Mistral Small 24B Instruct 2501	$0.050	$0.900	$0.080	$0.900	DeepInfra
MythoMax 13B	$0.400	$0.200	$0.400	$0.200	Fireworks AI
Nemotron 3 Nano 30B A3B	$0.050	$0.900	$0.200	$0.900	DeepInfra
Nemotron 3 Super 120B A12B	$0.100	$0.900	$0.500	$0.900	DeepInfra
Nemotron Nano 9B V2	$0.040	$0.200	$0.160	$0.200	DeepInfra
Qwen2.5 72B Instruct	$0.360	$0.900	$0.400	$0.900	DeepInfra
Qwen3 14B	$0.120	$0.200	$0.240	$0.200	DeepInfra
Qwen3 235B A22B Instruct 2507	$0.090	$0.900	$0.100	$0.900	DeepInfra
Qwen3 235B A22B Thinking 2507	$0.230	$0.900	$2.30	$0.900	Fireworks AI
Qwen3 30B A3B	$0.120	$0.900	$0.500	$0.900	DeepInfra
Qwen3 32B	$0.080	$0.900	$0.280	$0.900	DeepInfra
Qwen3 Coder 480B A35B (exacto)	$0.300	$0.900	$1.00	$0.900	DeepInfra
Qwen3 Next 80B A3B Instruct	$0.090	$0.900	$1.10	$0.900	DeepInfra
Qwen3 VL 235B A22B Instruct	$0.200	$0.900	$0.880	$0.900	DeepInfra
Qwen3 VL 30B A3B Instruct	$0.150	$0.900	$0.600	$0.900	DeepInfra
Qwen3.5 397B A17B	$0.450	$0.900	$3.00	$0.900	Fireworks AI
Qwen3.5 9B	$0.100	$0.200	$0.150	$0.200	DeepInfra
Qwen3.5-122B-A10B	$0.290	$0.900	$2.40	$0.900	Fireworks AI
Qwen3.5-27B	$0.260	$0.900	$2.60	$0.900	Fireworks AI
Qwen3.5-35B-A3B	$0.140	$0.900	$1.00	$0.900	DeepInfra
Qwen3.6 35B A3B	$0.150	$0.900	$0.950	$0.900	DeepInfra
R1 Distill Llama 70B	$0.700	$0.900	$0.800	$0.900	DeepInfra
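
The "Cheaper" column gives one verdict per model, but when one provider has the lower input price and the other the lower output price, the actual winner depends on the workload's mix of input and output tokens. The following is a minimal sketch of that calculation, not the site's method. It assumes the table prices are USD per 1M tokens (as the "/M" suffix elsewhere on the page suggests); the names `Price`, `workload_cost`, and `cheaper_provider` are hypothetical helpers introduced here for illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """Per-provider price for one model (assumed USD per 1M tokens)."""
    input_per_m: float
    output_per_m: float


def workload_cost(price: Price, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a workload with the given token counts."""
    return (
        price.input_per_m * input_tokens
        + price.output_per_m * output_tokens
    ) / 1_000_000


def cheaper_provider(prices: dict[str, Price], input_tokens: int, output_tokens: int) -> str:
    """Return the provider with the lowest cost, or "Same" on a tie."""
    costs = {
        name: workload_cost(p, input_tokens, output_tokens)
        for name, p in prices.items()
    }
    lowest = min(costs.values())
    # Use a tolerance so floating-point noise does not hide a genuine tie.
    winners = [name for name, cost in costs.items() if math.isclose(cost, lowest)]
    return winners[0] if len(winners) == 1 else "Same"


# Qwen3 14B row from the table above: DeepInfra is cheaper on input,
# Fireworks AI is cheaper on output.
QWEN3_14B = {
    "DeepInfra": Price(input_per_m=0.120, output_per_m=0.240),
    "Fireworks AI": Price(input_per_m=0.200, output_per_m=0.200),
}

if __name__ == "__main__":
    # Input-heavy workload (e.g. long prompts, short answers).
    print(cheaper_provider(QWEN3_14B, 1_000_000, 100_000))
    # Output-heavy workload (e.g. short prompts, long generations).
    print(cheaper_provider(QWEN3_14B, 100_000, 1_000_000))
```

For the Qwen3 14B prices, the two providers break even when output tokens are twice the input tokens; below that ratio DeepInfra costs less, above it Fireworks AI does. The same check is worth running for the other rows where the two price columns disagree, such as Qwen3 Coder 480B A35B or Qwen3.6 35B A3B.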

Model Coverage

DeepInfra

Only on DeepInfra (29)

DeepSeek V3	$0.320/M
DeepSeek V3 0324	$0.200/M
DeepSeek V3.1	$0.210/M
DeepSeek V3.1 Terminus	$0.270/M
DeepSeek V3.2	$0.260/M
DeepSeek V4 Flash (Non-Reasoning)	$0.100/M
GLM 4.6	$0.430/M
GLM 4.7	$0.400/M
GLM 5	$0.600/M
GLM-4.7-Flash	$0.060/M
Hermes 3 405B Instruct	$1.00/M
Hermes 3 70B Instruct	$0.700/M
Llama 3 8B Lunaris	$0.040/M
Llama 3.1 Euryale 70B v2.2	$0.850/M
Llama 3.3 Nemotron Super 49B V1.5	$0.400/M
Llama 4 Maverick	$0.150/M
Llama 4 Scout	$0.100/M
meta-llama-llama-guard-4-12b	$0.180/M
MiMo v2.5	$0.400/M
MiMo v2.5 Pro	$1.00/M
MiniMax M2.5	$0.150/M
Mistral Nemo	$0.020/M
Mistral Small 3.2 24B	$0.075/M
Nemotron-3 Super 120B A12B	$0.100/M
Phi 4	$0.070/M
Qwen3 Max	$1.20/M
Qwen3 Max Thinking	$1.20/M
R1 0528	$0.500/M
Step 3.5 Flash	$0.090/M

Shared (38)

38 models are available on both providers. DeepInfra lists 67 models in total; Fireworks AI lists 201.

Fireworks AI

Only on Fireworks AI (163)

Chronos Hermes 13B v2	$0.200/M
Code Llama 13B	$0.200/M
Code Llama 13B Instruct	$0.200/M
Code Llama 13B Python	$0.200/M
Code Llama 34B	$0.900/M
Code Llama 34B Instruct	$0.900/M
Code Llama 34B Python	$0.900/M
Code Llama 70B	$0.900/M
Code Llama 70B Instruct	$0.900/M
Code Llama 70B Python	$0.900/M
Code Llama 7B	$0.200/M
Code Llama 7B Instruct	$0.200/M
CodeGemma 2B	$0.100/M
CodeGemma 7B	$0.200/M
CodeQwen 1.5 7B	$0.200/M
Cogito v1 Preview Llama 3B	$0.100/M
Cogito v1 Preview Llama 70B	$0.900/M
Cogito v1 Preview Llama 8B	$0.200/M
Cogito v1 Preview Qwen 14B	$0.200/M
Cogito v1 Preview Qwen 32B	$0.900/M
Cogito v2.1 671B	$0.900/M
DeepSeek Coder 1.3B Base	$0.100/M
DeepSeek Coder 33B Instruct	$0.900/M
DeepSeek Coder 7B Base	$0.200/M
DeepSeek Coder 7B Base v1.5	$0.200/M
DeepSeek Coder 7B Instruct v1.5	$0.200/M
DeepSeek R1 0528 Qwen3 8B	$0.200/M
Devstral 2 2512	$0.900/M
Dolphin 2.6 Mixtral 8x7B	$0.500/M
Dolphin 2.9.2 Qwen2 72B	$0.900/M
ERNIE 4.5 21B A3B	$0.900/M
ERNIE 4.5 300B A47B	$0.900/M
FARE 20B	$0.900/M
Gemma 2 9B	$0.200/M
Gemma 2B	$0.100/M
Gemma 3 1B	$0.100/M
Gemma 4 E4B IT	$0.200/M
Gemma 7B	$0.200/M
Gemma 7B Instruct	$0.200/M
Hermes 2 Pro Mistral 7B	$0.200/M
InternVL3 38B	$0.900/M
InternVL3 78B	$0.900/M
InternVL3 8B	$0.200/M
KAT Dev 32B	$0.900/M
KAT Dev 72B Exp	$0.900/M
Kimi K2 Thinking	$0.600/M
Llama 2 13B	$0.200/M
Llama 2 13B Chat	$0.200/M
Llama 2 70B	$0.900/M
Llama 2 7B	$0.200/M
Llama 2 7B Chat	$0.200/M
Llama 3 70B Instruct	$0.900/M
Llama 3 70B Instruct (HF)	$0.900/M
Llama 3 8B	$0.200/M
Llama 3 8B Instruct	$0.200/M
Llama 3 8B Instruct (HF)	$0.200/M
Llama 3.1 405B Instruct	$0.900/M
Llama 3.1 405B Instruct Long	$0.900/M
Llama 3.1 70B Instruct 1B	$0.900/M
Llama 3.1 Nemotron 70B Instruct	$0.900/M
Llama 3.2 1B	$0.100/M
Llama 3.2 1B Instruct	$0.100/M
Llama 3.2 3B	$0.100/M
Llama 3.2 3B Instruct	$0.100/M
Llama 3.2 90B Vision Instruct	$0.900/M
MedGemma 27B	$0.900/M
meta-llama-llama-guard-2-8b	$0.200/M
meta-llama-llama-guard-3-1b	$0.100/M
meta-llama-llama-guard-3-8b	$0.200/M
meta-llama-llamaguard-7b	$0.200/M
Ministral 3 14B 2512	$0.200/M
Ministral 3 3B 2512	$0.100/M
Ministral 3 8B 2512	$0.200/M
Mistral 7B	$0.200/M
Mistral 7B Instruct v0.2	$0.200/M
Mistral 7B Instruct v0.3	$0.200/M
Mistral 7B OpenOrca	$0.200/M
Mistral 7B v0.2	$0.200/M
Mixtral 8x22B	$1.20/M
Mixtral 8x22B Instruct	$1.20/M
Mixtral 8x7B	$0.500/M
Mixtral 8x7B Instruct	$0.500/M
Mixtral 8x7B Instruct (HF)	$0.500/M
Molmo 2 4B	$0.200/M
Molmo 2 8B	$0.200/M
Nemotron Nano 12B 2 VL	$0.200/M
Nemotron Nano 12B V2	$0.200/M
Nous Capybara 7B v1.9	$0.200/M
Nous Hermes 2 Mixtral 8x7B DPO	$0.500/M
Nous Hermes Llama 2 13B	$0.200/M
Nous Hermes Llama 2 70B	$0.900/M
Nous Hermes Llama 2 7B	$0.200/M
openai-gpt-oss-safeguard-120b	$0.900/M
openai-gpt-oss-safeguard-20b	$0.900/M
OpenChat 3.5 0106	$0.200/M
OpenHermes 2 Mistral 7B	$0.200/M
OpenHermes 2.5 Mistral 7B	$0.200/M
Phind CodeLlama 34B Python v1	$0.900/M
Phind CodeLlama 34B v1	$0.900/M
Phind CodeLlama 34B v2	$0.900/M
Pythia 12B	$0.200/M
Qwen 1.5 72B Chat	$0.900/M
Qwen2 72B Instruct	$0.900/M
Qwen2 7B Instruct	$0.200/M
Qwen2 VL 2B Instruct	$0.100/M
Qwen2 VL 72B Instruct	$0.900/M
Qwen2 VL 7B Instruct	$0.200/M
Qwen2.5 0.5B Instruct	$0.200/M
Qwen2.5 1.5B Instruct	$0.200/M
Qwen2.5 14B	$0.200/M
Qwen2.5 14B Instruct	$0.200/M
Qwen2.5 32B	$0.900/M
Qwen2.5 32B Instruct	$0.900/M
Qwen2.5 72B	$0.900/M
Qwen2.5 7B	$0.200/M
Qwen2.5 7B Instruct	$0.200/M
Qwen2.5 Coder 0.5B	$0.200/M
Qwen2.5 Coder 0.5B Instruct	$0.200/M
Qwen2.5 Coder 1.5B	$0.200/M
Qwen2.5 Coder 1.5B Instruct	$0.200/M
Qwen2.5 Coder 14B	$0.200/M
Qwen2.5 Coder 14B Instruct	$0.200/M
Qwen2.5 Coder 32B	$0.900/M
Qwen2.5 Coder 32B Instruct	$0.900/M
Qwen2.5 Coder 32B Instruct 128K	$0.900/M
Qwen2.5 Coder 32B Instruct 32K RoPE	$0.900/M
Qwen2.5 Coder 32B Instruct 64K	$0.900/M
Qwen2.5 Coder 3B	$0.100/M
Qwen2.5 Coder 3B Instruct	$0.100/M
Qwen2.5 Coder 7B	$0.200/M
Qwen2.5 Coder 7B Instruct	$0.200/M
Qwen2.5 Math 72B Instruct	$0.900/M
Qwen2.5 VL 32B Instruct	$0.900/M
Qwen2.5 VL 3B Instruct	$0.100/M
Qwen2.5 VL 72B Instruct	$0.900/M
Qwen2.5-VL 7B Instruct	$0.200/M
Qwen3 0.6B	$0.200/M
Qwen3 1.7B	$0.200/M
Qwen3 235B A22B	$0.900/M
Qwen3 30B A3B Instruct 2507	$0.900/M
Qwen3 30B A3B Thinking 2507	$0.900/M
Qwen3 4B	$0.200/M
Qwen3 4B Instruct 2507	$0.200/M
Qwen3 8B	$0.200/M
Qwen3 Coder 30B A3B Instruct	$0.900/M
Qwen3 Coder 480B Instruct BF16	$0.900/M
Qwen3 Next 80B A3B Thinking	$0.900/M
Qwen3 Omni 30B A3B Instruct	$0.900/M
Qwen3 VL 235B A22B Thinking	$0.900/M
Qwen3 VL 30B A3B Thinking	$0.900/M
Qwen3 VL 32B Instruct	$0.900/M
Qwen3 VL 8B Instruct	$0.200/M
QwQ 32B	$0.900/M
QwQ 32B Preview	$0.900/M
R1 Distill Llama 8B	$0.200/M
R1 Distill Qwen 1.5B	$0.200/M
R1 Distill Qwen 14B	$0.200/M
R1 Distill Qwen 32B	$0.900/M
R1 Distill Qwen 7B	$0.200/M
Seed OSS 36B Instruct	$0.900/M
Snorkel Mistral PairRM DPO	$0.200/M
Toppy M 7B	$0.200/M
Zephyr 7B Beta	$0.200/M


Built by @aellman


2026 68 Ventures, LLC. All rights reserved.