Price Per TokenPrice Per Token
CerebrasvsFireworks AI

Cerebras vs Fireworks AI

Compare pricing across 3 shared models. Cerebras offers 4 models, Fireworks AI offers 201.

8 Ways to Use Fewer Tokens

Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.

3
Shared Models
2
Cerebras Cheaper
1
Fireworks AI Cheaper
0
Same Price

Price Comparison — Shared Models

Model ↑Cerebras Input Fireworks AI Input Cerebras Output Fireworks AI Output Cheaper
GPT-OSS-120b $0.350 $0.150 $0.750 $0.600Fireworks AI
Llama 3.1 8B Instruct $0.100 $0.200 $0.100 $0.200Cerebras
Qwen3 235B A22B Instruct 2507 $0.600 $0.900 $1.20 $0.900Cerebras

Model Coverage

CerebrasOnly on Cerebras(1)
GLM 4.7$2.25/M
Shared(3)
3
models available on both
CerebrasFireworks AI
4 total201 total
Fireworks AIOnly on Fireworks AI(198)
CodeGemma 2B$0.100/M
CodeGemma 7B$0.200/M
FARE 20B$0.900/M
Gemma 2 9B$0.200/M
Gemma 2B$0.100/M
Gemma 3 12B$0.200/M
Gemma 3 1B$0.100/M
Gemma 3 27B$0.900/M
Gemma 3 4B$0.200/M
Gemma 7B$0.200/M
GLM 5$1.00/M
GLM 5.1$1.40/M
GPT-OSS-20b$0.070/M
InternVL3 8B$0.200/M
KAT Dev 32B$0.900/M
Kimi K2.5$0.600/M
Kimi K2.6$0.950/M
Llama 2 13B$0.200/M
Llama 2 70B$0.900/M
Llama 2 7B$0.200/M
Llama 3 8B$0.200/M
Llama 3.2 1B$0.100/M
Llama 3.2 3B$0.100/M
MedGemma 27B$0.900/M
MiniMax M2.7$0.300/M
Mistral 7B$0.200/M
Mixtral 8x7B$0.500/M
Molmo 2 4B$0.200/M
Molmo 2 8B$0.200/M
MythoMax 13B$0.200/M
Pythia 12B$0.200/M
Qwen2.5 14B$0.200/M
Qwen2.5 32B$0.900/M
Qwen2.5 72B$0.900/M
Qwen2.5 7B$0.200/M
Qwen3 0.6B$0.200/M
Qwen3 1.7B$0.200/M
Qwen3 14B$0.200/M
Qwen3 32B$0.900/M
Qwen3 4B$0.200/M
Qwen3 8B$0.200/M
Qwen3.5 9B$0.200/M
Qwen3.5-27B$0.900/M
QwQ 32B$0.900/M
Toppy M 7B$0.200/M

Full Provider Pricing

Frequently Asked Questions

Cerebras is cheaper on 2 out of 3 shared models. Fireworks AI is cheaper on 1 models. 0 models have the same price.
Cerebras and Fireworks AI share 3 models. Cerebras has 1 exclusive model, while Fireworks AI has 198 exclusive models.
Fireworks AI offers 201 models compared to Cerebras's 4. However, model count alone doesn't determine the better provider — consider pricing, latency, and which specific models you need.