Cerebras vs DeepInfra

Compare pricing across 2 shared models. Cerebras offers 2 models, DeepInfra offers 67.

8 Ways to Use Fewer Tokens

Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.

Shared Models

Cerebras Cheaper

DeepInfra Cheaper

Same Price

Price Comparison — Shared Models

Model ↑	Cerebras Input	DeepInfra Input	Cerebras Output	DeepInfra Output	Cheaper
GLM 4.7	$2.25	$0.400	$2.75	$1.75	DeepInfra
GPT-OSS-120b	$0.350	$0.150	$0.750	$0.600	DeepInfra

Model Coverage

Only on Cerebras(0)

No exclusive models

Shared(2)

models available on both

CerebrasDeepInfra

2 total67 total

Only on DeepInfra(65)

DeepSeek V3$0.320/M

DeepSeek V3 0324$0.200/M

DeepSeek V3.1$0.210/M

DeepSeek V3.1 Terminus$0.270/M

DeepSeek V3.2$0.260/M

DeepSeek V4 Flash (Non-Reasoning)$0.100/M

DeepSeek V4 Pro$1.30/M

Gemma 3 12B$0.050/M

Gemma 3 27B$0.080/M

Gemma 3 4B$0.050/M

Gemma 4 26B A4B Instruct$0.070/M

Gemma 4 31B Instruct$0.130/M

GLM 4.6$0.430/M

GLM 5$0.600/M

GLM 5.1$1.05/M

GLM-4.7-Flash$0.060/M

GPT-OSS-20b$0.030/M

Hermes 3 405B Instruct$1.00/M

Hermes 3 70B Instruct$0.700/M

Kimi K2.5$0.450/M

Kimi K2.6$0.750/M

Llama 3 8B Lunaris$0.040/M

Llama 3.1 70B Instruct$0.400/M

Llama 3.1 8B Instruct$0.020/M

Llama 3.1 Euryale 70B v2.2$0.850/M

Llama 3.2 11B Vision Instruct$0.345/M

Llama 3.3 70B Instruct$0.100/M

Llama 3.3 Nemotron Super 49B V1.5$0.400/M

Llama 4 Maverick$0.150/M

Llama 4 Scout$0.100/M

meta-llama-llama-guard-4-12b$0.180/M

MiMo v2.5$0.400/M

MiMo v2.5 Pro$1.00/M

MiniMax M2.5$0.150/M

MiniMax M2.7$0.300/M

Mistral Nemo$0.020/M

Mistral Small 24B Instruct 2501$0.050/M

Mistral Small 3.2 24B$0.075/M

MythoMax 13B$0.400/M

Nemotron 3 Nano 30B A3B$0.050/M

Nemotron 3 Super 120B A12B$0.100/M

Nemotron Nano 9B V2$0.040/M

Nemotron-3 Super 120B A12B$0.100/M

Phi 4$0.070/M

Qwen2.5 72B Instruct$0.360/M

Qwen3 14B$0.120/M

Qwen3 235B A22B Instruct 2507$0.090/M

Qwen3 235B A22B Thinking 2507$0.230/M

Qwen3 30B A3B$0.120/M

Qwen3 32B$0.080/M

Qwen3 Coder 480B A35B (exacto)$0.300/M

Qwen3 Max$1.20/M

Qwen3 Max Thinking$1.20/M

Qwen3 Next 80B A3B Instruct$0.090/M

Qwen3 VL 235B A22B Instruct$0.200/M

Qwen3 VL 30B A3B Instruct$0.150/M

Qwen3.5 397B A17B$0.450/M

Qwen3.5 9B$0.100/M

Qwen3.5-122B-A10B$0.290/M

Qwen3.5-27B$0.260/M

Qwen3.5-35B-A3B$0.140/M

Qwen3.6 35B A3B$0.150/M

R1 0528$0.500/M

R1 Distill Llama 70B$0.700/M

Step 3.5 Flash$0.090/M

Cerebras vs DeepInfra

8 Ways to Use Fewer Tokens

Price Comparison — Shared Models

Model Coverage

Full Provider Pricing

Frequently Asked Questions

Tools

Directories

Models & Pricing

Endpoints

Rankings

News