Price Per Token

|Follow:

DeepInfra

vs

Groq

DeepInfra vs Groq

Compare pricing across 8 shared models. DeepInfra offers 67 models, Groq offers 15.

8 Ways to Use Fewer Tokens

Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.

8

Shared Models

7

DeepInfra Cheaper

0

Groq Cheaper

1

Same Price

Price Comparison — Shared Models

Model ↑	DeepInfra Input	Groq Input	DeepInfra Output	Groq Output	Cheaper
GPT-OSS-120b	$0.150	$0.150	$0.600	$0.600	Same
GPT-OSS-20b	$0.030	$0.075	$0.140	$0.300	DeepInfra
Llama 3.1 8B Instruct	$0.020	$0.050	$0.030	$0.080	DeepInfra
Llama 3.3 70B Instruct	$0.100	$0.590	$0.320	$0.790	DeepInfra
Llama 4 Scout	$0.100	$0.110	$0.300	$0.340	DeepInfra
meta-llama-llama-guard-4-12b	$0.180	$0.200	$0.180	$0.200	DeepInfra
Qwen3 32B	$0.080	$0.290	$0.280	$0.590	DeepInfra
R1 Distill Llama 70B	$0.700	$0.750	$0.800	$0.990	DeepInfra

Model Coverage

DeepInfra

Only on DeepInfra(59)

DeepSeek V3$0.320/M

DeepSeek V3 0324$0.200/M

DeepSeek V3.1$0.210/M

DeepSeek V3.1 Terminus$0.270/M

DeepSeek V3.2$0.260/M

DeepSeek V4 Flash (Non-Reasoning)$0.100/M

DeepSeek V4 Pro$1.30/M

Gemma 3 12B$0.050/M

Gemma 3 27B$0.080/M

Gemma 3 4B$0.050/M

Gemma 4 26B A4B Instruct$0.070/M

Gemma 4 31B Instruct$0.130/M

GLM 4.6$0.430/M

GLM 4.7$0.400/M

GLM 5$0.600/M

GLM 5.1$1.05/M

GLM-4.7-Flash$0.060/M

Hermes 3 405B Instruct$1.00/M

Hermes 3 70B Instruct$0.700/M

Kimi K2.5$0.450/M

Kimi K2.6$0.750/M

Llama 3 8B Lunaris$0.040/M

Llama 3.1 70B Instruct$0.400/M

Llama 3.1 Euryale 70B v2.2$0.850/M

Llama 3.2 11B Vision Instruct$0.345/M

Llama 3.3 Nemotron Super 49B V1.5$0.400/M

Llama 4 Maverick$0.150/M

MiMo v2.5$0.400/M

MiMo v2.5 Pro$1.00/M

MiniMax M2.5$0.150/M

MiniMax M2.7$0.300/M

Mistral Nemo$0.020/M

Mistral Small 24B Instruct 2501$0.050/M

Mistral Small 3.2 24B$0.075/M

MythoMax 13B$0.400/M

Nemotron 3 Nano 30B A3B$0.050/M

Nemotron 3 Super 120B A12B$0.100/M

Nemotron Nano 9B V2$0.040/M

Nemotron-3 Super 120B A12B$0.100/M

Phi 4$0.070/M

Qwen2.5 72B Instruct$0.360/M

Qwen3 14B$0.120/M

Qwen3 235B A22B Instruct 2507$0.090/M

Qwen3 235B A22B Thinking 2507$0.230/M

Qwen3 30B A3B$0.120/M

Qwen3 Coder 480B A35B (exacto)$0.300/M

Qwen3 Max$1.20/M

Qwen3 Max Thinking$1.20/M

Qwen3 Next 80B A3B Instruct$0.090/M

Qwen3 VL 235B A22B Instruct$0.200/M

Qwen3 VL 30B A3B Instruct$0.150/M

Qwen3.5 397B A17B$0.450/M

Qwen3.5 9B$0.100/M

Qwen3.5-122B-A10B$0.290/M

Qwen3.5-27B$0.260/M

Qwen3.5-35B-A3B$0.140/M

Qwen3.6 35B A3B$0.150/M

R1 0528$0.500/M

Step 3.5 Flash$0.090/M

Shared(8)

8

models available on both

DeepInfraGroq

67 total15 total

Groq

Only on Groq(7)

Gemma 7B Instruct$0.070/M

Kimi K2 0711$1.00/M

Kimi K2 0905 (exacto)$1.00/M

Llama 3 8B Instruct$0.050/M

meta-llama-llama-guard-3-8b$0.200/M

Mixtral 8x7B$0.240/M

openai-gpt-oss-safeguard-20b$0.075/M

Full Provider Pricing

Groq

Groq Full Pricing

View all 15 models with detailed pricing

Frequently Asked Questions

Built by @aellman

Tools

Directories

Models & Pricing

Endpoints

Rankings

News

Follow us:

Advertise | Terms of Service | Privacy Policy

2026 68 Ventures, LLC. All rights reserved.