Price Per Token


AWS Bedrock vs DeepInfra

Compare pricing across 19 shared models. AWS Bedrock offers 71 models, DeepInfra offers 66.


Shared models: 19
AWS Bedrock cheaper: 0
DeepInfra cheaper: 17
Same price: 2
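The tally above can be reproduced from the rows of the shared-model table on this page. The sketch below is a minimal illustration, not the site's actual method: the `cheaper` rule (a provider wins when it is no more expensive on either price and the two rows are not identical) is an assumption that happens to match the table's "Cheaper" column, and the "Mixed" label is hypothetical.

```python
from collections import Counter

# (model, bedrock_input, deepinfra_input, bedrock_output, deepinfra_output)
# Prices copied from the shared-model table on this page.
ROWS = [
    ("DeepSeek V3.2", 0.620, 0.260, 1.85, 0.380),
    ("Gemma 3 12B", 0.090, 0.040, 0.290, 0.130),
    ("Gemma 3 27B", 0.230, 0.080, 0.380, 0.160),
    ("Gemma 3 4B", 0.040, 0.040, 0.080, 0.080),
    ("GLM 4.7", 0.600, 0.400, 2.20, 1.75),
    ("GLM 5", 1.00, 0.600, 3.20, 2.08),
    ("GLM-4.7-Flash", 0.070, 0.060, 0.400, 0.400),
    ("GPT-OSS-120b", 0.150, 0.150, 0.600, 0.600),
    ("GPT-OSS-20b", 0.070, 0.030, 0.150, 0.140),
    ("Kimi K2.5", 0.600, 0.450, 3.00, 2.25),
    ("Llama 3.1 70B Instruct", 0.720, 0.400, 0.720, 0.400),
    ("MiniMax M2.5", 0.300, 0.150, 1.20, 1.15),
    ("Mistral Small 3.2 24B", 0.500, 0.075, 1.50, 0.200),
    ("Nemotron 3 Nano 30B A3B", 0.060, 0.050, 0.240, 0.200),
    ("Nemotron 3 Super 120B A12B", 0.150, 0.100, 0.650, 0.500),
    ("Nemotron Nano 9B V2", 0.060, 0.040, 0.230, 0.160),
    ("Qwen3 32B", 0.200, 0.080, 0.780, 0.280),
    ("Qwen3 Next 80B A3B Instruct", 0.150, 0.090, 1.20, 1.10),
    ("Qwen3 VL 235B A22B Instruct", 0.530, 0.200, 2.66, 0.880),
]


def cheaper(b_in, d_in, b_out, d_out):
    """Assumed rule: identical prices are 'Same'; otherwise a provider wins
    if it is no more expensive on both input and output."""
    if b_in == d_in and b_out == d_out:
        return "Same"
    if d_in <= b_in and d_out <= b_out:
        return "DeepInfra"
    if b_in <= d_in and b_out <= d_out:
        return "AWS Bedrock"
    return "Mixed"  # hypothetical label; no row in the table lands here


tally = Counter(cheaper(*row[1:]) for row in ROWS)
print(len(ROWS), tally["AWS Bedrock"], tally["DeepInfra"], tally["Same"])
# prints: 19 0 17 2
```

Note that GLM-4.7-Flash counts for DeepInfra under this rule only because of its lower input price; the output prices are equal.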

Price Comparison — Shared Models

Model	AWS Bedrock Input ($/M tokens)	DeepInfra Input ($/M tokens)	AWS Bedrock Output ($/M tokens)	DeepInfra Output ($/M tokens)	Cheaper Provider
DeepSeek V3.2	$0.620	$0.260	$1.85	$0.380	DeepInfra
Gemma 3 12B	$0.090	$0.040	$0.290	$0.130	DeepInfra
Gemma 3 27B	$0.230	$0.080	$0.380	$0.160	DeepInfra
Gemma 3 4B	$0.040	$0.040	$0.080	$0.080	Same
GLM 4.7	$0.600	$0.400	$2.20	$1.75	DeepInfra
GLM 5	$1.00	$0.600	$3.20	$2.08	DeepInfra
GLM-4.7-Flash	$0.070	$0.060	$0.400	$0.400	DeepInfra
GPT-OSS-120b	$0.150	$0.150	$0.600	$0.600	Same
GPT-OSS-20b	$0.070	$0.030	$0.150	$0.140	DeepInfra
Kimi K2.5	$0.600	$0.450	$3.00	$2.25	DeepInfra
Llama 3.1 70B Instruct	$0.720	$0.400	$0.720	$0.400	DeepInfra
MiniMax M2.5	$0.300	$0.150	$1.20	$1.15	DeepInfra
Mistral Small 3.2 24B	$0.500	$0.075	$1.50	$0.200	DeepInfra
Nemotron 3 Nano 30B A3B	$0.060	$0.050	$0.240	$0.200	DeepInfra
Nemotron 3 Super 120B A12B	$0.150	$0.100	$0.650	$0.500	DeepInfra
Nemotron Nano 9B V2	$0.060	$0.040	$0.230	$0.160	DeepInfra
Qwen3 32B	$0.200	$0.080	$0.780	$0.280	DeepInfra
Qwen3 Next 80B A3B Instruct	$0.150	$0.090	$1.20	$1.10	DeepInfra
Qwen3 VL 235B A22B Instruct	$0.530	$0.200	$2.66	$0.880	DeepInfra
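To turn a table row into a bill, multiply each token count by the matching price. The sketch below assumes the prices are USD per million tokens (consistent with the "/M" notation used elsewhere on this page) and uses a hypothetical monthly workload of 50M input and 10M output tokens. The function name `workload_cost` is illustrative, and the estimate ignores anything the table does not list, such as caching, batch discounts, or regional pricing.

```python
def workload_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the cost in USD, with prices quoted per million tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 50M input tokens and 10M output tokens.
INPUT_TOKENS = 50_000_000
OUTPUT_TOKENS = 10_000_000

# Prices from the DeepSeek V3.2 row of the table above.
bedrock = workload_cost(INPUT_TOKENS, OUTPUT_TOKENS, 0.620, 1.85)
deepinfra = workload_cost(INPUT_TOKENS, OUTPUT_TOKENS, 0.260, 0.380)

print(f"AWS Bedrock: ${bedrock:.2f}")    # AWS Bedrock: $49.50
print(f"DeepInfra:   ${deepinfra:.2f}")  # DeepInfra:   $16.80
print(f"Difference:  ${bedrock - deepinfra:.2f}")  # Difference:  $32.70
```

Because input and output are priced separately, the gap between providers depends on the workload's input-to-output ratio, so it is worth rerunning the calculation with your own token counts.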

Model Coverage

Only on AWS Bedrock (49)

Claude 2	$8.00/M
Claude 3 Haiku	$0.250/M
Claude 3 Opus	$15.00/M
Claude 3 Sonnet	$3.00/M
Claude 3.5 Haiku	$0.800/M
Claude 3.5 Sonnet	$3.00/M
Claude 3.7 Sonnet	$3.00/M
Claude Haiku 4.5	$1.00/M
Claude Instant	$0.800/M
Claude Opus 4	$15.00/M
Claude Opus 4.1	$15.00/M
Claude Opus 4.5	$5.00/M
Claude Opus 4.6	$5.00/M
Claude Opus 4.7	$5.00/M
Claude Sonnet 4	$3.00/M
Claude Sonnet 4.5	$3.00/M
Claude Sonnet 4.6	$3.00/M
cohere-embed-3-english	$0.100/M
cohere-embed-3-multilingual	$0.100/M
cohere-embed-4	$0.120/M
Command R	$0.500/M
Devstral 2 123B	$0.400/M
Jamba 1.5 Large	$2.00/M
Jamba 1.5 Mini	$0.200/M
Jamba Instruct	$0.500/M
Jurassic-2 Mid	$12.50/M
Jurassic-2 Ultra	$18.80/M
Kimi K2 Thinking	$0.600/M
Llama 2 13B Chat	$0.750/M
Llama 2 70B Chat	$1.95/M
MiniMax M2	$0.300/M
MiniMax M2.1	$0.300/M
Ministral 3 14B 2512	$0.200/M
Ministral 3B	$0.100/M
Ministral 8B	$0.150/M
Mistral Large	$0.500/M
mistral-ai-voxtral-mini-3b-2507	$0.040/M
Nemotron Nano 12B 2 VL	$0.200/M
Nova 2 Lite	$0.300/M
Nova Lite 1.0	$0.060/M
Nova Micro 1.0	$0.035/M
Nova Premier 1.0	$2.50/M
Nova Pro 1.0	$0.800/M
openai-gpt-oss-safeguard-120b	$0.150/M
openai-gpt-oss-safeguard-20b	$0.070/M
Palmyra X5	$0.600/M
Qwen3 Coder 30B A3B Instruct	$0.150/M
Qwen3 Coder Next	$0.500/M
Voxtral Small 24B 2507	$0.100/M

Shared (19)

19 models available on both providers.

AWS Bedrock: 68 total
DeepInfra: 66 total

Only on DeepInfra (47)

DeepSeek V3	$0.320/M
DeepSeek V3 0324	$0.200/M
DeepSeek V3.1	$0.210/M
DeepSeek V3.1 Terminus	$0.270/M
DeepSeek V4 Flash (Non-Reasoning)	$0.100/M
DeepSeek V4 Pro	$1.30/M
Gemma 4 26B A4B Instruct	$0.070/M
Gemma 4 31B Instruct	$0.130/M
GLM 4.6	$0.430/M
GLM 5.1	$1.05/M
Hermes 3 405B Instruct	$1.00/M
Hermes 3 70B Instruct	$0.300/M
Kimi K2.6	$0.750/M
Llama 3 8B Lunaris	$0.040/M
Llama 3.1 8B Instruct	$0.020/M
Llama 3.1 Euryale 70B v2.2	$0.850/M
Llama 3.2 11B Vision Instruct	$0.245/M
Llama 3.3 70B Instruct	$0.100/M
Llama 3.3 Nemotron Super 49B V1.5	$0.100/M
Llama 4 Maverick	$0.150/M
Llama 4 Scout	$0.080/M
meta-llama-llama-guard-4-12b	$0.180/M
MiMo v2.5	$0.400/M
MiMo v2.5 Pro	$1.00/M
Mistral Nemo	$0.020/M
Mistral Small 24B Instruct 2501	$0.050/M
MythoMax 13B	$0.400/M
Nemotron-3 Super 120B A12B	$0.100/M
Phi 4	$0.070/M
Qwen2.5 72B Instruct	$0.360/M
Qwen3 14B	$0.120/M
Qwen3 235B A22B Instruct 2507	$0.071/M
Qwen3 235B A22B Thinking 2507	$0.230/M
Qwen3 30B A3B	$0.090/M
Qwen3 Coder 480B A35B (exacto)	$0.300/M
Qwen3 Max	$1.20/M
Qwen3 Max Thinking	$1.20/M
Qwen3 VL 30B A3B Instruct	$0.150/M
Qwen3.5 397B A17B	$0.490/M
Qwen3.5 9B	$0.040/M
Qwen3.5-122B-A10B	$0.290/M
Qwen3.5-27B	$0.260/M
Qwen3.5-35B-A3B	$0.140/M
Qwen3.6 35B A3B	$0.150/M
R1 0528	$0.500/M
R1 Distill Llama 70B	$0.700/M
Step 3.5 Flash	$0.090/M


Built by @aellman


2026 68 Ventures, LLC. All rights reserved.