
AWS Bedrock vs DeepInfra
Compare pricing across 21 shared models. AWS Bedrock offers 61 models, DeepInfra offers 71.
8 Ways to Use Fewer Tokens
Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.
21
Shared Models
0
AWS Bedrock Cheaper
18
DeepInfra Cheaper
3
Same Price
Price Comparison — Shared Models
| Model ↑ | AWS Bedrock Input | DeepInfra Input | AWS Bedrock Output | DeepInfra Output | Cheaper |
|---|---|---|---|---|---|
| DeepSeek V3.2 | $0.620 | $0.260 | $1.85 | $0.380 | DeepInfra |
| Gemma 3 12B | $0.090 | $0.040 | $0.290 | $0.130 | DeepInfra |
| Gemma 3 27B | $0.230 | $0.080 | $0.380 | $0.160 | DeepInfra |
| Gemma 3 4B | $0.040 | $0.040 | $0.080 | $0.080 | Same |
| GLM 4.7 | $0.600 | $0.400 | $2.20 | $1.75 | DeepInfra |
| GLM 5 | $1.00 | $0.800 | $3.20 | $2.56 | DeepInfra |
| GLM-4.7-Flash | $0.070 | $0.060 | $0.400 | $0.400 | DeepInfra |
| GPT-OSS-120b | $0.150 | $0.150 | $0.600 | $0.600 | Same |
| GPT-OSS-20b | $0.070 | $0.030 | $0.150 | $0.140 | DeepInfra |
| Kimi K2 Thinking | $0.600 | $0.470 | $2.50 | $2.00 | DeepInfra |
| Kimi K2.5 | $0.600 | $0.450 | $3.00 | $2.25 | DeepInfra |
| MiniMax M2.1 | $0.300 | $0.270 | $1.20 | $0.950 | DeepInfra |
| MiniMax M2.5 | $0.300 | $0.270 | $1.20 | $0.950 | DeepInfra |
| Mistral Small 3.2 24B | $0.500 | $0.075 | $1.50 | $0.200 | DeepInfra |
| Nemotron 3 Nano 30B A3B | $0.060 | $0.050 | $0.240 | $0.200 | DeepInfra |
| Nemotron 3 Super 120B A12B | $0.150 | $0.100 | $0.650 | $0.500 | DeepInfra |
| Nemotron Nano 12B 2 VL | $0.200 | $0.200 | $0.600 | $0.600 | Same |
| Nemotron Nano 9B V2 | $0.060 | $0.040 | $0.230 | $0.160 | DeepInfra |
| Qwen3 32B | $0.200 | $0.080 | $0.780 | $0.280 | DeepInfra |
| Qwen3 Next 80B A3B Instruct | $0.150 | $0.090 | $1.20 | $1.10 | DeepInfra |
| Qwen3 VL 235B A22B Instruct | $0.530 | $0.200 | $2.66 | $0.880 | DeepInfra |
Model Coverage
Claude 3 Haiku$0.250/M
Claude 3 Opus$15.00/M
Claude 3.5 Haiku$0.800/M
Claude 3.5 Sonnet$6.00/M
Claude 3.7 Sonnet$3.00/M
Claude Haiku 4.5$1.00/M
Claude Opus 4$15.00/M
Claude Opus 4.1$15.00/M
Claude Opus 4.5$5.00/M
Claude Opus 4.6$5.00/M
Claude Sonnet 4$3.00/M
Claude Sonnet 4.5$3.00/M
Claude Sonnet 4.6$3.00/M
cohere-embed-3-english$0.100/M
cohere-embed-3-multilingual$0.100/M
cohere-embed-4$0.120/M
Command R$0.500/M
Devstral Medium$0.400/M
Jamba Instruct$0.500/M
Llama 2 70B Chat$1.95/M
MiniMax M2$0.300/M
Ministral 3 14B 2512$0.200/M
Ministral 3B$0.100/M
Ministral 8B$0.150/M
Mistral Large$0.500/M
mistral-ai-voxtral-mini-3b-2507$0.040/M
Nova 2 Lite$0.300/M
Nova Lite 1.0$0.060/M
Nova Micro 1.0$0.035/M
Nova Premier 1.0$2.50/M
Nova Pro 1.0$0.800/M
openai-gpt-oss-safeguard-120b$0.150/M
openai-gpt-oss-safeguard-20b$0.070/M
Palmyra X5$0.600/M
Qwen3 Coder 30B A3B Instruct$0.150/M
Qwen3 Coder Next$0.500/M
Voxtral Small 24B 2507$0.100/M
Shared(21)
21
models available on both
AWS BedrockDeepInfra
58 total71 total
Only on DeepInfra(50)DeepSeek V3$0.320/M
DeepSeek V3 0324$0.200/M
DeepSeek V3.1$0.210/M
DeepSeek V3.1 Terminus$0.210/M
GLM 4.6$0.430/M
GLM 4.6V$0.300/M
Hermes 3 405B Instruct$1.00/M
Hermes 3 70B Instruct$0.300/M
Kimi K2 0905 (exacto)$0.400/M
Llama 3 8B Instruct$0.030/M
Llama 3 8B Lunaris$0.040/M
Llama 3.1 70B Instruct$0.400/M
Llama 3.1 8B Instruct$0.020/M
Llama 3.1 Euryale 70B v2.2$0.850/M
Llama 3.2 11B Vision Instruct$0.049/M
Llama 3.3 70B Instruct$0.100/M
Llama 3.3 Euryale 70B$0.850/M
Llama 4 Maverick$0.150/M
Llama 4 Scout$0.080/M
meta-llama-llama-guard-4-12b$0.180/M
Mistral Nemo$0.020/M
Mistral Small 24B Instruct 2501$0.050/M
Mixtral 8x7B Instruct$0.540/M
MythoMax 13B$0.400/M
Nemotron-3 Super 120B A12B$0.100/M
Olmo 3.1 32B Instruct$0.200/M
Phi 4$0.070/M
Qwen2.5 72B Instruct$0.120/M
Qwen2.5 VL 32B Instruct$0.200/M
Qwen3 14B$0.120/M
Qwen3 235B A22B Instruct 2507$0.071/M
Qwen3 235B A22B Thinking 2507$0.230/M
Qwen3 30B A3B$0.080/M
Qwen3 Coder 480B A35B (exacto)$0.220/M
Qwen3 Max$1.20/M
Qwen3 Max Thinking$1.20/M
Qwen3 VL 30B A3B Instruct$0.150/M
Qwen3.5 0.8B$0.010/M
Qwen3.5 2B (Non-reasoning)$0.020/M
Qwen3.5 397B A17B$0.540/M
Qwen3.5 4B (Non-reasoning)$0.030/M
Qwen3.5 9B$0.040/M
Qwen3.5-122B-A10B$0.290/M
Qwen3.5-27B$0.260/M
Qwen3.5-35B-A3B$0.220/M
R1 0528$0.500/M
R1 Distill Llama 70B$0.700/M
Step 3.5 Flash$0.100/M
Full Provider Pricing
Frequently Asked Questions
Built by @aellman
Tools
Directories
Models & Pricing
Endpoints
Rankings
- All Rankings
- All Benchmarks
- Best LLM for Coding
- Best LLM for Math
- Best LLM for Writing
- Best LLM for RAG
- Best Local LLM
- Best LLM for OpenClaw
- Best LLM for Cursor
- Best LLM for Windsurf
- Best LLM for Cline
- Best LLM for Aider
- Best LLM for GitHub Copilot
- Best LLM for Bolt
- Best LLM for Continue.dev
- MMLU-Pro
- GPQA
- LiveCodeBench
- Aider
- AIME
- MATH (Hard)
- Big-Bench Hard
2026 68 Ventures, LLC. All rights reserved.