
Azure OpenAI vs DeepInfra
Compare pricing across 1 shared models. Azure OpenAI offers 87 models, DeepInfra offers 71.
8 Ways to Use Fewer Tokens
Get the free PDF guide — practical tips to cut your token usage and API costs. Subscribe to the Price Per Token newsletter and download instantly.
1
Shared Models
0
Azure OpenAI Cheaper
0
DeepInfra Cheaper
1
Same Price
Price Comparison — Shared Models
| Model ↑ | Azure OpenAI Input | DeepInfra Input | Azure OpenAI Output | DeepInfra Output | Cheaper |
|---|---|---|---|---|---|
| GPT-OSS-120b | $0.150 | $0.150 | $0.600 | $0.600 | Same |
Model Coverage
Azure OpenAI$75.00/M
Babbage$0.500/M
ChatGPT-4o$5.00/M
Claude Opus 4.6$5.00/M
Claude Sonnet 4.6$3.00/M
Codex Mini$0.750/M
curie$2.00/M
GPT-3.5 Turbo$0.750/M
GPT-3.5 Turbo 16k$3.00/M
GPT-3.5 Turbo Instruct$1.50/M
GPT-4$30.00/M
GPT-4 Turbo$10.00/M
GPT-4.1$2.00/M
GPT-4.1 Mini$0.200/M
GPT-4.1 Nano$0.100/M
GPT-4o$2.50/M
GPT-4o-mini$0.150/M
GPT-5$0.625/M
GPT-5 Chat$1.25/M
GPT-5 Codex$1.25/M
GPT-5 Mini$0.250/M
GPT-5 Nano$0.050/M
GPT-5 Pro$15.00/M
GPT-5.1$1.25/M
GPT-5.1 Chat$0.625/M
GPT-5.1-Codex$1.25/M
GPT-5.1-Codex-Max$1.25/M
GPT-5.1-Codex-Mini$0.250/M
GPT-5.2$1.75/M
GPT-5.2 Chat$1.75/M
GPT-5.2 Pro$10.50/M
GPT-5.2-Codex$1.75/M
GPT-5.3 Chat$1.75/M
GPT-5.3 Codex$1.75/M
GPT-5.4$2.50/M
GPT-5.4 Mini$0.750/M
GPT-5.4 Nano$0.200/M
GPT-5.4 Pro$30.00/M
Grok 3$3.00/M
Grok 3 Mini$0.250/M
o1$15.00/M
o1 Mini$0.550/M
o1-pro$150.00/M
o3$2.00/M
o3 Deep Research$10.00/M
o3 Mini$1.10/M
o3 Pro$20.00/M
o4 Mini$0.550/M
openai-gpt-3.5-turbo-0613$1.00/M
openai-gpt-4-0314$15.00/M
openai-gpt-4-1106-preview$10.00/M
openai-gpt-4-turbo-2024-04-09$10.00/M
openai-gpt-4-turbo-preview$10.00/M
openai-gpt-4.1-mini-2025-04-14$0.400/M
openai-gpt-4o-0513$5.00/M
openai-gpt-4o-0806$2.50/M
openai-gpt-4o-1120$2.50/M
openai-gpt-4o-2024-05-13$5.00/M
openai-gpt-4o-2024-08-06$2.50/M
openai-gpt-4o-2024-11-20$2.50/M
openai-gpt-4o-mini-0718$0.150/M
openai-gpt-4o-mini-2024-07-18$0.075/M
openai-gpt-4o-realtime$5.00/M
openai-gpt-4o-search-preview$2.50/M
openai-gpt-5-mini-2025-08-07$0.250/M
openai-gpt-5.2-2025-12-11$1.75/M
openai-o1-1217$15.00/M
openai-o1-mini-2024-09-12$1.10/M
openai-o1-preview$15.00/M
openai-o1-preview-2024-09-12$7.50/M
openai-o3-0416$2.00/M
openai-o3-deep-research-0626$10.00/M
openai-o3-mini-0131$1.10/M
openai-o4-mini-0416$1.10/M
openai-o4-mini-2025-04-16$0.550/M
openai-text-embedding-ada-002$0.100/M
openai-text-embedding-ada-002-v2$0.050/M
R1$1.49/M
text-ada-001$0.200/M
text-davinci-002$20.00/M
text-davinci-003$20.00/M
Shared(1)
1
models available on both
Azure OpenAIDeepInfra
83 total71 total
Only on DeepInfra(70)DeepSeek V3$0.320/M
DeepSeek V3 0324$0.200/M
DeepSeek V3.1$0.210/M
DeepSeek V3.1 Terminus$0.210/M
DeepSeek V3.2$0.260/M
Gemma 3 12B$0.040/M
Gemma 3 27B$0.080/M
Gemma 3 4B$0.040/M
Gemma 4 26B A4B Instruct$0.070/M
Gemma 4 31B Instruct$0.130/M
GLM 4.6$0.430/M
GLM 4.6V$0.300/M
GLM 4.7$0.400/M
GLM 5$0.800/M
GLM 5.1$1.40/M
GLM-4.7-Flash$0.060/M
GPT-OSS-20b$0.030/M
Hermes 3 405B Instruct$1.00/M
Hermes 3 70B Instruct$0.300/M
Kimi K2.5$0.450/M
Llama 3 8B Instruct$0.030/M
Llama 3 8B Lunaris$0.040/M
Llama 3.1 70B Instruct$0.400/M
Llama 3.1 8B Instruct$0.020/M
Llama 3.1 Euryale 70B v2.2$0.850/M
Llama 3.2 11B Vision Instruct$0.245/M
Llama 3.3 70B Instruct$0.100/M
Llama 3.3 Euryale 70B$0.850/M
Llama 4 Maverick$0.150/M
Llama 4 Scout$0.080/M
meta-llama-llama-guard-4-12b$0.180/M
MiniMax M2.5$0.270/M
Mistral Nemo$0.020/M
Mistral Small 24B Instruct 2501$0.050/M
Mistral Small 3.2 24B$0.075/M
Mixtral 8x7B Instruct$0.540/M
MythoMax 13B$0.400/M
Nemotron 3 Nano 30B A3B$0.050/M
Nemotron 3 Super 120B A12B$0.100/M
Nemotron Nano 12B 2 VL$0.200/M
Nemotron Nano 9B V2$0.040/M
Nemotron-3 Super 120B A12B$0.100/M
Olmo 3.1 32B Instruct$0.200/M
Phi 4$0.070/M
Qwen2.5 72B Instruct$0.120/M
Qwen3 14B$0.120/M
Qwen3 235B A22B Instruct 2507$0.071/M
Qwen3 235B A22B Thinking 2507$0.230/M
Qwen3 30B A3B$0.080/M
Qwen3 32B$0.080/M
Qwen3 Coder 480B A35B (exacto)$0.220/M
Qwen3 Max$1.20/M
Qwen3 Max Thinking$1.20/M
Qwen3 Next 80B A3B Instruct$0.090/M
Qwen3 VL 235B A22B Instruct$0.200/M
Qwen3 VL 30B A3B Instruct$0.150/M
Qwen3.5 0.8B$0.010/M
Qwen3.5 2B (Non-reasoning)$0.020/M
Qwen3.5 397B A17B$0.540/M
Qwen3.5 4B (Non-reasoning)$0.030/M
Qwen3.5 9B$0.040/M
Qwen3.5-122B-A10B$0.290/M
Qwen3.5-27B$0.260/M
Qwen3.5-35B-A3B$0.200/M
Qwen3.6 35B A3B$0.200/M
R1 0528$0.500/M
R1 Distill Llama 70B$0.700/M
Step 3.5 Flash$0.100/M
Full Provider Pricing
Frequently Asked Questions
Built by @aellman
Tools
Directories
Models & Pricing
Endpoints
Rankings
- All Rankings
- All Benchmarks
- Best LLM for Coding
- Best LLM for Math
- Best LLM for Writing
- Best LLM for RAG
- Best Local LLM
- Best LLM for OpenClaw
- Best LLM for Cursor
- Best LLM for Windsurf
- Best LLM for Cline
- Best LLM for Aider
- Best LLM for GitHub Copilot
- Best LLM for Bolt
- Best LLM for Continue.dev
- MMLU-Pro
- GPQA
- LiveCodeBench
- Aider
- AIME
- MATH (Hard)
- Big-Bench Hard
2026 68 Ventures, LLC. All rights reserved.