Azure OpenAI vs DeepInfra

Compare pricing for the 1 model offered by both providers. Azure OpenAI offers 89 models, DeepInfra offers 67.

Shared Models: 1
Azure OpenAI Cheaper: 0
DeepInfra Cheaper: 1
Same Price: 0

Price Comparison — Shared Models

Model	Azure OpenAI Input ($/M)	DeepInfra Input ($/M)	Azure OpenAI Output ($/M)	DeepInfra Output ($/M)	Cheaper
GPT-OSS-120b	$0.150	$0.039	$0.600	$0.190	DeepInfra
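
To see what the per-token gap means for an actual bill, the following is a minimal sketch that applies the GPT-OSS-120b prices from the table to a workload. It assumes the listed prices are USD per 1M tokens (the page's "/M" suffix); the `PRICES` dictionary, the `workload_cost` helper, and the token counts are hypothetical names and values chosen for illustration, not part of either provider's API.

```python
# GPT-OSS-120b prices copied from the comparison table.
# Assumption: values are USD per 1M tokens.
PRICES = {
    "Azure OpenAI": {"input": 0.150, "output": 0.600},
    "DeepInfra": {"input": 0.039, "output": 0.190},
}


def workload_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload on one provider (hypothetical helper)."""
    price = PRICES[provider]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # convert from per-million pricing


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens.
    for name in PRICES:
        print(f"{name}: ${workload_cost(name, 2_000_000, 500_000):.4f}")
```

For this hypothetical workload the sketch gives roughly $0.60 on Azure OpenAI and roughly $0.17 on DeepInfra. Real bills may differ because of caching discounts, batch pricing, or regional rates, none of which are modeled here.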

Model Coverage

Only on Azure OpenAI (84)

Azure OpenAI	$75.00/M
Babbage	$0.500/M
ChatGPT-4o	$5.00/M
Claude Opus 4.6	$5.00/M
Claude Sonnet 4.6	$3.00/M
Codex Mini	$1.50/M
curie	$2.00/M
GPT-3.5 Turbo	$0.750/M
GPT-3.5 Turbo 16k	$3.00/M
GPT-3.5 Turbo Instruct	$1.50/M
GPT-4	$30.00/M
GPT-4 Turbo	$10.00/M
GPT-4.1	$2.00/M
GPT-4.1 Mini	$0.400/M
GPT-4.1 Nano	$0.100/M
GPT-4o	$2.50/M
GPT-4o-mini	$0.150/M
GPT-5	$0.625/M
GPT-5 Chat	$1.25/M
GPT-5 Codex	$1.25/M
GPT-5 Mini	$0.125/M
GPT-5 Nano	$0.050/M
GPT-5 Pro	$15.00/M
GPT-5.1	$1.25/M
GPT-5.1 Chat	$0.625/M
GPT-5.1-Codex	$1.25/M
GPT-5.1-Codex-Max	$1.25/M
GPT-5.1-Codex-Mini	$0.250/M
GPT-5.2	$1.75/M
GPT-5.2 Chat	$1.75/M
GPT-5.2 Pro	$10.50/M
GPT-5.2-Codex	$1.75/M
GPT-5.3 Chat	$1.75/M
GPT-5.3 Codex	$1.75/M
GPT-5.4	$2.50/M
GPT-5.4 Mini	$0.750/M
GPT-5.4 Nano	$0.200/M
GPT-5.4 Pro	$30.00/M
GPT-5.5 Pro	$5.00/M
GPT-5.5 Short Context PP	$12.50/M
Grok 3	$3.00/M
Grok 3 Mini	$0.250/M
o1	$15.00/M
o1 Mini	$0.550/M
o1-pro	$150.00/M
o3	$2.00/M
o3 Deep Research	$10.00/M
o3 Mini	$0.550/M
o3 Pro	$20.00/M
o4 Mini	$1.10/M
openai-gpt-3.5-turbo-0613	$1.00/M
openai-gpt-4-0314	$15.00/M
openai-gpt-4-1106-preview	$10.00/M
openai-gpt-4-turbo-2024-04-09	$10.00/M
openai-gpt-4-turbo-preview	$10.00/M
openai-gpt-4.1-mini-2025-04-14	$0.400/M
openai-gpt-4o-0513	$5.00/M
openai-gpt-4o-0806	$2.50/M
openai-gpt-4o-1120	$2.50/M
openai-gpt-4o-2024-05-13	$5.00/M
openai-gpt-4o-2024-08-06	$2.50/M
openai-gpt-4o-2024-11-20	$2.50/M
openai-gpt-4o-mini-0718	$0.150/M
openai-gpt-4o-mini-2024-07-18	$0.200/M
openai-gpt-4o-mini-search-preview	$0.150/M
openai-gpt-4o-realtime	$4.00/M
openai-gpt-4o-search-preview	$2.50/M
openai-gpt-5-mini-2025-08-07	$0.125/M
openai-gpt-5.2-2025-12-11	$0.875/M
openai-o1-1217	$15.00/M
openai-o1-mini-2024-09-12	$0.550/M
openai-o1-preview	$15.00/M
openai-o1-preview-2024-09-12	$7.50/M
openai-o3-0416	$2.00/M
openai-o3-deep-research-0626	$10.00/M
openai-o3-mini-0131	$1.10/M
openai-o4-mini-0416	$1.10/M
openai-o4-mini-2025-04-16	$0.550/M
openai-text-embedding-ada-002	$0.050/M
openai-text-embedding-ada-002-v2	$0.100/M
R1	$1.49/M
text-ada-001	$0.200/M
text-davinci-002	$10.00/M
text-davinci-003	$20.00/M

Shared (1): models available on both

Azure OpenAI: 85 total
DeepInfra: 67 total

Only on DeepInfra (66)

DeepSeek V3	$0.320/M
DeepSeek V3 0324	$0.200/M
DeepSeek V3.1	$0.210/M
DeepSeek V3.1 Terminus	$0.270/M
DeepSeek V3.2	$0.260/M
DeepSeek V4 Flash (Non-Reasoning)	$0.100/M
DeepSeek V4 Pro	$1.30/M
Gemma 3 12B	$0.050/M
Gemma 3 27B	$0.080/M
Gemma 3 4B	$0.050/M
Gemma 4 26B A4B Instruct	$0.070/M
Gemma 4 31B Instruct	$0.130/M
GLM 4.6	$0.430/M
GLM 4.7	$0.400/M
GLM 5	$0.600/M
GLM 5.1	$1.05/M
GLM-4.7-Flash	$0.060/M
GPT-OSS-20b	$0.030/M
Hermes 3 405B Instruct	$1.00/M
Hermes 3 70B Instruct	$0.700/M
Kimi K2.5	$0.450/M
Kimi K2.6	$0.750/M
Llama 3 8B Lunaris	$0.040/M
Llama 3.1 70B Instruct	$0.400/M
Llama 3.1 8B Instruct	$0.020/M
Llama 3.1 Euryale 70B v2.2	$0.850/M
Llama 3.2 11B Vision Instruct	$0.345/M
Llama 3.3 70B Instruct	$0.100/M
Llama 3.3 Nemotron Super 49B V1.5	$0.400/M
Llama 4 Maverick	$0.150/M
Llama 4 Scout	$0.100/M
meta-llama-llama-guard-4-12b	$0.180/M
MiMo v2.5	$0.400/M
MiMo v2.5 Pro	$1.00/M
MiniMax M2.5	$0.150/M
MiniMax M2.7	$0.300/M
Mistral Nemo	$0.020/M
Mistral Small 24B Instruct 2501	$0.050/M
Mistral Small 3.2 24B	$0.075/M
MythoMax 13B	$0.400/M
Nemotron 3 Nano 30B A3B	$0.050/M
Nemotron 3 Super 120B A12B	$0.100/M
Nemotron Nano 9B V2	$0.040/M
Nemotron-3 Super 120B A12B	$0.100/M
Phi 4	$0.070/M
Qwen2.5 72B Instruct	$0.360/M
Qwen3 14B	$0.120/M
Qwen3 235B A22B Instruct 2507	$0.090/M
Qwen3 235B A22B Thinking 2507	$0.230/M
Qwen3 30B A3B	$0.120/M
Qwen3 32B	$0.080/M
Qwen3 Coder 480B A35B (exacto)	$0.300/M
Qwen3 Max	$1.20/M
Qwen3 Max Thinking	$1.20/M
Qwen3 Next 80B A3B Instruct	$0.090/M
Qwen3 VL 235B A22B Instruct	$0.200/M
Qwen3 VL 30B A3B Instruct	$0.150/M
Qwen3.5 397B A17B	$0.450/M
Qwen3.5 9B	$0.100/M
Qwen3.5-122B-A10B	$0.290/M
Qwen3.5-27B	$0.260/M
Qwen3.5-35B-A3B	$0.140/M
Qwen3.6 35B A3B	$0.150/M
R1 0528	$0.500/M
R1 Distill Llama 70B	$0.700/M
Step 3.5 Flash	$0.090/M
