Nebius AI Pricing
Compare Nebius AI inference pricing for 56 models. Nebius AI is a European cloud infrastructure provider with GPU clusters for LLM inference.
Last updated: Mar 22, 2026
Nebius AI Overview
Total Models: 56
LLMs: 56
Embedding Models: 0
Cheapest LLM Input/1M: $0.02
Nebius AI Model Pricing
Provider | Model | Context | Input/1M | Output/1M |
|---|---|---|---|---|
NB | google/gemma-2-2b-it | — | $0.020 | $0.060 |
NB | google/gemma-2-9b-it-fast | — | $0.030 | $0.090 |
NB | Qwen/Qwen2.5-Coder-7B-fast | — | $0.030 | $0.090 |
NB | openai/gpt-oss-20b | — | $0.050 | $0.200 |
NB | Qwen/Qwen3-30B-A3B-Instruct-2507 | — | $0.100 | $0.300 |
NB | Qwen/Qwen3-30B-A3B-Thinking-2507 | — | $0.100 | $0.300 |
NB | Qwen/Qwen3-32B | — | $0.100 | $0.300 |
NB | Qwen/Qwen3-Coder-30B-A3B-Instruct | — | $0.100 | $0.300 |
NB | meta-llama/Llama-3.3-70B-Instruct | — | $0.130 | $0.400 |
NB | openai/gpt-oss-120b | — | $0.150 | $0.600 |
NB | openai/gpt-oss-120b-fast | — | $0.150 | $0.600 |
NB | Qwen/Qwen3-235B-A22B-Instruct-2507 | — | $0.200 | $0.600 |
NB | Qwen/Qwen3-235B-A22B-Thinking-2507 | — | $0.200 | $0.800 |
NB | Qwen/Qwen3-235B-A22B-Thinking-2507-fast | — | $0.200 | $0.800 |
NB | Qwen/Qwen3-32B-fast | — | $0.200 | $0.600 |
NB | meta-llama/Llama-3.3-70B-Instruct-fast | — | $0.250 | $0.750 |
NB | Qwen/Qwen3-Coder-480B-A35B-Instruct | — | $0.400 | $1.800 |
NB | deepseek-ai/DeepSeek-V3-0324 | — | $0.500 | $1.500 |
NB | deepseek-ai/DeepSeek-V3.2 | — | $0.500 | $1.500 |
NB | deepseek-ai/DeepSeek-V3.2-fast | — | $0.500 | $1.500 |
NB | moonshotai/Kimi-K2-Instruct | — | $0.500 | $2.400 |
NB | nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 | — | $0.600 | $1.800 |
NB | zai-org/GLM-4.5 | — | $0.600 | $2.200 |
NB | zai-org/GLM-4.5-Air | — | $0.600 | $2.200 |
NB | deepseek-ai/DeepSeek-V3-0324-fast | — | $0.750 | $2.250 |
NB | deepseek-ai/DeepSeek-R1-0528 | — | $0.800 | $2.400 |
NB | deepseek-ai/DeepSeek-R1-0528-fast | — | $2.000 | $6.000 |
NB | BAAI/bge-en-icl | — | N/A | N/A |
NB | BAAI/bge-multilingual-gemma2 | — | N/A | N/A |
NB | black-forest-labs/flux-dev | — | N/A | N/A |
NB | black-forest-labs/flux-schnell | — | N/A | N/A |
NB | google/gemma-3-27b-it | — | N/A | N/A |
NB | google/gemma-3-27b-it-fast | — | N/A | N/A |
NB | intfloat/e5-mistral-7b-instruct | — | N/A | N/A |
NB | meta-llama/Llama-Guard-3-8B | — | N/A | N/A |
NB | meta-llama/Meta-Llama-3.1-8B-Instruct | — | N/A | N/A |
NB | meta-llama/Meta-Llama-3.1-8B-Instruct-fast | — | N/A | N/A |
NB | MiniMaxAI/MiniMax-M2.1 | — | N/A | N/A |
NB | MiniMaxAI/MiniMax-M2.5 | — | N/A | N/A |
NB | moonshotai/Kimi-K2.5 | — | N/A | N/A |
NB | moonshotai/Kimi-K2.5-fast | — | N/A | N/A |
NB | moonshotai/Kimi-K2-Thinking | — | N/A | N/A |
NB | NousResearch/Hermes-4-405B | — | N/A | N/A |
NB | NousResearch/Hermes-4-70B | — | N/A | N/A |
NB | nvidia/nemotron-3-super-120b-a12b | — | N/A | N/A |
NB | nvidia/Nemotron-Nano-V2-12b | — | N/A | N/A |
NB | nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B | — | N/A | N/A |
NB | PrimeIntellect/INTELLECT-3 | — | N/A | N/A |
NB | Qwen/Qwen2.5-VL-72B-Instruct | — | N/A | N/A |
NB | Qwen/Qwen3.5-397B-A17B | — | N/A | N/A |
NB | Qwen/Qwen3.5-397B-A17B-fast | — | N/A | N/A |
NB | Qwen/Qwen3-Embedding-8B | — | N/A | N/A |
NB | Qwen/Qwen3-Next-80B-A3B-Thinking | — | N/A | N/A |
NB | Qwen/Qwen3-Next-80B-A3B-Thinking-fast | — | N/A | N/A |
NB | zai-org/GLM-4.7-FP8 | — | N/A | N/A |
NB | zai-org/GLM-5 | — | N/A | N/A |
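Pricing data from Nebius AI. Prices are per 1M tokens, with input and output tokens billed at separate rates; N/A means no price is listed for that model.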
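Because input and output tokens are priced separately per 1M tokens, the cost of a workload is (input tokens × input price + output tokens × output price) / 1,000,000. The following is a minimal sketch of that arithmetic, not an official calculator: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration, the three entries are copied from the table above, and the prices are assumed to be in US dollars based on the `$` sign.

```python
from decimal import Decimal

# (input, output) prices per 1M tokens, copied from the table above.
# Assumption: the "$" sign means US dollars.
PRICES = {
    "google/gemma-2-2b-it": (Decimal("0.020"), Decimal("0.060")),
    "meta-llama/Llama-3.3-70B-Instruct": (Decimal("0.130"), Decimal("0.400")),
    "deepseek-ai/DeepSeek-R1-0528": (Decimal("0.800"), Decimal("2.400")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of a workload for one of the models in PRICES.

    Hypothetical helper: it applies the listed per-1M-token rates and
    ignores anything the table does not show (discounts, minimums, taxes).
    """
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # 2M input tokens and 0.5M output tokens on Llama 3.3 70B Instruct.
    cost = estimate_cost("meta-llama/Llama-3.3-70B-Instruct", 2_000_000, 500_000)
    print(f"${cost:.2f}")
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding drift. Models listed as N/A are deliberately left out of `PRICES`, so asking for one raises a `KeyError` rather than returning a made-up figure.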
Compare Nebius AI with Other Providers
Nebius AI vs Groq
Nebius AI vs Together AI
Nebius AI vs Fireworks AI
Nebius AI vs DeepInfra
Nebius AI vs Cerebras
Nebius AI vs SambaNova
Nebius AI vs Cloudflare Workers AI
Nebius AI vs AWS Bedrock
Nebius AI vs Azure OpenAI
Nebius AI vs Google AI Studio
Nebius AI vs OpenRouter
Nebius AI vs Novita AI