Price Per TokenPrice Per Token
Nebius

Nebius AI Pricing

Compare Nebius AI inference pricing for 56 models. European cloud infrastructure with GPU clusters for LLM inference.

Last updated: Mar 22, 2026

Nebius AI Overview

56
Total Models
56
LLMs
0
Embedding Models
$0.02
Cheapest LLM Input/1M

Nebius AI Model Pricing

Provider
Model
Context
Input/1M
Output/1M
google/gemma-2-2b-it
$0.020
$0.060
google/gemma-2-9b-it-fast
$0.030
$0.090
Qwen/Qwen2.5-Coder-7B-fast
$0.030
$0.090
openai/gpt-oss-20b
$0.050
$0.200
Qwen/Qwen3-30B-A3B-Instruct-2507
$0.100
$0.300
Qwen/Qwen3-30B-A3B-Thinking-2507
$0.100
$0.300
Qwen/Qwen3-32B
$0.100
$0.300
Qwen/Qwen3-Coder-30B-A3B-Instruct
$0.100
$0.300
meta-llama/Llama-3.3-70B-Instruct
$0.130
$0.400
openai/gpt-oss-120b
$0.150
$0.600
openai/gpt-oss-120b-fast
$0.150
$0.600
Qwen/Qwen3-235B-A22B-Instruct-2507
$0.200
$0.600
Qwen/Qwen3-235B-A22B-Thinking-2507
$0.200
$0.800
Qwen/Qwen3-235B-A22B-Thinking-2507-fast
$0.200
$0.800
Qwen/Qwen3-32B-fast
$0.200
$0.600
meta-llama/Llama-3.3-70B-Instruct-fast
$0.250
$0.750
Qwen/Qwen3-Coder-480B-A35B-Instruct
$0.400
$1.800
deepseek-ai/DeepSeek-V3-0324
$0.500
$1.500
deepseek-ai/DeepSeek-V3.2
$0.500
$1.500
deepseek-ai/DeepSeek-V3.2-fast
$0.500
$1.500
moonshotai/Kimi-K2-Instruct
$0.500
$2.400
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
$0.600
$1.800
zai-org/GLM-4.5
$0.600
$2.200
zai-org/GLM-4.5-Air
$0.600
$2.200
deepseek-ai/DeepSeek-V3-0324-fast
$0.750
$2.250
deepseek-ai/DeepSeek-R1-0528
$0.800
$2.400
deepseek-ai/DeepSeek-R1-0528-fast
$2.000
$6.000
BAAI/bge-en-icl
$N/A
$N/A
BAAI/bge-multilingual-gemma2
$N/A
$N/A
black-forest-labs/flux-dev
$N/A
$N/A
black-forest-labs/flux-schnell
$N/A
$N/A
google/gemma-3-27b-it
$N/A
$N/A
google/gemma-3-27b-it-fast
$N/A
$N/A
intfloat/e5-mistral-7b-instruct
$N/A
$N/A
meta-llama/Llama-Guard-3-8B
$N/A
$N/A
meta-llama/Meta-Llama-3.1-8B-Instruct
$N/A
$N/A
meta-llama/Meta-Llama-3.1-8B-Instruct-fast
$N/A
$N/A
MiniMaxAI/MiniMax-M2.1
$N/A
$N/A
MiniMaxAI/MiniMax-M2.5
$N/A
$N/A
moonshotai/Kimi-K2.5
$N/A
$N/A
moonshotai/Kimi-K2.5-fast
$N/A
$N/A
moonshotai/Kimi-K2-Thinking
$N/A
$N/A
NousResearch/Hermes-4-405B
$N/A
$N/A
NousResearch/Hermes-4-70B
$N/A
$N/A
nvidia/nemotron-3-super-120b-a12b
$N/A
$N/A
nvidia/Nemotron-Nano-V2-12b
$N/A
$N/A
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B
$N/A
$N/A
PrimeIntellect/INTELLECT-3
$N/A
$N/A
Qwen/Qwen2.5-VL-72B-Instruct
$N/A
$N/A
Qwen/Qwen3.5-397B-A17B
$N/A
$N/A
Qwen/Qwen3.5-397B-A17B-fast
$N/A
$N/A
Qwen/Qwen3-Embedding-8B
$N/A
$N/A
Qwen/Qwen3-Next-80B-A3B-Thinking
$N/A
$N/A
Qwen/Qwen3-Next-80B-A3B-Thinking-fast
$N/A
$N/A
zai-org/GLM-4.7-FP8
$N/A
$N/A
zai-org/GLM-5
$N/A
$N/A

Pricing from Nebius AI. Prices per 1M tokens.

Compare Nebius AI with Other Providers