Price Per TokenPrice Per Token
Nebius

Nebius AI Pricing

Compare Nebius AI inference pricing for 29 models. European cloud infrastructure with GPU clusters for LLM inference.

Last updated: Apr 22, 2026

Nebius AI Overview

29
Total Models
29
LLMs
0
Embedding Models
$—
Cheapest LLM Input/1M

Nebius AI Model Pricing

Provider
Model
Context
Input/1M
Output/1M
deepseek-ai/DeepSeek-V3.2
$N/A
$N/A
deepseek-ai/DeepSeek-V3.2-fast
$N/A
$N/A
google/gemma-2-2b-it
$N/A
$N/A
google/gemma-3-27b-it
$N/A
$N/A
meta-llama/Llama-3.3-70B-Instruct
$N/A
$N/A
meta-llama/Meta-Llama-3.1-8B-Instruct
$N/A
$N/A
MiniMaxAI/MiniMax-M2.5
$N/A
$N/A
MiniMaxAI/MiniMax-M2.5-fast
$N/A
$N/A
moonshotai/Kimi-K2.5
$N/A
$N/A
moonshotai/Kimi-K2.5-fast
$N/A
$N/A
NousResearch/Hermes-4-405B
$N/A
$N/A
NousResearch/Hermes-4-70B
$N/A
$N/A
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
$N/A
$N/A
nvidia/nemotron-3-super-120b-a12b
$N/A
$N/A
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B
$N/A
$N/A
openai/gpt-oss-120b
$N/A
$N/A
openai/gpt-oss-120b-fast
$N/A
$N/A
PrimeIntellect/INTELLECT-3
$N/A
$N/A
Qwen/Qwen2.5-VL-72B-Instruct
$N/A
$N/A
Qwen/Qwen3-235B-A22B-Instruct-2507
$N/A
$N/A
Qwen/Qwen3-235B-A22B-Thinking-2507-fast
$N/A
$N/A
Qwen/Qwen3-30B-A3B-Instruct-2507
$N/A
$N/A
Qwen/Qwen3-32B
$N/A
$N/A
Qwen/Qwen3.5-397B-A17B
$N/A
$N/A
Qwen/Qwen3.5-397B-A17B-fast
$N/A
$N/A
Qwen/Qwen3-Embedding-8B
$N/A
$N/A
Qwen/Qwen3-Next-80B-A3B-Thinking
$N/A
$N/A
Qwen/Qwen3-Next-80B-A3B-Thinking-fast
$N/A
$N/A
zai-org/GLM-5
$N/A
$N/A

Pricing from Nebius AI. Prices per 1M tokens.

Compare Nebius AI with Other Providers