Nousresearch API Pricing (Updated 2025)

This page tracks Nousresearch API pricing for all 5 models. Prices are listed in US dollars per 1,000 tokens, with worked examples so you can estimate spend quickly.

Current Prices (per 1,000 tokens)

| Model | Input ($/1K tokens) | Output ($/1K tokens) | Context Length (tokens) |
|---|---|---|---|
| deephermes-3-mistral-24b-preview | $0.000093 | $0.000373 | 32,768 |
| hermes-2-pro-llama-3-8b | $0.000025 | $0.000040 | 131,072 |
| hermes-3-llama-3.1-405b | $0.000700 | $0.000800 | 131,072 |
| hermes-3-llama-3.1-70b | $0.000100 | $0.000280 | 131,072 |
| nous-hermes-2-mixtral-8x7b-dpo | $0.000600 | $0.000600 | 32,768 |

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

What does "cost per token" mean?

Tokens are the basic units that AI models use to process text. As a rule of thumb, 1,000 tokens ≈ 750 words. The price per token determines how much you pay for each unit of text the model processes, both in the prompt you send (input) and in the response you receive (output).

Formula: Total Cost = (tokens / 1,000) × price per 1K tokens
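
As a minimal sketch, the formula can be written as a small Python function. The function name `cost_usd` is hypothetical and chosen only for illustration. The price used in the example is the input price listed for hermes-3-llama-3.1-70b in the table.

```python
def cost_usd(tokens: int, price_per_1k: float) -> float:
    """Total cost = (tokens / 1,000) x price per 1K tokens."""
    return tokens / 1000 * price_per_1k


# 50,000 input tokens at $0.000100 per 1K tokens
example = cost_usd(50_000, 0.000100)
print(f"${example:.6f}")
```

Input and output tokens are priced separately, so a full request needs this calculation once for each side.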

Examples

Example usage: A 500-word prompt + 300-word response is 800 words, and 800 ÷ 0.75 words per token ≈ 1,067 tokens total.

  • 1,000 tokens ≈ 750 words of text
  • Short email: ~200-400 tokens
  • Blog post: ~1,000-3,000 tokens
  • Research paper: ~10,000+ tokens
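
The word-to-token rule of thumb above can be sketched in Python as follows. The helper name `estimate_tokens` is hypothetical, and rounding up is an assumption made here to keep estimates slightly conservative. Actual token counts depend on the model's tokenizer and will differ from this approximation.

```python
import math

# Rule of thumb from the text: 1,000 tokens is roughly 750 words.
WORDS_PER_1K_TOKENS = 750


def estimate_tokens(words: int) -> int:
    """Rough token estimate from a word count, rounded up."""
    return math.ceil(words * 1000 / WORDS_PER_1K_TOKENS)


# The 500-word prompt plus 300-word response from the example above
print(estimate_tokens(500 + 300))
```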

Frequently Asked Questions

The most affordable model is currently hermes-2-pro-llama-3-8b, at $0.000025 per 1,000 input tokens and $0.000040 per 1,000 output tokens.

To estimate the cost of a request, count your input tokens (prompt) and expected output tokens (response), then apply the formula: (input_tokens / 1,000 × input_price) + (output_tokens / 1,000 × output_price). Our pricing calculator can help automate this.

Nousresearch pricing varies by model. Input prices range from $0.000025 to $0.000700 per 1,000 tokens, and output prices range from $0.000040 to $0.000800 per 1,000 tokens. See the table above for specific model pricing.
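
The per-request formula, with separate input and output prices, can be sketched in Python as shown below. The `PRICES` dictionary is copied from the table on this page. The names `request_cost` and `cheapest_model` and the 2,000-input / 500-output workload are hypothetical choices for illustration only. This does not call any Nousresearch API.

```python
# Prices in USD per 1,000 tokens, copied from the table: (input, output)
PRICES = {
    "deephermes-3-mistral-24b-preview": (0.000093, 0.000373),
    "hermes-2-pro-llama-3-8b": (0.000025, 0.000040),
    "hermes-3-llama-3.1-405b": (0.000700, 0.000800),
    "hermes-3-llama-3.1-70b": (0.000100, 0.000280),
    "nous-hermes-2-mixtral-8x7b-dpo": (0.000600, 0.000600),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """(input_tokens/1000 x input_price) + (output_tokens/1000 x output_price)."""
    input_price, output_price = PRICES[model]
    return input_tokens / 1000 * input_price + output_tokens / 1000 * output_price


def cheapest_model(input_tokens: int, output_tokens: int) -> str:
    """Return the model with the lowest cost for the given workload."""
    return min(PRICES, key=lambda m: request_cost(m, input_tokens, output_tokens))


# Hypothetical workload: 2,000 input tokens and 500 output tokens per request
for model in PRICES:
    print(f"{model}: ${request_cost(model, 2000, 500):.6f}")
print("cheapest:", cheapest_model(2000, 500))
```

Because output prices are often higher than input prices, the ranking between models can shift with the input/output mix, so it may be worth running the comparison on a workload that resembles your own.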