Nousresearch API Pricing (Updated 2025)
This page tracks Nousresearch API pricing for all 5 models. Prices are shown per 1,000 tokens (cost per token) with clear examples so you can estimate spend quickly.
Current Prices (per 1,000 tokens)
Model | Input ($/1K tokens) | Output ($/1K tokens) | Context Length |
---|---|---|---|
deephermes-3-mistral-24b-preview | $0.000093 | $0.000373 | 32,768 |
hermes-2-pro-llama-3-8b | $0.000025 | $0.000040 | 131,072 |
hermes-3-llama-3.1-405b | $0.000700 | $0.000800 | 131,072 |
hermes-3-llama-3.1-70b | $0.000100 | $0.000280 | 131,072 |
nous-hermes-2-mixtral-8x7b-dpo | $0.000600 | $0.000600 | 32,768 |
* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.
What does "cost per token" mean?
Tokens are the basic units that AI models use to process text. Generally, 1,000 tokens ≈ ~750 words. The cost per token determines how much you pay for each unit of text processed by the model.
Formula: Total Cost = (tokens / 1,000) × price per 1K tokens
Examples
Example usage: A 500-word prompt + 300-word response ≈ ~1,067 tokens total.
- • 1,000 tokens ≈ ~750 words of text
- • Short email: ~200-400 tokens
- • Blog post: ~1,000-3,000 tokens
- • Research paper: ~10,000+ tokens
Compare with Other Providers
See how Nousresearch compares with other AI providers including OpenAI, Anthropic, Google, and more.
View full pricing comparison →