Price Per Token
Olmo 3.1 32B Instruct vs Olmo 3.1 32B Think

A detailed comparison of pricing, benchmarks, and capabilities

Key Takeaways

Olmo 3.1 32B Instruct wins:

  • Lower time to first token (0.26s vs 0.59s)

Olmo 3.1 32B Think wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Higher output throughput (67.5 vs 47.2 tok/s)
  • Higher intelligence benchmark
  • Better at coding
  • Math Index score of 77.3 (no score listed for Instruct)
  • Has reasoning mode

Price Advantage: Olmo 3.1 32B Think
Benchmark Advantage: Olmo 3.1 32B Think
Context Window: Tie (65,536 tokens each)
Speed: Olmo 3.1 32B Think (higher tokens/second)

Pricing Comparison

Price Comparison

Metric | Olmo 3.1 32B Instruct | Olmo 3.1 32B Think | Winner
Input (per 1M tokens) | $0.20 | $0.15 | Olmo 3.1 32B Think
Output (per 1M tokens) | $0.60 | $0.50 | Olmo 3.1 32B Think

Using a 3:1 input/output ratio, Olmo 3.1 32B Think is 21% cheaper overall.
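
The 21% figure can be reproduced with a weighted average of the listed prices. The sketch below is only an illustration: the function name `blended_price` is hypothetical, and it assumes the 3:1 ratio means three input tokens for every output token.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3, output_parts: float = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Prices in USD per 1M tokens, as listed in the pricing comparison.
instruct = blended_price(0.20, 0.60)
think = blended_price(0.15, 0.50)

# Fraction saved by choosing Think over Instruct at this token mix.
savings = 1 - think / instruct

print(f"Instruct blended: ${instruct:.4f} per 1M tokens")
print(f"Think blended:    ${think:.4f} per 1M tokens")
print(f"Think is {savings:.0%} cheaper")
```

At a 3:1 mix this gives roughly $0.30 for Instruct and $0.2375 for Think per 1M tokens. A workload with a different input/output ratio can be checked by changing `input_parts` and `output_parts`.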

Olmo 3.1 32B Instruct Providers

DeepInfra $0.20 (Cheapest)

Olmo 3.1 32B Think Providers

Parasail $0.15 (Cheapest)

Benchmark Comparison

Benchmarks Compared: 6
Olmo 3.1 32B Instruct Wins: 0
Olmo 3.1 32B Think Wins: 3

Benchmark Scores

Benchmark | Olmo 3.1 32B Instruct | Olmo 3.1 32B Think | Winner
Intelligence Index (overall intelligence score) | 12.0 | 14.2 | Olmo 3.1 32B Think
Coding Index (code generation & understanding) | 5.6 | 9.8 | Olmo 3.1 32B Think
Math Index (mathematical reasoning) | - | 77.3 | -
MMLU Pro (academic knowledge) | - | 76.3 | -
GPQA (graduate-level science) | 53.9 | 59.1 | Olmo 3.1 32B Think
LiveCodeBench (competitive programming) | - | 69.5 | -

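Olmo 3.1 32B Think leads on all three benchmarks where both models have scores (Intelligence Index, Coding Index, GPQA). Math Index, MMLU Pro, and LiveCodeBench scores are listed only for Olmo 3.1 32B Think, so no direct comparison is available for those.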

[Chart: Cost vs Quality]

Context & Performance

Context Window

Olmo 3.1 32B Instruct: 65,536 tokens
Olmo 3.1 32B Think: 65,536 tokens
Max output: 65,536 tokens

Speed Performance

Metric | Olmo 3.1 32B Instruct | Olmo 3.1 32B Think | Winner
Tokens/second | 47.2 tok/s | 67.5 tok/s | Olmo 3.1 32B Think
Time to First Token | 0.26s | 0.59s | Olmo 3.1 32B Instruct

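Olmo 3.1 32B Think generates output about 43% faster (67.5 vs 47.2 tok/s), while Olmo 3.1 32B Instruct has the lower time to first token (0.26s vs 0.59s).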
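
Because the two speed metrics point in opposite directions, which model finishes a response first depends on output length. The following is a rough sketch under a simple assumption: total response time is the time to first token plus output tokens divided by tokens/second. The function names are hypothetical, and the model ignores any extra reasoning tokens that Olmo 3.1 32B Think may emit before its answer.

```python
def response_time(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Estimated seconds for a full response: wait for the first token, then stream the rest."""
    return ttft_s + output_tokens / tokens_per_s


def break_even_tokens(ttft_a: float, tps_a: float, ttft_b: float, tps_b: float) -> float:
    """Output length at which model A and model B finish at the same time.

    Solves ttft_a + n / tps_a == ttft_b + n / tps_b for n.
    """
    return (ttft_b - ttft_a) / (1 / tps_a - 1 / tps_b)


# (time to first token in seconds, tokens per second) from the speed table.
INSTRUCT = (0.26, 47.2)
THINK = (0.59, 67.5)

for n in (20, 500):
    t_instruct = response_time(*INSTRUCT, n)
    t_think = response_time(*THINK, n)
    print(f"{n} tokens: Instruct {t_instruct:.2f}s, Think {t_think:.2f}s")

crossover = break_even_tokens(*INSTRUCT, *THINK)
print(f"Break-even output length: {crossover:.1f} tokens")
```

Under these assumptions the crossover is at about 52 output tokens: shorter replies complete sooner on Instruct, longer ones on Think.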

Capabilities

Feature Comparison

Feature | Olmo 3.1 32B Instruct | Olmo 3.1 32B Think
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

Property | Olmo 3.1 32B Instruct | Olmo 3.1 32B Think
License | Proprietary | Open Source
Author | Allenai | Allenai
Released | Jan 2026 | Dec 2025

Olmo 3.1 32B Instruct Modalities

Input: text
Output: text

Olmo 3.1 32B Think Modalities

Input: text
Output: text

Frequently Asked Questions

Olmo 3.1 32B Think is cheaper on both input ($0.15 vs $0.20 per 1M tokens) and output ($0.50 vs $0.60 per 1M tokens).
Olmo 3.1 32B Think scores higher on the Coding Index, at 9.8 compared to 5.6 for Olmo 3.1 32B Instruct.
Both models have a 65,536-token context window.
Neither model supports vision (image input).