Price Per TokenPrice Per Token
Allenai
vs
Allenai

Olmo 3 32B Think vs Olmo 3.1 32B Instruct

A detailed comparison of pricing, benchmarks, and capabilities

Sponsor Price Per Token Reach 5000+ developers comparing LLM APIs

125 out of our 298 tracked models have had a price change in January.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

Olmo 3 32B Think wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Better at coding
  • Better at math
  • Has reasoning mode

Olmo 3.1 32B Instruct wins:

  • Faster response time
Price Advantage
Olmo 3 32B Think
Benchmark Advantage
Olmo 3 32B Think
Context Window
Olmo 3.1 32B Instruct
Speed
Olmo 3.1 32B Instruct

Pricing Comparison

Price Comparison

MetricOlmo 3 32B ThinkOlmo 3.1 32B InstructWinner
Input (per 1M tokens)$0.15$0.20 Olmo 3 32B Think
Output (per 1M tokens)$0.50$0.60 Olmo 3 32B Think
Using a 3:1 input/output ratio, Olmo 3 32B Think is 21% cheaper overall.

Olmo 3 32B Think Providers

Parasail $0.15 (Cheapest)

Olmo 3.1 32B Instruct Providers

DeepInfra $0.20 (Cheapest)

Benchmark Comparison

6
Benchmarks Compared
2
Olmo 3 32B Think Wins
0
Olmo 3.1 32B Instruct Wins

Benchmark Scores

BenchmarkOlmo 3 32B ThinkOlmo 3.1 32B InstructWinner
Intelligence Index
Overall intelligence score
12.012.0-
Coding Index
Code generation & understanding
10.55.6
Math Index
Mathematical reasoning
73.7--
MMLU Pro
Academic knowledge
75.9--
GPQA
Graduate-level science
61.053.9
LiveCodeBench
Competitive programming
67.2--
Olmo 3 32B Think shows stronger mathematical reasoning abilities.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
Olmo 3 32B Think
Other models

Context & Performance

Context Window

Olmo 3 32B Think
65,536
tokens
Max output: 65,536 tokens
Olmo 3.1 32B Instruct
65,536
tokens

Speed Performance

MetricOlmo 3 32B ThinkOlmo 3.1 32B InstructWinner
Tokens/second0.0 tok/s47.2 tok/s
Time to First Token0.00s0.26s
Olmo 3.1 32B Instruct responds Infinity% faster on average.

Capabilities

Feature Comparison

FeatureOlmo 3 32B ThinkOlmo 3.1 32B Instruct
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyOlmo 3 32B ThinkOlmo 3.1 32B Instruct
LicenseOpen SourceProprietary
AuthorAllenaiAllenai
ReleasedNov 2025Jan 2026

Olmo 3 32B Think Modalities

Input
text
Output
text

Olmo 3.1 32B Instruct Modalities

Input
text
Output
text

Related Comparisons

Compare Olmo 3 32B Think with:

Compare Olmo 3.1 32B Instruct with:

Frequently Asked Questions

Olmo 3 32B Think has cheaper input pricing at $0.15/M tokens. Olmo 3 32B Think has cheaper output pricing at $0.50/M tokens.
Olmo 3 32B Think scores higher on coding benchmarks with a score of 10.5, compared to Olmo 3.1 32B Instruct's score of 5.6.
Olmo 3 32B Think has a 65,536 token context window, while Olmo 3.1 32B Instruct has a 65,536 token context window.
Olmo 3 32B Think does not support vision. Olmo 3.1 32B Instruct does not support vision.