Price Per TokenPrice Per Token
Mistral AI
Mistral AI
vs
Qwen
Qwen

Devstral 2 2512 vs Qwen3 235B A22B Instruct 2507

A detailed comparison of pricing, benchmarks, and capabilities

Get our weekly newsletter on pricing changes, new releases, and tools.

Join the Price Per Token Community

Key Takeaways

Devstral 2 2512 wins:

  • Better at coding

Qwen3 235B A22B Instruct 2507 wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Faster response time
  • Higher intelligence benchmark
  • Better at math
  • Has reasoning mode
Price Advantage
Qwen3 235B A22B Instruct 2507
Benchmark Advantage
Qwen3 235B A22B Instruct 2507
Context Window
Qwen3 235B A22B Instruct 2507
Speed
Qwen3 235B A22B Instruct 2507

Pricing Comparison

Benchmark Comparison

Context & Performance

Capabilities

Feature Comparison

FeatureDevstral 2 2512Qwen3 235B A22B Instruct 2507
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyDevstral 2 2512Qwen3 235B A22B Instruct 2507
LicenseOpen SourceOpen Source
AuthorMistral AIQwen
ReleasedDec 2025Jul 2025

Devstral 2 2512 Modalities

Input
text
Output
text

Qwen3 235B A22B Instruct 2507 Modalities

Input
text
Output
text

Frequently Asked Questions

Qwen3 235B A22B Instruct 2507 has cheaper input pricing at $0.07/M tokens. Qwen3 235B A22B Instruct 2507 has cheaper output pricing at $0.10/M tokens.
Devstral 2 2512 scores higher on coding benchmarks with a score of 23.7, compared to Qwen3 235B A22B Instruct 2507's score of 22.1.
Devstral 2 2512 has a 262,144 token context window, while Qwen3 235B A22B Instruct 2507 has a 262,144 token context window.
Devstral 2 2512 does not support vision. Qwen3 235B A22B Instruct 2507 does not support vision.