Price Per Token

OpenAI vs Qwen

GPT-OSS-120b vs Qwen2.5 VL 32B Instruct

A detailed comparison of pricing, benchmarks, and capabilities


Key Takeaways

GPT-OSS-120b wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Larger context window
  • Faster response time
  • Higher intelligence benchmark
  • Better at coding
  • Better at math

Qwen2.5 VL 32B Instruct wins:

  • Supports vision
  • Supports tool calls
Price Advantage: GPT-OSS-120b
Benchmark Advantage: GPT-OSS-120b
Context Window: GPT-OSS-120b
Speed: GPT-OSS-120b

Pricing Comparison

Metric | GPT-OSS-120b | Qwen2.5 VL 32B Instruct | Winner
Input (per 1M tokens) | $0.04 | $0.05 | GPT-OSS-120b
Output (per 1M tokens) | $0.19 | $0.22 | GPT-OSS-120b
Cache Read (per 1M) | N/A | $25000.00 | Qwen2.5 VL 32B Instruct

Using a 3:1 input/output ratio, GPT-OSS-120b is about 16% cheaper overall.
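The blended comparison can be reproduced with a short calculation (prices from the table above; the 3:1 input/output ratio is the page's stated assumption):

```python
# Blended price per 1M tokens, weighting input and output by a token ratio.
def blended_price(input_per_m: float, output_per_m: float, ratio=(3, 1)) -> float:
    i, o = ratio
    return (input_per_m * i + output_per_m * o) / (i + o)

gpt_oss = blended_price(0.04, 0.19)  # $0.0775 per 1M tokens
qwen = blended_price(0.05, 0.22)     # $0.0925 per 1M tokens
savings = 1 - gpt_oss / qwen         # ~0.162, i.e. about 16% cheaper
print(f"GPT-OSS-120b is {savings:.0%} cheaper")  # → GPT-OSS-120b is 16% cheaper
```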

GPT-OSS-120b Providers

Chutes $0.04 (Cheapest)
SiliconFlow $0.05
Novita $0.05
Clarifai $0.09
Google $0.09

Qwen2.5 VL 32B Instruct Providers

Chutes $0.05 (Cheapest)
DeepInfra $0.20

Benchmark Comparison

Benchmarks compared: 8
GPT-OSS-120b wins: 4
Qwen2.5 VL 32B Instruct wins: 0

Benchmark Scores

Benchmark | GPT-OSS-120b | Qwen2.5 VL 32B Instruct | Winner
Intelligence Index (overall intelligence score) | 33.3 | 13.2 | GPT-OSS-120b
Coding Index (code generation & understanding) | 28.6 | — | —
Math Index (mathematical reasoning) | 93.4 | — | —
MMLU Pro (academic knowledge) | 80.8 | 69.7 | GPT-OSS-120b
GPQA (graduate-level science) | 78.2 | 46.6 | GPT-OSS-120b
LiveCodeBench (competitive programming) | 87.8 | 24.8 | GPT-OSS-120b
Aider (real-world code editing) | 41.8 | — | —
AIME (competition math) | — | 11.0 | —
GPT-OSS-120b significantly outperforms in coding benchmarks.
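The win tally quoted above (4 to 0 across 8 benchmarks) can be reproduced from the table; a sketch, where `None` marks a benchmark with no reported score for one model:

```python
# Head-to-head benchmark tally: (GPT-OSS-120b score, Qwen2.5 VL 32B score).
scores = {
    "Intelligence Index": (33.3, 13.2),
    "Coding Index":       (28.6, None),
    "Math Index":         (93.4, None),
    "MMLU Pro":           (80.8, 69.7),
    "GPQA":               (78.2, 46.6),
    "LiveCodeBench":      (87.8, 24.8),
    "Aider":              (41.8, None),
    "AIME":               (None, 11.0),
}

# Only benchmarks where both models report a score are comparable.
comparable = {k: v for k, v in scores.items() if None not in v}
gpt_wins = sum(a > b for a, b in comparable.values())
qwen_wins = sum(b > a for a, b in comparable.values())
print(gpt_wins, qwen_wins)  # → 4 0
```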

Cost vs Quality

(Interactive scatter chart not reproduced: it plots cost against quality for GPT-OSS-120b alongside other tracked models.)

Context & Performance

Context Window

GPT-OSS-120b: 131,072 tokens
Qwen2.5 VL 32B Instruct: 16,384 tokens
Max output: 16,384 tokens
GPT-OSS-120b's context window is 8x larger (131,072 vs 16,384 tokens); equivalently, Qwen2.5 VL 32B Instruct's window is 88% smaller.
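The size difference works out as follows (token counts from above):

```python
# Context window comparison using the quoted token counts.
gpt_oss_ctx = 131_072
qwen_ctx = 16_384

ratio = gpt_oss_ctx / qwen_ctx            # 8.0 — GPT-OSS-120b is 8x larger
pct_smaller = 1 - qwen_ctx / gpt_oss_ctx  # 0.875 — Qwen's window is 87.5% smaller
print(f"{ratio:.0f}x larger; the smaller window is {pct_smaller:.0%} smaller")
```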

Speed Performance

Metric | GPT-OSS-120b | Qwen2.5 VL 32B Instruct | Winner
Tokens/second | 311.5 tok/s | no data | GPT-OSS-120b
Time to First Token | 0.47s | no data | GPT-OSS-120b

No speed measurements are available for Qwen2.5 VL 32B Instruct (reported as 0.0 tok/s), so a percentage speed difference cannot be computed.
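The "Infinity%" figure on the original page comes from dividing by a missing (zero) throughput. A guarded comparison, as a sketch, avoids the artifact:

```python
# Percent speedup of a over b, guarding against a missing (zero) baseline.
def speedup_pct(a_tok_s: float, b_tok_s: float):
    """Percent faster a is than b, or None if b has no data."""
    if b_tok_s <= 0:
        return None  # missing or zero baseline: comparison undefined
    return (a_tok_s / b_tok_s - 1) * 100

print(speedup_pct(311.5, 0.0))     # → None (no valid comparison)
print(speedup_pct(311.5, 155.75))  # → 100.0 (twice as fast)
```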

Capabilities

Feature Comparison

Feature | GPT-OSS-120b | Qwen2.5 VL 32B Instruct
Vision (Image Input) | No | Yes
Tool/Function Calls | — | Yes
Reasoning Mode | — | —
Audio Input | — | —
Audio Output | — | —
PDF Input | — | —
Prompt Caching | — | —
Web Search | — | —

License & Release

Property | GPT-OSS-120b | Qwen2.5 VL 32B Instruct
License | Open Source | Open Source
Author | OpenAI | Qwen
Released | Aug 2025 | Mar 2025

GPT-OSS-120b Modalities

Input: text
Output: text

Qwen2.5 VL 32B Instruct Modalities

Input: text, image
Output: text


Frequently Asked Questions

Which model is cheaper?
GPT-OSS-120b has cheaper input pricing ($0.04/M tokens) and cheaper output pricing ($0.19/M tokens).

Which model is better at coding?
GPT-OSS-120b scores 28.6 on the coding index; Qwen2.5 VL 32B Instruct has no reported coding score.

Which model has the larger context window?
GPT-OSS-120b has a 131,072-token context window, while Qwen2.5 VL 32B Instruct has a 16,384-token context window.

Which model supports vision?
Qwen2.5 VL 32B Instruct supports vision (image input); GPT-OSS-120b does not.