GPT-5.1 Codex mini (high) vs Qwen3.5 35B A3B (Reasoning)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

OpenAI

GPT-5.1 Codex mini (high)

Input: $0.25/M
Output: $2/M
Speed: 171 tok/s
TTFT: 3.18s

Alibaba

Qwen3.5 35B A3B (Reasoning)

Input: $0.25/M
Output: $2/M
Speed: 131 tok/s
TTFT: 1.06s

Winner by Category

Cheaper: Tie
Faster (tok/s): GPT-5.1 Codex mini (high)
Lower Latency: Qwen3.5 35B A3B (Reasoning)
Benchmarks (8-3): GPT-5.1 Codex mini (high)

Pricing Comparison

| Metric | GPT-5.1 Codex mini (high) | Qwen3.5 35B A3B (Reasoning) |
| --- | --- | --- |
| Input ($/M tokens) | $0.25 | $0.25 |
| Output ($/M tokens) | $2 | $2 |

Cost for 1M input + 100K output tokens:
GPT-5.1 Codex mini (high): $0.45
Qwen3.5 35B A3B (Reasoning): $0.45
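
The $0.45 figure follows directly from the per-million-token prices: 1M input tokens at $0.25/M plus 100K output tokens at $2/M. The sketch below reproduces that arithmetic; `request_cost` and the `PRICES` table are hypothetical names introduced here for illustration, not part of any vendor API.

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost of a workload, given prices per million tokens."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# (input $/M, output $/M) as listed in the pricing comparison.
PRICES = {
    "GPT-5.1 Codex mini (high)": (0.25, 2.00),
    "Qwen3.5 35B A3B (Reasoning)": (0.25, 2.00),
}

for model, (input_price, output_price) in PRICES.items():
    cost = request_cost(1_000_000, 100_000, input_price, output_price)
    print(f"{model}: ${cost:.2f}")
```

Because the two price lists are identical, any input/output mix costs the same on both models.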

Speed Comparison

Output Speed (tokens/s), higher is better:
GPT-5.1 Codex mini (high): 171 tok/s
Qwen3.5 35B A3B (Reasoning): 131 tok/s

Time to First Token (seconds), lower is better:
GPT-5.1 Codex mini (high): 3.18s
Qwen3.5 35B A3B (Reasoning): 1.06s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

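| Benchmark | GPT-5.1 Codex mini (high) | Qwen3.5 35B A3B (Reasoning) |
| --- | --- | --- |
| Intelligence Index | 38.6 | 37.1 |
| Coding Index | 36.4 | 30.3 |
| Math Index | 91.7 | - |
| GPQA Diamond | 81.3% | 84.5% |
| MMLU-Pro | 82.0% | - |
| LiveCodeBench | 83.6% | - |
| AIME 2025 | 91.7% | - |
| MATH-500 | - | - |
| Humanity's Last Exam | 16.9% | 19.7% |
| SciCode | 42.6% | 37.7% |
| IFBench | 67.9% | 72.5% |
| TerminalBench | 33.3% | 26.5% |

GPT-5.1 Codex mini (high): 8 wins
Qwen3.5 35B A3B (Reasoning): 3 wins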
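
The 8-3 tally across 12 benchmarks can be reproduced from the listed scores. The sketch below assumes that each benchmark showing a single score belongs to GPT-5.1 Codex mini (high) and counts as a win for it, which is the only reading consistent with the stated tally; it also assumes the first score in each pair is GPT-5.1 Codex mini (high). `SCORES` and `tally` are hypothetical names used only for this illustration.

```python
# (GPT-5.1 Codex mini (high), Qwen3.5 35B A3B (Reasoning)); None = no score listed.
SCORES = {
    "Intelligence Index": (38.6, 37.1),
    "Coding Index": (36.4, 30.3),
    "Math Index": (91.7, None),      # assumption: the single score is GPT's
    "GPQA Diamond": (81.3, 84.5),
    "MMLU-Pro": (82.0, None),        # assumption: the single score is GPT's
    "LiveCodeBench": (83.6, None),   # assumption: the single score is GPT's
    "AIME 2025": (91.7, None),       # assumption: the single score is GPT's
    "MATH-500": (None, None),
    "Humanity's Last Exam": (16.9, 19.7),
    "SciCode": (42.6, 37.7),
    "IFBench": (67.9, 72.5),
    "TerminalBench": (33.3, 26.5),
}


def tally(scores):
    """Count wins per model; a lone score wins by default, exact ties count for nobody."""
    wins_a = wins_b = unscored = 0
    for a, b in scores.values():
        if a is None and b is None:
            unscored += 1
        elif b is None or (a is not None and a > b):
            wins_a += 1
        elif a is None or b > a:
            wins_b += 1
    return wins_a, wins_b, unscored


gpt_wins, qwen_wins, unscored = tally(SCORES)
print(f"GPT wins: {gpt_wins}, Qwen wins: {qwen_wins}, unscored: {unscored}")
```

Restricting the count to the seven benchmarks where both models have a score gives a narrower 4-3 margin, which is worth keeping in mind when reading the headline tally.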

Frequently Asked Questions

Which is cheaper, GPT-5.1 Codex mini (high) or Qwen3.5 35B A3B (Reasoning)?

Neither: the two models are priced identically, at $0.25 per million input tokens and $2 per million output tokens.

Which model performs better on benchmarks?

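GPT-5.1 Codex mini (high) wins 8 of the 12 benchmarks and Qwen3.5 35B A3B (Reasoning) wins 3; the remaining one, MATH-500, lists no score for either model. Qwen3.5 35B A3B (Reasoning) leads on GPQA Diamond, Humanity's Last Exam, and IFBench. See the benchmark comparison above for per-benchmark results.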

Which is faster for real-time applications?

GPT-5.1 Codex mini (high) generates tokens faster, at 171 tok/s vs 131 tok/s. However, Qwen3.5 35B A3B (Reasoning) starts responding sooner, with a time-to-first-token of 1.06s vs 3.18s, so it can finish short responses first.
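
To see how the two figures trade off, total response time can be approximated as TTFT plus output length divided by output speed. This is a simplified model: it assumes the listed speed is constant for the whole response and ignores network and queueing variance. `response_time` and `break_even_tokens` are hypothetical helper names introduced for this sketch.

```python
def response_time(output_tokens, ttft_s, tokens_per_s):
    """Approximate seconds until the last token arrives."""
    return ttft_s + output_tokens / tokens_per_s


def break_even_tokens(ttft_a, speed_a, ttft_b, speed_b):
    """Output length at which model A (slower start, faster stream) catches model B."""
    return (ttft_a - ttft_b) / (1 / speed_b - 1 / speed_a)


GPT = {"ttft_s": 3.18, "tokens_per_s": 171}   # GPT-5.1 Codex mini (high)
QWEN = {"ttft_s": 1.06, "tokens_per_s": 131}  # Qwen3.5 35B A3B (Reasoning)

n = break_even_tokens(GPT["ttft_s"], GPT["tokens_per_s"],
                      QWEN["ttft_s"], QWEN["tokens_per_s"])
print(f"Break-even output length: about {n:.0f} tokens")

for tokens in (200, 5000):
    t_gpt = response_time(tokens, **GPT)
    t_qwen = response_time(tokens, **QWEN)
    print(f"{tokens} tokens: GPT {t_gpt:.1f}s, Qwen {t_qwen:.1f}s")
```

Under these assumptions the break-even point is roughly 1,187 output tokens: below it Qwen3.5 35B A3B (Reasoning) finishes first, above it GPT-5.1 Codex mini (high) does.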

When should I use GPT-5.1 Codex mini (high) vs Qwen3.5 35B A3B (Reasoning)?

Price is identical, so choose on performance. GPT-5.1 Codex mini (high) has the stronger benchmark results (8 wins to 3) and the faster generation speed (171 vs 131 tok/s). Qwen3.5 35B A3B (Reasoning) has the lower time-to-first-token (1.06s vs 3.18s), which makes it the better fit for latency-sensitive apps.