Compare/Jamba 1.6 Large vs Qwen3 235B A22B 2507 (Reasoning)

Jamba 1.6 LargevsQwen3 235B A22B 2507 (Reasoning)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

AI21 Labs

Jamba 1.6 Large

Input
$2/M
Output
$8/M
Speed
55 tok/s
TTFT
0.77s
Alibaba

Qwen3 235B A22B 2507 (Reasoning)

Input
$0.7/M
Output
$8.4/M
Speed
40 tok/s
TTFT
1.43s

Winner by Category

Cheaper
Qwen3 235B A22B 2507 (Reasoning)
Faster (tok/s)
Jamba 1.6 Large
Lower Latency
Jamba 1.6 Large
Benchmarks (0-12)
Qwen3 235B A22B 2507 (Reasoning)

Pricing Comparison

MetricJamba 1.6 LargeQwen3 235B A22B 2507 (Reasoning)
Input ($/M tokens)$2$0.7
Output ($/M tokens)$8$8.4
Cost for 1M input + 100K output tokens:
Jamba 1.6 Large$2.80
Qwen3 235B A22B 2507 (Reasoning)$1.54

Speed Comparison

Output Speed (tokens/s) — higher is better
Jamba 1.6 Large
55 tok/s
Qwen3 235B A22B 2507 (Reasoning)
40 tok/s
Time to First Token (seconds) — lower is better
Jamba 1.6 Large
0.77s
Qwen3 235B A22B 2507 (Reasoning)
1.43s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
10.629.5
Coding Index
23.2
Math Index
91.0
GPQA Diamond
38.7%79.0%
MMLU-Pro
56.5%84.3%
LiveCodeBench
17.2%78.8%
AIME 2025
91.0%
MATH-500
58.0%98.4%
Humanity's Last Exam
4.0%15.0%
SciCode
18.4%42.4%
IFBench
51.2%
TerminalBench
13.6%
Jamba 1.6 Large0 wins
12 winsQwen3 235B A22B 2507 (Reasoning)

Frequently Asked Questions

Which is cheaper, Jamba 1.6 Large or Qwen3 235B A22B 2507 (Reasoning)?

Qwen3 235B A22B 2507 (Reasoning) is cheaper overall. Its blended price (3:1 input/output ratio) is $2.63/M tokens vs $3.50/M for Jamba 1.6 Large.

Which model performs better on benchmarks?

Qwen3 235B A22B 2507 (Reasoning) wins 12 out of 12 benchmarks compared to 0 for Jamba 1.6 Large. See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Jamba 1.6 Large generates tokens faster at 55 tok/s vs 40 tok/s. Jamba 1.6 Large also has lower time-to-first-token (0.77s vs 1.43s).

When should I use Jamba 1.6 Large vs Qwen3 235B A22B 2507 (Reasoning)?

Choose based on your priorities: Qwen3 235B A22B 2507 (Reasoning) for lower cost, Qwen3 235B A22B 2507 (Reasoning) for stronger benchmark performance, and Jamba 1.6 Large for faster generation. For latency-sensitive apps, check the TTFT comparison above.