Alibaba

Qwen3 32B (Non-reasoning)

AI model by Alibaba. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.15
Output$0.59
Blended (3:1)$0.26

Source: Artificial Analysis

Performance

Output Speed102 tok/s
Time to First Token1132ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Qwen3 32B (Non-reasoning)Current
$0.15$0.59102 tok/s
gpt-oss-120b (low)
$0.15$0.60276 tok/s
gpt-oss-120b (high)
$0.15$0.60272 tok/s
Mistral Small 4 (Non-reasoning)
$0.15$0.60140 tok/s
Mistral Small 4 (Reasoning)
$0.15$0.60149 tok/s
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
$0.20$0.60230 tok/s

Example Costs

Single Request
$0.0004
1.0K in / 500 out
1K Requests/day
$0.4450
1.0M in / 500.0K out
10K Requests/day
$4.45
10.0M in / 5.0M out