IBM

Granite 4.1 8B

AI model by IBM. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $0.05
Output: $0.10
Blended (3:1): $0.06

Source: Artificial Analysis
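
The blended figure is consistent with a weighted average of three input tokens for every output token. A minimal sketch of that arithmetic follows, assuming this is how the 3:1 ratio is applied; the function name `blended_price` is hypothetical and not taken from the source.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens for every 1 output token (3:1 by default).
    """
    return (ratio * input_price + output_price) / (ratio + 1)


# Prices per 1M tokens as listed for Granite 4.1 8B.
price = blended_price(0.05, 0.10)
print(f"${price:.4f} per 1M tokens")
```

Under this assumption the result is $0.0625, which matches the listed $0.06 after rounding to two decimals.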

Performance

Output Speed: 127 tok/s
Time to First Token: 474 ms

Median values from Artificial Analysis
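
As a rough illustration of how these two medians combine, the sketch below estimates the total time for one response. It assumes the output speed stays constant after the first token and that both median values apply to the request in question; real requests will vary. The function name is hypothetical.

```python
def estimated_response_seconds(
    output_tokens: int,
    ttft_ms: float = 474.0,          # median time to first token, from the listing
    tokens_per_second: float = 127.0,  # median output speed, from the listing
) -> float:
    """Estimate seconds to receive a full response: wait for the first token, then stream."""
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


seconds = estimated_response_seconds(500)
print(f"{seconds:.1f} s for 500 output tokens")
```

For a 500-token response this comes to roughly 4.4 seconds under the stated assumptions.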

Compare with similar models

Model                        Input   Output  Speed
Granite 4.1 8B (current)     $0.05   $0.10   127 tok/s
Ministral 3 3B               $0.10   $0.10   180 tok/s
Qwen3.5 2B (Reasoning)       $0.02   $0.10   0 tok/s
Qwen3.5 2B (Non-reasoning)   $0.02   $0.10   318 tok/s
Llama 3.1 Instruct 8B        $0.10   $0.10   194 tok/s
LFM2 24B A2B                 $0.03   $0.12   142 tok/s

Example Costs

Single Request: $0.0001 (1.0K in / 500 out)
1K Requests/day: $0.1000 (1.0M in / 500.0K out)
10K Requests/day: $1.00 (10.0M in / 5.0M out)
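
The example costs follow directly from the per-1M-token prices. The sketch below reproduces them, assuming each request uses 1,000 input tokens and 500 output tokens as in the single-request example; the constant and function names are hypothetical.

```python
# Listed prices for Granite 4.1 8B, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.10


def request_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> float:
    """Total USD cost for `requests` identical requests."""
    per_request = (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000
    return per_request * requests


single = request_cost(1_000, 500)
daily_1k = request_cost(1_000, 500, requests=1_000)
daily_10k = request_cost(1_000, 500, requests=10_000)
print(f"single: ${single:.4f}, 1K/day: ${daily_1k:.2f}, 10K/day: ${daily_10k:.2f}")
```

At these prices the input and output halves of a single request each cost $0.00005, so the totals are $0.0001 per request, $0.10 for 1K requests per day, and $1.00 for 10K requests per day.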