Meta

Llama 3.1 Instruct 8B

AI model by Meta. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $0.10
Output: $0.10
Blended (3:1): $0.10

Source: Artificial Analysis
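
The blended figure can be reproduced with a short sketch. It assumes that "Blended (3:1)" means a weighted average of three parts input price to one part output price; the page does not spell out the formula, so treat that weighting as an assumption. The function name `blended_price` is hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average price per 1M tokens.

    Assumption: a 3:1 blend weights input tokens three times as
    heavily as output tokens. The source page does not define it.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices per 1M tokens as listed for Llama 3.1 Instruct 8B.
llama_blended = blended_price(0.10, 0.10)
print(f"Blended (3:1): ${llama_blended:.2f}")
```

Because the input and output prices are equal here, any weighting gives the same blended value; the ratio only matters for models that price the two differently.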

Performance

Output Speed: 178 tok/s
Time to First Token: 441 ms

Median values from Artificial Analysis
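
As a rough illustration of how these two numbers combine, the sketch below estimates the total response time as time to first token plus generation time at the median output speed. This is a simplifying assumption, not a formula given by the source: real latency varies with provider, load, and prompt length. The function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_ms: float = 441.0,
                               tokens_per_second: float = 178.0) -> float:
    """Approximate end-to-end latency in seconds.

    Assumption: latency = time to first token + output_tokens / speed,
    using the median values listed on the page as defaults.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


# Estimate for a 500-token response.
print(f"{estimated_response_seconds(500):.2f} s")
```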

Compare with similar models

Model                              Input    Output   Speed
Llama 3.1 Instruct 8B (current)    $0.10    $0.10    178 tok/s
Ministral 3 3B                     $0.10    $0.10    279 tok/s
Qwen3.5 2B (Non-reasoning)         $0.02    $0.10    272 tok/s
Qwen3.5 2B (Reasoning)             $0.02    $0.10    361 tok/s
LFM2 24B A2B                       $0.03    $0.12    203 tok/s
Devstral Small (May '25)           $0.06    $0.12    0 tok/s

Example Costs

Scenario            Cost        Tokens
Single request      $0.00015    1.0K in / 500 out
1K requests/day     $0.15       1.0M in / 500.0K out
10K requests/day    $1.50       10.0M in / 5.0M out
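
The example costs follow directly from the per-1M-token prices. A minimal sketch of the arithmetic, assuming every request uses 1,000 input tokens and 500 output tokens as in the examples above; the names `PRICE_PER_MILLION`, `request_cost`, and `daily_cost` are hypothetical.

```python
# USD per 1M tokens, as listed for Llama 3.1 Instruct 8B.
PRICE_PER_MILLION = {"input": 0.10, "output": 0.10}


def request_cost(input_tokens: int, output_tokens: int,
                 prices: dict = PRICE_PER_MILLION) -> float:
    """Cost in USD of one request at the given per-1M-token prices."""
    return (input_tokens * prices["input"]
            + output_tokens * prices["output"]) / 1_000_000


def daily_cost(requests_per_day: int,
               input_tokens: int = 1_000,
               output_tokens: int = 500) -> float:
    """Cost in USD per day, assuming every request has the same size."""
    return requests_per_day * request_cost(input_tokens, output_tokens)


for label, n in [("Single request", 1), ("1K requests/day", 1_000),
                 ("10K requests/day", 10_000)]:
    print(f"{label}: ${daily_cost(n):.5f}")
```

Swapping in another model's prices from the comparison table only requires passing a different `prices` dictionary to `request_cost`.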