NVIDIA

Llama Nemotron Super 49B v1.5 (Non-reasoning)

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.10
Output$0.40
Blended (3:1)$0.17

Source: Artificial Analysis

Performance

Output Speed67 tok/s
Time to First Token339ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Llama Nemotron Super 49B v1.5 (Non-reasoning)Current
$0.10$0.4067 tok/s
Gemma 4 26B A4B (Reasoning)
$0.13$0.400 tok/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
$0.10$0.400 tok/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
$0.10$0.400 tok/s
Hermes 4 - Llama-3.1 70B (Non-reasoning)
$0.13$0.4080 tok/s
Hermes 4 - Llama-3.1 70B (Reasoning)
$0.13$0.4082 tok/s

Example Costs

Single Request
$0.0003
1.0K in / 500 out
1K Requests/day
$0.3000
1.0M in / 500.0K out
10K Requests/day
$3.00
10.0M in / 5.0M out