NVIDIA

NVIDIA Nemotron Nano 9B V2 (Reasoning)

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.04
Output$0.16
Blended (3:1)$0.07

Source: Artificial Analysis

Performance

Output Speed165 tok/s
Time to First Token191ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
NVIDIA Nemotron Nano 9B V2 (Reasoning)Current
$0.04$0.16165 tok/s
Llama 3.2 Instruct 11B (Vision)
$0.16$0.1686 tok/s
Ministral 3 8B
$0.15$0.15165 tok/s
Qwen3.5 4B (Non-reasoning)
$0.03$0.15220 tok/s
Qwen3.5 4B (Reasoning)
$0.03$0.15227 tok/s
Solar Mini
$0.15$0.1595 tok/s

Example Costs

Single Request
$0.0001
1.0K in / 500 out
1K Requests/day
$0.1200
1.0M in / 500.0K out
10K Requests/day
$1.20
10.0M in / 5.0M out