NVIDIA

Nemotron 3 Nano Omni 30B A3B Reasoning

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $0.07
Output: $0.30
Blended (3:1): $0.13

Source: Artificial Analysis
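
The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token and that the blended price is the token-weighted average; the function name `blended_price` is hypothetical and not part of any published API.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: `ratio` is the number of input tokens per output token,
    so 3.0 corresponds to the 3:1 blend listed above.
    """
    return (ratio * input_price + output_price) / (ratio + 1.0)


# Listed prices per 1M tokens: $0.07 input, $0.30 output.
blended = blended_price(0.07, 0.30)
print(f"Blended (3:1): about ${blended:.4f} per 1M tokens")
```

Under these assumptions the result is about $0.1275 per 1M tokens, consistent with the listed $0.13 after rounding to cents.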

Performance

Output Speed: 307 tok/s
Time to First Token: 585 ms

Median values from Artificial Analysis

Compare with similar models

Model                                              Input   Output  Speed
Nemotron 3 Nano Omni 30B A3B Reasoning (current)   $0.07   $0.30   307 tok/s
MiMo-V2-Flash (Feb 2026)                           $0.10   $0.30   150 tok/s
MiMo-V2-Flash (Non-reasoning)                      $0.10   $0.30   146 tok/s
Ling 2.6 Flash                                     $0.10   $0.30   0 tok/s
Devstral Small (Jul '25)                           $0.10   $0.30   192 tok/s
Step 3.5 Flash                                     $0.10   $0.30   193 tok/s

Example Costs

Single Request: $0.0002 (1.0K in / 500 out)
1K Requests/day: $0.2250 (1.0M in / 500.0K out)
10K Requests/day: $2.25 (10.0M in / 5.0M out)
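
The example costs follow from the per-1M-token prices. A minimal sketch, assuming each request uses 1,000 input tokens and 500 output tokens as listed; `request_cost` is a hypothetical helper, and the default prices are the rounded figures shown in the Pricing section.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 0.07, output_price: float = 0.30) -> float:
    """Cost in USD of one request, given prices per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


single = request_cost(1_000, 500)   # one request: 1.0K in / 500 out
daily_1k = 1_000 * single           # 1K requests/day
daily_10k = 10_000 * single         # 10K requests/day

print(f"Single request: ${single:.5f}")
print(f"1K requests/day: ${daily_1k:.4f}")
print(f"10K requests/day: ${daily_10k:.4f}")
```

With the rounded prices this gives about $0.00022 per request, $0.22 for 1K requests, and $2.20 for 10K requests. The listed daily figures ($0.2250 and $2.25) are slightly higher, which may mean the page computes them from unrounded prices; that explanation is an assumption, not something the source states.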