NVIDIA
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Coding Index: 7.6
Math Index: 7.7
GPQA Diamond: 51.7%
MMLU-Pro: 69.8%
LiveCodeBench: 28.0%
AIME 2025: 7.7%
MATH-500: 77.5%
SciCode: 22.9%
IFBench: 39.5%
TerminalBench: 0.0%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
| Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) (current) | $0.00 | $0.00 | 0 tok/s |
| gpt-oss-120B (high) | $0.15 | $0.60 | 232 tok/s |
| GPT-5.4 mini (xhigh) | $0.75 | $4.50 | 188 tok/s |
| GPT-5.4 nano (xhigh) | $0.20 | $1.25 | 206 tok/s |
| gpt-oss-120B (low) | $0.15 | $0.60 | 225 tok/s |
| GPT-5.4 nano (Non-Reasoning) | $0.20 | $1.25 | 190 tok/s |
Example Costs
| Scenario | Tokens (in / out) | Cost |
|---|---|---|
| Single Request | 1.0K in / 500 out | <$0.0001 |
| 1K Requests/day | 1.0M in / 500.0K out | <$0.0001 |
| 10K Requests/day | 10.0M in / 5.0M out | <$0.0001 |
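The example costs above appear to follow from the per-token prices in the comparison table: tokens in times the input price, plus tokens out times the output price, scaled by the number of requests. The sketch below reproduces that arithmetic. It assumes the Input and Output columns are USD per 1M tokens, which the page does not state, and the names `Pricing`, `request_cost`, `daily_cost`, and `format_cost` are hypothetical helpers, not part of any API. The gpt-oss-120B prices are taken from the table purely as an illustration.

```python
from dataclasses import dataclass

# Assumption: listed prices are USD per 1,000,000 tokens (not stated on the page).
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Per-token pricing for one model (hypothetical helper type)."""
    input_usd: float   # price for TOKENS_PER_PRICE_UNIT input tokens
    output_usd: float  # price for TOKENS_PER_PRICE_UNIT output tokens


def request_cost(pricing: Pricing, tokens_in: int, tokens_out: int) -> float:
    """Cost in USD of a single request with the given token counts."""
    total = tokens_in * pricing.input_usd + tokens_out * pricing.output_usd
    return total / TOKENS_PER_PRICE_UNIT


def daily_cost(pricing: Pricing, requests_per_day: int,
               tokens_in: int = 1_000, tokens_out: int = 500) -> float:
    """Cost in USD per day, using the page's 1.0K in / 500 out request size."""
    return requests_per_day * request_cost(pricing, tokens_in, tokens_out)


def format_cost(usd: float) -> str:
    """Mimic the page's display: tiny or zero amounts show as '<$0.0001'."""
    return "<$0.0001" if usd < 0.0001 else f"${usd:,.4f}"


# Prices copied from the comparison table above.
nemotron = Pricing(input_usd=0.00, output_usd=0.00)
gpt_oss_120b = Pricing(input_usd=0.15, output_usd=0.60)

for name, pricing in [("Nemotron Super 49B v1", nemotron),
                      ("gpt-oss-120B", gpt_oss_120b)]:
    for requests in (1, 1_000, 10_000):
        print(name, requests, format_cost(daily_cost(pricing, requests)))
```

With the $0.00 prices listed for this model, every scenario stays below the display threshold, which is consistent with the `<$0.0001` shown in all three rows. Note that 1K requests at 1.0K in / 500 out gives the 1.0M in / 500.0K out total shown in the second row.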