NVIDIA
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Coding Index: 7.6
Math Index: 7.7
GPQA Diamond: 51.7%
MMLU-Pro: 69.8%
LiveCodeBench: 28.0%
AIME 2025: 7.7%
MATH-500: 77.5%
SciCode: 22.9%
IFBench: 39.5%
TerminalBench: 0.0%
Compare with similar models
| Model | Input price (USD per 1M tokens) | Output price (USD per 1M tokens) | Speed |
|---|---|---|---|
| Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) (current) | $0.00 | $0.00 | 0 tok/s |
| GPT-5.4 (Non-reasoning) | $2.50 | $15.00 | 62 tok/s |
| gpt-oss-120b (low) | $0.15 | $0.60 | 276 tok/s |
| gpt-oss-120b (high) | $0.15 | $0.60 | 272 tok/s |
| gpt-oss-20B (low) | $0.06 | $0.20 | 264 tok/s |
| Grok-1 | $0.00 | $0.00 | 0 tok/s |
Example Costs
| Scenario | Tokens | Cost |
|---|---|---|
| Single request | 1.0K in / 500 out | <$0.0001 |
| 1K requests/day | 1.0M in / 500.0K out | <$0.0001 |
| 10K requests/day | 10.0M in / 5.0M out | <$0.0001 |
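
The example costs above come from multiplying token counts by the listed prices. The sketch below shows that arithmetic under one assumption the page does not state: that the prices are in USD per 1M tokens. The function names `request_cost` and `daily_cost` are hypothetical, chosen only for illustration.

```python
TOKENS_PER_PRICE_UNIT = 1_000_000  # assumption: prices are quoted per 1M tokens


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the USD cost of one request at the given per-1M-token prices."""
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


def daily_cost(requests_per_day, input_tokens, output_tokens,
               input_price, output_price):
    """Return the USD cost of a day's traffic with identical requests."""
    per_request = request_cost(input_tokens, output_tokens,
                               input_price, output_price)
    return requests_per_day * per_request


if __name__ == "__main__":
    # Listed prices for the current model are $0.00 in and $0.00 out,
    # so every scenario works out to zero (displayed as <$0.0001).
    for requests in (1, 1_000, 10_000):
        print(requests, daily_cost(requests, 1_000, 500, 0.00, 0.00))

    # Same request shape at the gpt-oss-120b (low) prices from the table.
    print(request_cost(1_000, 500, 0.15, 0.60))
```

With the listed $0.00 prices every scenario costs nothing, which is consistent with the "<$0.0001" shown for all three rows. The same 1.0K in / 500 out request at the gpt-oss-120b (low) prices would cost about $0.00045 under the per-1M-token assumption.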