NVIDIA

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)

Unknown Size

By NVIDIA • Released 2025-04-07

Capability Radar

Avg Score
39

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
95.2
MMLU-Pro
Knowledge
82.5
GPQA Diamond
Knowledge
72.8
LiveCodeBench
Coding
64.1
AIME 2025
Reasoning
63.7
IFBench
Agent
38.2
SciCode
Reasoning Knowledge
34.7
Artificial Analysis Intelligence Index
Knowledge
15
Artificial Analysis Coding Index
Coding
13.1
𝜏²-Bench Telecom
Reasoning Knowledge
11.4
HLE
Knowledge Multi-Modal
8.1
LCR
Long-Context Reasoning
7.3
Terminal-Bench Hard
Agent Coding
2.3