xAI

Grok 2 (Dec '24)

Unknown Size

By xAI • Released 2024-12-12

Capability Radar

Avg Score
39

Across all benchmarks

Participated
7
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
77.8
MMLU-Pro
Knowledge
70.9
GPQA Diamond
Knowledge
51
SciCode
Reasoning Knowledge
28.5
LiveCodeBench
Coding
26.7
Artificial Analysis Intelligence Index
Knowledge
13.9
HLE
Knowledge Multi-Modal
3.8