xAI

Grok Beta

Unknown Size

By xAI • Released 2024-08-13

Capability Radar (chart not shown)

Avg Score: 38 (across all benchmarks)

Participated: 7 benchmarks

Benchmark Performance

Benchmark                                 Category                 Score
MATH-500                                  Reasoning                73.7
MMLU-Pro                                  Knowledge                70.3
GPQA Diamond                              Knowledge                47.1
SciCode                                   Reasoning, Knowledge     29.5
LiveCodeBench                             Coding                   24.1
Artificial Analysis Intelligence Index    Knowledge                13.3
HLE                                       Knowledge, Multi-Modal    4.7
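
The page does not say how the average score is computed. As a rough check, the sketch below assumes it is a plain unweighted mean of the seven listed scores, rounded to the nearest integer. The `scores` dictionary and the `mean_score` helper are illustrative names, not part of the source page.

```python
# Scores copied from the Benchmark Performance table.
scores = {
    "MATH-500": 73.7,
    "MMLU-Pro": 70.3,
    "GPQA Diamond": 47.1,
    "SciCode": 29.5,
    "LiveCodeBench": 24.1,
    "Artificial Analysis Intelligence Index": 13.3,
    "HLE": 4.7,
}


def mean_score(values):
    """Unweighted arithmetic mean (an assumed aggregation, not confirmed by the page)."""
    values = list(values)
    return sum(values) / len(values)


average = mean_score(scores.values())
print(f"{len(scores)} benchmarks, mean = {average:.2f}, rounded = {round(average)}")
```

Under that assumption the mean is about 37.53, which rounds to the displayed value of 38. Note that this mixes an aggregate index with individual benchmarks, so the page may well weight or select scores differently.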