Z AI

GLM-4.6 (Reasoning)

Unknown Size

By Z AI • Released 2025-09-30

Capability Radar

Avg Score
52

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
AIME 2025
Reasoning
86
MMLU-Pro
Knowledge
82.9
GPQA Diamond
Knowledge
78
𝜏²-Bench Telecom
Reasoning Knowledge
70.5
LiveCodeBench
Coding
69.5
LCR
Long-Context Reasoning
54.3
IFBench
Agent
43.4
SciCode
Reasoning Knowledge
38.4
Artificial Analysis Intelligence Index
Knowledge
32.5
Artificial Analysis Coding Index
Coding
29.5
Terminal-Bench Hard
Agent Coding
25
HLE
Knowledge Multi-Modal
13.3