Z AI

GLM-4.5 (Reasoning)

Unknown Size

By Z AI • Released 2025-07-28

Capability Radar

Avg Score
51

Across all benchmarks

Participated
14
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
97.9
MMLU-Pro
Knowledge
83.5
GPQA Diamond
Knowledge
78.2
LiveCodeBench
Coding
73.8
AIME 2025
Reasoning
73.7
SWE-bench (Bash Only)
Coding Agent
54.2
LCR
Long-Context Reasoning
48.3
IFBench
Agent
44.1
𝜏²-Bench Telecom
Reasoning Knowledge
43
SciCode
Reasoning Knowledge
34.8
Artificial Analysis Coding Index
Coding
26.3
Artificial Analysis Intelligence Index
Knowledge
26.2
Terminal-Bench Hard
Agent Coding
22
HLE
Knowledge Multi-Modal
12.2