Anthropic

Claude 4.5 Haiku (Reasoning)

Unknown Size

By Anthropic • Released 2025-10-15

Capability Radar

Avg Score
51

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
AIME 2025
Reasoning
83.7
MMLU-Pro
Knowledge
76
LCR
Long-Context Reasoning
70.3
GPQA Diamond
Knowledge
67.2
LiveCodeBench
Coding
61.5
𝜏²-Bench Telecom
Reasoning Knowledge
54.7
IFBench
Agent
54.3
SciCode
Reasoning Knowledge
43.3
Artificial Analysis Intelligence Index
Knowledge
37
Artificial Analysis Coding Index
Coding
32.6
Terminal-Bench Hard
Agent Coding
27.3
HLE
Knowledge Multi-Modal
9.7