Anthropic

Claude Opus 4.6 (Adaptive Reasoning)

Unknown Size

By Anthropic • Released 2026-02-05

[Figure: Capability Radar]

Avg Score: 60 (across all benchmarks)
Participated: 9 benchmarks

Benchmark Performance

Benchmark                                 Category                  Score
𝜏²-Bench Telecom                          Reasoning, Knowledge      92.1
GPQA Diamond                              Knowledge                 89.6
LCR                                       Long-Context Reasoning    70.7
IFBench                                   Agent                     53.1
Artificial Analysis Intelligence Index    Knowledge                 53
SciCode                                   Reasoning, Knowledge      51.9
Artificial Analysis Coding Index          Coding                    48.1
Terminal-Bench Hard                       Agent, Coding             46.2
HLE                                       Knowledge, Multi-Modal    36.7
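
The Avg Score shown above is consistent with a plain unweighted mean of the nine listed scores, rounded to the nearest integer. The page does not state how the figure is computed, so treat the following as a sketch under that assumption; the names `scores` and `mean_score` are illustrative and do not come from the source.

```python
# Scores copied from the Benchmark Performance listing.
scores = {
    "𝜏²-Bench Telecom": 92.1,
    "GPQA Diamond": 89.6,
    "LCR": 70.7,
    "IFBench": 53.1,
    "Artificial Analysis Intelligence Index": 53.0,
    "SciCode": 51.9,
    "Artificial Analysis Coding Index": 48.1,
    "Terminal-Bench Hard": 46.2,
    "HLE": 36.7,
}


def mean_score(values):
    """Unweighted arithmetic mean (assumed; the page may weight differently)."""
    values = list(values)
    if not values:
        raise ValueError("need at least one score")
    return sum(values) / len(values)


average = mean_score(scores.values())
print(f"{len(scores)} benchmarks, mean = {average:.2f}, rounded = {round(average)}")
```

Under this assumption the mean comes out just above 60, which matches the displayed Avg Score once rounded. If the site weights benchmarks or categories differently, the agreement would be coincidental.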