Anthropic

Claude Opus 4.6 (Non-reasoning)

Unknown Size

By Anthropic • Released 2026-02-05

Capability Radar

Avg Score
53

Across all benchmarks

Participated
9
Benchmarks

Benchmark Performance

Benchmark Category Score
𝜏²-Bench Telecom
Reasoning Knowledge
84.8
GPQA Diamond
Knowledge
84
LCR
Long-Context Reasoning
58.3
Terminal-Bench Hard
Agent Coding
48.5
Artificial Analysis Coding Index
Coding
47.6
Artificial Analysis Intelligence Index
Knowledge
46.4
SciCode
Reasoning Knowledge
45.7
IFBench
Agent
44.6
HLE
Knowledge Multi-Modal
18.6