Anthropic

Claude 3 Opus

Unknown Size

By Anthropic • Released 2024-03-04

Capability Radar

Avg Score
34

Across all benchmarks

Participated
8
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
69.6
MATH-500
Reasoning
64.1
GPQA Diamond
Knowledge
48.9
LiveCodeBench
Coding
27.9
SciCode
Reasoning Knowledge
23.3
Artificial Analysis Coding Index
Coding
19.5
Artificial Analysis Intelligence Index
Knowledge
12.5
HLE
Knowledge Multi-Modal
3.1