Anthropic

Claude 3.5 Sonnet (June '24)

Unknown Size

By Anthropic • Released 2024-06-21

Capability Radar

Avg Score
39

Across all benchmarks

Participated
7
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
75.1
MATH-500
Reasoning
69.5
GPQA Diamond
Knowledge
56
SciCode
Reasoning Knowledge
31.6
Artificial Analysis Coding Index
Coding
26
Artificial Analysis Intelligence Index
Knowledge
14.2
HLE
Knowledge Multi-Modal
3.7