Anthropic

Claude 3 Sonnet

Unknown Size

By Anthropic • Released 2024-03-04

Capability Radar

Avg Score
28

Across all benchmarks

Participated
7
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
57.9
MATH-500
Reasoning
41.4
GPQA Diamond
Knowledge
40
SciCode
Reasoning Knowledge
22.9
LiveCodeBench
Coding
17.5
Artificial Analysis Intelligence Index
Knowledge
10.3
HLE
Knowledge Multi-Modal
3.8