Llama 3 Instruct 70B

Unknown Size

By Meta • Released 2024-04-18

[Figure: Capability Radar]

Avg Score: 20 (across all benchmarks)

Participated: 12 benchmarks

Benchmark Performance

| Benchmark | Category | Score |
| --- | --- | --- |
| MMLU-Pro | Knowledge | 57.4 |
| MATH-500 | Reasoning | 48.3 |
| GPQA Diamond | Knowledge | 37.9 |
| IFBench | Agent | 37.1 |
| LiveCodeBench | Coding | 19.8 |
| SciCode | Reasoning, Knowledge | 18.9 |
| Artificial Analysis Intelligence Index | Knowledge | 10.2 |
| Artificial Analysis Coding Index | Coding | 6.8 |
| HLE | Knowledge, Multi-Modal | 4.4 |
| Terminal-Bench Hard | Agent, Coding | 0.8 |
| LCR | Long-Context Reasoning | 0 |
| 𝜏²-Bench Telecom | Reasoning, Knowledge | 0 |
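
The page does not say how the Avg Score is derived. As a rough check, the sketch below assumes it is a plain unweighted mean of the twelve scores listed above; the `scores` dictionary and the `mean_score` helper are illustrative names, not part of any published tooling. Under that assumption the mean comes to about 20.1, which is consistent with the reported value of 20.

```python
# Scores copied from the benchmark list above.
scores = {
    "MMLU-Pro": 57.4,
    "MATH-500": 48.3,
    "GPQA Diamond": 37.9,
    "IFBench": 37.1,
    "LiveCodeBench": 19.8,
    "SciCode": 18.9,
    "Artificial Analysis Intelligence Index": 10.2,
    "Artificial Analysis Coding Index": 6.8,
    "HLE": 4.4,
    "Terminal-Bench Hard": 0.8,
    "LCR": 0.0,
    "𝜏²-Bench Telecom": 0.0,
}


def mean_score(values):
    """Unweighted arithmetic mean (an assumption about how Avg Score is computed)."""
    values = list(values)
    return sum(values) / len(values)


avg = mean_score(scores.values())
print(len(scores), round(avg))  # prints: 12 20
```

If the site weights categories or excludes the two composite Artificial Analysis indices, the result would differ, so treat this only as a consistency check on the listed numbers.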