Meta

Llama 3.1 Instruct 70B

Unknown Size

By Meta • Released 2024-07-23

Capability Radar

Avg Score
24

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
67.6
MATH-500
Reasoning
64.9
GPQA Diamond
Knowledge
40.9
IFBench
Agent
34.4
SciCode
Reasoning Knowledge
26.7
LiveCodeBench
Coding
23.2
𝜏²-Bench Telecom
Reasoning Knowledge
15.2
Artificial Analysis Intelligence Index
Knowledge
13.1
Artificial Analysis Coding Index
Coding
10.9
LCR
Long-Context Reasoning
6.3
HLE
Knowledge Multi-Modal
4.6
AIME 2025
Reasoning
4
Terminal-Bench Hard
Agent Coding
3