
Llama 2 Chat 13B

Unknown Size

By Meta • Released 2023-07-18

Capability Radar

Avg Score: 20 (across all benchmarks)

Participated: 7 benchmarks

Benchmark Performance

Benchmark                                 Category                 Score
MMLU-Pro                                  Knowledge                 40.6
MATH-500                                  Reasoning                 32.9
GPQA Diamond                              Knowledge                 32.1
SciCode                                   Reasoning, Knowledge      11.8
LiveCodeBench                             Coding                     9.8
Artificial Analysis Intelligence Index    Knowledge                  8.4
HLE                                       Knowledge, Multi-Modal     4.7
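
The Avg Score of 20 is consistent with an unweighted mean of the seven listed scores. The page does not state how the average is computed, so equal weighting is an assumption here, and the helper name `unweighted_mean` is hypothetical. A minimal sketch of that check:

```python
# Scores copied from the Benchmark Performance listing on this page.
scores = {
    "MMLU-Pro": 40.6,
    "MATH-500": 32.9,
    "GPQA Diamond": 32.1,
    "SciCode": 11.8,
    "LiveCodeBench": 9.8,
    "Artificial Analysis Intelligence Index": 8.4,
    "HLE": 4.7,
}


def unweighted_mean(values):
    """Arithmetic mean with every benchmark weighted equally (an assumption)."""
    values = list(values)
    return sum(values) / len(values)


mean_score = unweighted_mean(scores.values())
print(f"{len(scores)} benchmarks, mean = {mean_score:.2f}")
# prints: 7 benchmarks, mean = 20.04
```

Rounded to the nearest integer, the mean matches the reported Avg Score of 20. A weighted or category-level average would generally give a different figure.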