Alibaba

Qwen2.5 Instruct 32B

Unknown Size

By Alibaba • Released 2024-09-19

Capability Radar

Avg Score
37

Across all benchmarks

Participated
7
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
80.5
MMLU-Pro
Knowledge
69.7
GPQA Diamond
Knowledge
46.6
LiveCodeBench
Coding
24.8
SciCode
Reasoning Knowledge
22.9
Artificial Analysis Intelligence Index
Knowledge
13.2
HLE
Knowledge Multi-Modal
3.8