Alibaba

Qwen2.5 Coder Instruct 32B

Unknown Size

By Alibaba • Released 2024-11-11

Capability Radar

Avg Score
33

Across all benchmarks

Participated
8
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
76.7
MMLU-Pro
Knowledge
63.5
GPQA Diamond
Knowledge
41.7
LiveCodeBench
Coding
29.5
SciCode
Reasoning Knowledge
27.1
Artificial Analysis Intelligence Index
Knowledge
12.9
SWE-bench (Bash Only)
Coding Agent
9
HLE
Knowledge Multi-Modal
3.8