Alibaba

Qwen3 32B (Reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Capability Radar

Avg Score
39

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
96.1
MMLU-Pro
Knowledge
79.8
AIME 2025
Reasoning
73
GPQA Diamond
Knowledge
66.8
LiveCodeBench
Coding
54.6
IFBench
Agent
36.3
SciCode
Reasoning Knowledge
35.4
𝜏²-Bench Telecom
Reasoning Knowledge
29.8
Artificial Analysis Intelligence Index
Knowledge
16.5
Artificial Analysis Coding Index
Coding
13.8
HLE
Knowledge Multi-Modal
8.3
Terminal-Bench Hard
Agent Coding
3
LCR
Long-Context Reasoning
0