Alibaba

Qwen3 32B (Non-reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

Capability Radar

Avg Score
34

Across all benchmarks

Participated
10
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
86.9
MMLU-Pro
Knowledge
72.7
GPQA Diamond
Knowledge
53.5
IFBench
Agent
31.5
LiveCodeBench
Coding
28.8
SciCode
Reasoning Knowledge
28
AIME 2025
Reasoning
19.7
Artificial Analysis Intelligence Index
Knowledge
14.5
HLE
Knowledge Multi-Modal
4.3
LCR
Long-Context Reasoning
0