Qwen3 VL 32B (Reasoning)

Unknown Size

By Alibaba • Released 2025-10-21

Capability Radar

Avg Score: 47 (across all benchmarks)
Benchmarks participated: 12

Benchmark Performance

Benchmark                               Category                Score
AIME 2025                               Reasoning                84.7
MMLU-Pro                                Knowledge                81.8
LiveCodeBench                           Coding                   73.8
GPQA Diamond                            Knowledge                73.3
IFBench                                 Agent                    59.4
LCR                                     Long-Context Reasoning   55.3
𝜏²-Bench Telecom                        Reasoning, Knowledge     45.6
SciCode                                 Reasoning, Knowledge     28.5
Artificial Analysis Intelligence Index  Knowledge                24.5
Artificial Analysis Coding Index        Coding                   14.5
HLE                                     Knowledge, Multi-Modal    9.6
Terminal-Bench Hard                     Agent, Coding             7.6
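
The Avg Score of 47 is consistent with an unweighted arithmetic mean of the 12 benchmark scores listed above. The page does not state how the average is computed, so the unweighted mean here is an assumption. The sketch below is a minimal check of that arithmetic; the variable names `scores` and `average` are illustrative only.

```python
from statistics import mean

# Scores copied from the Benchmark Performance table on this page.
scores = {
    "AIME 2025": 84.7,
    "MMLU-Pro": 81.8,
    "LiveCodeBench": 73.8,
    "GPQA Diamond": 73.3,
    "IFBench": 59.4,
    "LCR": 55.3,
    "𝜏²-Bench Telecom": 45.6,
    "SciCode": 28.5,
    "Artificial Analysis Intelligence Index": 24.5,
    "Artificial Analysis Coding Index": 14.5,
    "HLE": 9.6,
    "Terminal-Bench Hard": 7.6,
}

# Assumption: the site reports an unweighted mean rounded to an integer.
average = mean(scores.values())

print(f"benchmarks: {len(scores)}")
print(f"mean score: {average:.2f}")
print(f"rounded:    {round(average)}")  # 558.6 / 12 = 46.55, which rounds to 47
```

Because the two Artificial Analysis indices are themselves composite scores, a weighted or per-category average would give a different figure; the match with 47 only shows that the simple mean is a plausible reading.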