Alibaba

Qwen3 VL 32B Instruct

Unknown Size

By Alibaba • Released 2025-10-21

Capability Radar

Avg Score
37

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
79.1
AIME 2025
Reasoning
68.3
GPQA Diamond
Knowledge
67.1
LiveCodeBench
Coding
51.4
IFBench
Agent
39.2
LCR
Long-Context Reasoning
31.3
SciCode
Reasoning Knowledge
30.1
𝜏²-Bench Telecom
Reasoning Knowledge
29.2
Artificial Analysis Intelligence Index
Knowledge
17.2
Artificial Analysis Coding Index
Coding
15.6
Terminal-Bench Hard
Agent Coding
8.3
HLE
Knowledge Multi-Modal
6.3