Alibaba

Qwen3 VL 30B A3B (Reasoning)

Unknown Size

By Alibaba • Released 2025-10-03

Capability Radar

Avg Score
40

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
AIME 2025
Reasoning
82.3
MMLU-Pro
Knowledge
80.7
GPQA Diamond
Knowledge
72
LiveCodeBench
Coding
69.7
IFBench
Agent
45.1
LCR
Long-Context Reasoning
40.7
SciCode
Reasoning Knowledge
28.8
𝜏²-Bench Telecom
Reasoning Knowledge
19.9
Artificial Analysis Intelligence Index
Knowledge
19.6
Artificial Analysis Coding Index
Coding
13.1
HLE
Knowledge Multi-Modal
8.7
Terminal-Bench Hard
Agent Coding
5.3