Alibaba

Qwen3 VL 30B A3B Instruct

Unknown Size

By Alibaba • Released 2025-10-03

Capability Radar

Avg Score
35

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
76.4
AIME 2025
Reasoning
72.3
GPQA Diamond
Knowledge
69.5
LiveCodeBench
Coding
47.6
IFBench
Agent
33.1
SciCode
Reasoning Knowledge
30.8
LCR
Long-Context Reasoning
23.7
𝜏²-Bench Telecom
Reasoning Knowledge
19
Artificial Analysis Intelligence Index
Knowledge
16
Artificial Analysis Coding Index
Coding
14.3
HLE
Knowledge Multi-Modal
6.4
Terminal-Bench Hard
Agent Coding
6.1