Alibaba

Qwen3 VL 235B A22B Instruct

Unknown Size

By Alibaba • Released 2025-09-23

Capability Radar

Avg Score
40

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
82.3
GPQA Diamond
Knowledge
71.2
AIME 2025
Reasoning
70.7
LiveCodeBench
Coding
59.4
IFBench
Agent
42.7
SciCode
Reasoning Knowledge
35.9
𝜏²-Bench Telecom
Reasoning Knowledge
35.1
LCR
Long-Context Reasoning
31.7
Artificial Analysis Intelligence Index
Knowledge
20.6
Artificial Analysis Coding Index
Coding
16.5
Terminal-Bench Hard
Agent Coding
6.8
HLE
Knowledge Multi-Modal
6.3