Qwen3 235B A22B (Reasoning)

Size: Unknown

By Alibaba • Released 2025-04-28

[Figure: Capability Radar chart]

Avg Score: 42 (across all benchmarks)
Benchmarks participated: 13

Benchmark Performance

| Benchmark | Category | Score |
|---|---|---|
| MATH-500 | Reasoning | 93 |
| MMLU-Pro | Knowledge | 82.8 |
| AIME 2025 | Reasoning | 82 |
| GPQA Diamond | Knowledge | 70 |
| LiveCodeBench | Coding | 62.2 |
| SciCode | Reasoning, Knowledge | 39.9 |
| IFBench | Agent | 38.7 |
| 𝜏²-Bench Telecom | Reasoning, Knowledge | 24 |
| Artificial Analysis Intelligence Index | Knowledge | 19.8 |
| Artificial Analysis Coding Index | Coding | 17.4 |
| HLE | Knowledge, Multi-Modal | 11.7 |
| Terminal-Bench Hard | Agent, Coding | 6.1 |
| LCR | Long-Context Reasoning | 0 |
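
The page does not say how the Avg Score is derived. As a quick sanity check, the sketch below assumes it is an unweighted mean of the 13 scores listed above; that assumption, and the variable names `scores` and `average`, are illustrative and not taken from the source.

```python
from statistics import mean

# Scores copied from the benchmark list above (benchmark name -> score).
scores = {
    "MATH-500": 93,
    "MMLU-Pro": 82.8,
    "AIME 2025": 82,
    "GPQA Diamond": 70,
    "LiveCodeBench": 62.2,
    "SciCode": 39.9,
    "IFBench": 38.7,
    "𝜏²-Bench Telecom": 24,
    "Artificial Analysis Intelligence Index": 19.8,
    "Artificial Analysis Coding Index": 17.4,
    "HLE": 11.7,
    "Terminal-Bench Hard": 6.1,
    "LCR": 0,
}

# Assumption: every benchmark carries equal weight in the average.
average = mean(list(scores.values()))

print(f"{len(scores)} benchmarks, unweighted mean = {average:.2f}")
```

Under that assumption the mean comes to roughly 42.1, which agrees with the reported Avg Score of 42 and the count of 13 benchmarks. Note that the 0 for LCR is counted like any other score here; if it actually marks a missing result rather than a measured one, the average over the remaining 12 would be higher.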