Alibaba

Qwen3 235B A22B (Non-reasoning)

Unknown Size

By Alibaba • Released 2025-04-28

[Figure: Capability Radar chart]

Avg Score: 32 (across all benchmarks)
Benchmarks participated: 13

Benchmark Performance

Benchmark                                 Category                 Score
MATH-500                                  Reasoning                90.2
MMLU-Pro                                  Knowledge                76.2
GPQA Diamond                              Knowledge                61.3
IFBench                                   Agent                    36.6
LiveCodeBench                             Coding                   34.3
SciCode                                   Reasoning, Knowledge     29.9
𝜏²-Bench Telecom                          Reasoning, Knowledge     27.2
AIME 2025                                 Reasoning                23.7
Artificial Analysis Intelligence Index    Knowledge                16.9
Artificial Analysis Coding Index          Coding                   14
Terminal-Bench Hard                       Agent, Coding            6.1
HLE                                       Knowledge, Multi-Modal   4.7
LCR                                       Long-Context Reasoning   0
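
The Avg Score above can be checked against the individual results. The sketch below assumes the figure is a plain unweighted mean of the 13 listed scores; the page does not state how it is computed, so that is an assumption, though the result is consistent with the reported value of 32. The `scores` dictionary and variable names are illustrative only.

```python
# Scores copied from the Benchmark Performance table.
scores = {
    "MATH-500": 90.2,
    "MMLU-Pro": 76.2,
    "GPQA Diamond": 61.3,
    "IFBench": 36.6,
    "LiveCodeBench": 34.3,
    "SciCode": 29.9,
    "tau2-Bench Telecom": 27.2,
    "AIME 2025": 23.7,
    "Artificial Analysis Intelligence Index": 16.9,
    "Artificial Analysis Coding Index": 14.0,
    "Terminal-Bench Hard": 6.1,
    "HLE": 4.7,
    "LCR": 0.0,
}

# Assumption: Avg Score is the unweighted arithmetic mean of all listed scores.
mean_score = sum(scores.values()) / len(scores)

print(f"benchmarks: {len(scores)}")
print(f"mean score: {mean_score:.2f} (rounded: {round(mean_score)})")
```

Note that this mean mixes individual benchmarks with two composite indices (the Artificial Analysis Intelligence and Coding indices), so it is best read as a rough summary rather than a calibrated measure.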