Alibaba

Qwen3 Next 80B A3B (Reasoning)

Unknown Size

By Alibaba • Released 2025-09-11

Capability Radar

Avg Score: 49 (across all benchmarks)

Benchmarks participated: 12

Benchmark Performance

Benchmark                               Category                Score
AIME 2025                               Reasoning               84.3
MMLU-Pro                                Knowledge               82.4
LiveCodeBench                           Coding                  78.4
GPQA Diamond                            Knowledge               75.9
IFBench                                 Agent                   60.7
LCR                                     Long-Context Reasoning  60.3
τ²-Bench Telecom                        Reasoning, Knowledge    41.5
SciCode                                 Reasoning, Knowledge    38.8
Artificial Analysis Intelligence Index  Knowledge               26.5
Artificial Analysis Coding Index        Coding                  19.5
HLE                                     Knowledge, Multi-Modal  11.7
Terminal-Bench Hard                     Agent, Coding            9.8
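
The Avg Score of 49 appears consistent with a plain unweighted mean of the twelve scores listed above. The page does not say how the average is computed, so the sketch below assumes an arithmetic mean; the `scores` dictionary and `mean_score` helper are illustrative names, not part of the source.

```python
# Scores copied from the benchmark listing above.
scores = {
    "AIME 2025": 84.3,
    "MMLU-Pro": 82.4,
    "LiveCodeBench": 78.4,
    "GPQA Diamond": 75.9,
    "IFBench": 60.7,
    "LCR": 60.3,
    "τ²-Bench Telecom": 41.5,
    "SciCode": 38.8,
    "Artificial Analysis Intelligence Index": 26.5,
    "Artificial Analysis Coding Index": 19.5,
    "HLE": 11.7,
    "Terminal-Bench Hard": 9.8,
}


def mean_score(values):
    """Unweighted arithmetic mean (assumed; the page does not state the method)."""
    values = list(values)
    return sum(values) / len(values)


average = mean_score(scores.values())
print(f"{len(scores)} benchmarks, mean = {average:.2f}")  # prints "12 benchmarks, mean = 49.15"
```

Under this assumption the mean rounds to the 49 shown on the page. Note that the two Artificial Analysis indices are themselves composite scores, so an unweighted mean over all twelve rows mixes individual benchmarks with aggregates.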