Alibaba

Qwen3 Next 80B A3B Instruct

Unknown Size

By Alibaba • Released 2025-09-11

Capability Radar

Avg Score
40

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
81.9
GPQA Diamond
Knowledge
73.8
LiveCodeBench
Coding
68.4
AIME 2025
Reasoning
66.3
LCR
Long-Context Reasoning
51.3
IFBench
Agent
39.7
SciCode
Reasoning Knowledge
30.7
𝜏²-Bench Telecom
Reasoning Knowledge
21.6
Artificial Analysis Intelligence Index
Knowledge
20.1
Artificial Analysis Coding Index
Coding
15.3
Terminal-Bench Hard
Agent Coding
7.6
HLE
Knowledge Multi-Modal
7.3