DeepSeek V3.2 (Non-reasoning)

Unknown Size

By DeepSeek • Released 2025-12-01

Capability Radar

Avg Score: 52 (across all benchmarks)

Participated: 13 benchmarks

Benchmark Performance

Benchmark                               Category                Score
MMLU-Pro                                Knowledge               83.7
τ-bench                                 Agent Knowledge         80.4
τ²-Bench Telecom                        Reasoning Knowledge     78.9
GPQA Diamond                            Knowledge               75.1
LiveCodeBench                           Coding                  59.3
AIME 2025                               Reasoning               59
IFBench                                 Agent                   49
LCR                                     Long-Context Reasoning  39
SciCode                                 Reasoning Knowledge     38.7
Artificial Analysis Coding Index        Coding                  34.6
Terminal-Bench Hard                     Agent Coding            32.6
Artificial Analysis Intelligence Index  Knowledge               32.1
HLE                                     Knowledge Multi-Modal   10.5
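
The page does not state how the Avg Score is derived. As a rough check, the sketch below assumes it is the unweighted arithmetic mean of the 13 listed scores (an assumption, not something the page confirms); under that assumption the mean rounds to the reported 52. The names `scores` and `average_score` are illustrative only.

```python
# Scores copied from the Benchmark Performance listing on this page.
scores = {
    "MMLU-Pro": 83.7,
    "τ-bench": 80.4,
    "τ²-Bench Telecom": 78.9,
    "GPQA Diamond": 75.1,
    "LiveCodeBench": 59.3,
    "AIME 2025": 59,
    "IFBench": 49,
    "LCR": 39,
    "SciCode": 38.7,
    "Artificial Analysis Coding Index": 34.6,
    "Terminal-Bench Hard": 32.6,
    "Artificial Analysis Intelligence Index": 32.1,
    "HLE": 10.5,
}


def average_score(values):
    """Unweighted arithmetic mean (assumed aggregation, not confirmed by the page)."""
    values = list(values)
    return sum(values) / len(values)


mean = average_score(scores.values())
print(f"benchmarks: {len(scores)}")
print(f"mean: {mean:.2f}")
print(f"rounded: {round(mean)}")
```

If the site weights benchmarks or excludes the two composite indices, the result would differ, so this is only a consistency check against the displayed figures.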