DeepSeek

DeepSeek V3.2 Exp (Non-reasoning)

Unknown Size

By DeepSeek • Released 2025-09-29

Capability Radar

Avg Score
44

Across all benchmarks

Participated
12
Benchmarks

Benchmark Performance

Benchmark Category Score
MMLU-Pro
Knowledge
83.6
GPQA Diamond
Knowledge
73.8
AIME 2025
Reasoning
57.7
LiveCodeBench
Coding
55.4
IFBench
Agent
43.1
LCR
Long-Context Reasoning
43
SciCode
Reasoning Knowledge
39.9
𝜏²-Bench Telecom
Reasoning Knowledge
33.9
Artificial Analysis Coding Index
Coding
30
Artificial Analysis Intelligence Index
Knowledge
28.3
Terminal-Bench Hard
Agent Coding
25
HLE
Knowledge Multi-Modal
8.6