DeepSeek V3.2 (Non-reasoning)

Unknown Size

By DeepSeek • Released 2025-12-01

Capability Radar

Avg Score

52

Across all benchmarks

Participated

13

Benchmarks

Benchmark Performance

Benchmark	Category	Score
MMLU-Pro	Knowledge	83.7
τ-bench	Agent Knowledge	80.4
𝜏²-Bench Telecom	Reasoning Knowledge	78.9
GPQA Diamond	Knowledge	75.1
LiveCodeBench	Coding	59.3
AIME 2025	Reasoning	59
IFBench	Agent	49
LCR	Long-Context Reasoning	39
SciCode	Reasoning Knowledge	38.7
Artificial Analysis Coding Index	Coding	34.6
Terminal-Bench Hard	Agent Coding	32.6
Artificial Analysis Intelligence Index	Knowledge	32.1
HLE	Knowledge Multi-Modal	10.5