DeepSeek

DeepSeek V3 0324

Unknown Size

By DeepSeek • Released 2025-03-25

Capability Radar

Avg Score
42

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
94.2
MMLU-Pro
Knowledge
81.9
GPQA Diamond
Knowledge
65.5
𝜏²-Bench Telecom
Reasoning Knowledge
47.1
AIME 2025
Reasoning
41
IFBench
Agent
41
LCR
Long-Context Reasoning
41
LiveCodeBench
Coding
40.5
SciCode
Reasoning Knowledge
35.8
Artificial Analysis Coding Index
Coding
22
Artificial Analysis Intelligence Index
Knowledge
21.8
Terminal-Bench Hard
Agent Coding
15.2
HLE
Knowledge Multi-Modal
5.2