Baidu

ERNIE 4.5 300B A47B

Unknown Size

By Baidu • Released 2025-06-30

Capability Radar

Avg Score
35

Across all benchmarks

Participated
13
Benchmarks

Benchmark Performance

Benchmark Category Score
MATH-500
Reasoning
93.1
GPQA Diamond
Knowledge
81.1
MMLU-Pro
Knowledge
77.6
LiveCodeBench
Coding
46.7
AIME 2025
Reasoning
41.3
IFBench
Agent
39.1
SciCode
Reasoning Knowledge
31.5
Artificial Analysis Intelligence Index
Knowledge
14.9
Artificial Analysis Coding Index
Coding
14.5
Terminal-Bench Hard
Agent Coding
6.1
HLE
Knowledge Multi-Modal
3.5
LCR
Long-Context Reasoning
2.3
𝜏²-Bench Telecom
Reasoning Knowledge
0