Kimi

Kimi K2.5 (Non-reasoning)

Unknown Size

By Kimi • Released 2026-01-27

Capability Radar

Avg Score
44

Across all benchmarks

Participated
9
Benchmarks

Benchmark Performance

Benchmark Category Score
𝜏²-Bench Telecom
Reasoning Knowledge
81.3
GPQA Diamond
Knowledge
78.9
LCR
Long-Context Reasoning
59
IFBench
Agent
43.7
SciCode
Reasoning Knowledge
39.6
Artificial Analysis Intelligence Index
Knowledge
37.2
Artificial Analysis Coding Index
Coding
25.8
Terminal-Bench Hard
Agent Coding
18.9
HLE
Knowledge Multi-Modal
12.3