MBZUAI Institute of Foundation Models

K2 Think V2

Unknown Size

By MBZUAI Institute of Foundation Models • Released 2025-12-15

Capability Radar

Avg Score
34

Across all benchmarks

Participated
9
Benchmarks

Benchmark Performance

Benchmark Category Score
GPQA Diamond
Knowledge
71.3
IFBench
Agent
62.8
LCR
Long-Context Reasoning
52.7
SciCode
Reasoning Knowledge
33
𝜏²-Bench Telecom
Reasoning Knowledge
25.4
Artificial Analysis Intelligence Index
Knowledge
24.5
Artificial Analysis Coding Index
Coding
15.5
HLE
Knowledge Multi-Modal
9.5
Terminal-Bench Hard
Agent Coding
6.8