Allen Institute for AI

Olmo 3.1 32B Instruct

Unknown Size

By Allen Institute for AI • Released 2026-01-13

Capability Radar

Avg Score
17

Across all benchmarks

Participated
9
Benchmarks

Benchmark Performance

Benchmark Category Score
GPQA Diamond
Knowledge
53.9
IFBench
Agent
39.2
𝜏²-Bench Telecom
Reasoning Knowledge
21.3
SciCode
Reasoning Knowledge
16.7
Artificial Analysis Intelligence Index
Knowledge
12
Artificial Analysis Coding Index
Coding
5.6
HLE
Knowledge Multi-Modal
4.9
LCR
Long-Context Reasoning
0
Terminal-Bench Hard
Agent Coding
0