Google

Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)

Unknown Size

By Google • Released 2025-09-25

Capability Radar

Avg Score: 43 (across all benchmarks)

Participated: 12 benchmarks

Benchmark Performance

Benchmark                               Category                Score
MMLU-Pro                                Knowledge               83.6
GPQA Diamond                            Knowledge               76.6
LiveCodeBench                           Coding                  62.5
AIME 2025                               Reasoning               56.7
LCR                                     Long-Context Reasoning  56.7
IFBench                                 Agent                   43.5
SciCode                                 Reasoning Knowledge     37.5
𝜏²-Bench Telecom                        Reasoning Knowledge     28.4
Artificial Analysis Intelligence Index  Knowledge               25.5
Artificial Analysis Coding Index        Coding                  22.1
Terminal-Bench Hard                     Agent Coding            14.4
HLE                                     Knowledge Multi-Modal   7.8
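
The page does not say how the average score is computed. As a rough check, the sketch below assumes it is a plain unweighted mean of the 12 listed scores, rounded to the nearest integer; that assumption reproduces the reported 43, but the site's actual method may differ. The variable names are illustrative only.

```python
# Scores copied from the Benchmark Performance table on this page.
scores = {
    "MMLU-Pro": 83.6,
    "GPQA Diamond": 76.6,
    "LiveCodeBench": 62.5,
    "AIME 2025": 56.7,
    "LCR": 56.7,
    "IFBench": 43.5,
    "SciCode": 37.5,
    "𝜏²-Bench Telecom": 28.4,
    "Artificial Analysis Intelligence Index": 25.5,
    "Artificial Analysis Coding Index": 22.1,
    "Terminal-Bench Hard": 14.4,
    "HLE": 7.8,
}

# Assumption: the average is an unweighted arithmetic mean over all rows.
count = len(scores)
mean = sum(scores.values()) / count

print(f"benchmarks: {count}")
print(f"mean score: {mean:.2f} (rounded: {round(mean)})")
```

Note that under this assumption the mean mixes individual benchmarks with the two composite Artificial Analysis indices, so it is best read as a summary of this table rather than as an independent measure.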