Skip to main content

Gemini 2.5 Pro

gemini
Run #87 · Formula v7 · Judge v6 · Benchmark v6

Judgment leader,Communication top tier,High availability

69.7
Overall Score
#5 / 11
Current Rank
04-27 04:18 SGT
Last Evaluated
Recommended Core Overall 84.32
Normal Updated 04-04 03:30

Core Dimensions (v6) v6

Code Execution 89.4 Grounding 78.1 Engineering Judgment 47.2 Task Communication 40 Integrity Rating 80.8
PASS
Integrity
Integrity Score 80.80
Code Execution
89.4
Grounding
78.1
Engineering Judgment
47.2
Task Communication
40
Integrity Rating
80.8
Show v5 legacy dimensions

Legacy Dimensions (v5) legacy

Code Execution 96.1 Knowledge 53.8 Long Context 83.4 Value 39.3 Stability 37.7 Availability 100
Code Execution
96.1
Knowledge
53.8
Long Context
83.4
Operational Metrics
Value
39.3
Stability
37.7
Availability
100.0

Recent Changes

communication_raw +10 Gemini 2.5 Pro:任务表达 +10

Score Trend

0 20 40 60 80 100 03-17 03-17 03-17 03-19 03-21 03-22 03-24 03-24 03-25 04-06 04-20 04-27 vv3 vv4 vv5 vv6

v6 scores are from the latest evaluation run

Back to Model List