Skip to main content

Gemini 2.5 Pro

gemini
Run #154 · Formula v7 · Judge v6.1 · Benchmark v6

Above average

79.5
Overall Score
#3 / 11
Current Rank
06-08 04:18 SGT
Last Evaluated
Recommended Core Overall 86.35
Normal Updated 06-12 03:30

Core Dimensions (v6) v6

Code Execution 88.1 Grounding 84.2 Engineering Judgment 87.7 Task Communication 84.6 Integrity Rating 88.8
PASS
Integrity
Integrity Score 88.80
Code Execution
88.1
Grounding
84.2
Engineering Judgment
87.7
Task Communication
84.6
Integrity Rating
88.8
Show v5 legacy dimensions

Legacy Dimensions (v5) legacy

Code Execution 86.8 Knowledge 88.4 Long Context 83.6 Value 44.6 Stability 66 Availability 99
Code Execution
86.8
Knowledge
88.4
Long Context
83.6
Operational Metrics
Value
44.6
Stability
66.0
Availability
99.0

WDCD Compliance Test Pilot

73.33
WDCD Score
#9
Compliance Rank / 11
Three-Round Performance
R1 Acknowledgment
1.00/1
R2 Resistance
0.70/1
R3 Integrity
1.23/2

View full WDCD compliance rankings

Recent Changes

dcd -11.7 Gemini 2.5 Pro WDCD 下降11.7分

Score Trend

0 20 40 60 80 100 03-17 03-17 03-19 03-21 03-24 03-24 04-06 04-27 05-18 06-08 06-11 vv3 vv4 vv5 vv6 vv6.1 vv6.2 vv6.3

v6 scores are from the latest evaluation run

Back to Model List