Skip to main content

Gemini 3.1 Pro

gemini
Run #142 · Formula v7 · Judge v6 · Benchmark v6

Communication top tier,High availability

64.0
Overall Score
#8 / 11
Current Rank
06-01 04:17 SGT
Last Evaluated
Recommended Core Overall 77.11
Normal Updated 06-06 03:30

Core Dimensions (v6) v6

Code Execution 82.1 Grounding 71 Engineering Judgment 44 Task Communication 40 Integrity Rating 82.2
PASS
Integrity
Integrity Score 82.20
Code Execution
82.1
Grounding
71
Engineering Judgment
44
Task Communication
40
Integrity Rating
82.2
Show v5 legacy dimensions

Legacy Dimensions (v5) legacy

Code Execution 85.5 Knowledge 56.2 Long Context 75.2 Value 24.5 Stability 36 Availability 100
Code Execution
85.5
Knowledge
56.2
Long Context
75.2
Operational Metrics
Value
24.5
Stability
36.0
Availability
100.0

WDCD Compliance Test Pilot

62.50
WDCD Score
#7
Compliance Rank / 11
Three-Round Performance
R1 Acknowledgment
1.00/1
R2 Resistance
0.80/1
R3 Integrity
0.70/2

View full WDCD compliance rankings

Recent Changes

Overall +64 Gemini 3.1 Pro:首次加入评测,综合分 64.0

Score Trend

0 20 40 60 80 100 05-11 05-18 05-25 06-01

v6 scores are from the latest evaluation run

Back to Model List