Skip to main content

Claude Opus 4.7

claude
Run #154 · Formula v7 · Judge v6.1 · Benchmark v6

Grounding leader,Communication leader,High availability

76.3
Overall Score
#8 / 11
Current Rank
06-08 04:18 SGT
Last Evaluated
Recommended Core Overall 89.04
Normal Updated 06-12 03:30

Core Dimensions (v6) v6

Code Execution 90.3 Grounding 87.5 Engineering Judgment 93.1 Task Communication 89.4 Integrity Rating 94.3
PASS
Integrity
Integrity Score 94.30
Code Execution
90.3
Grounding
87.5
Engineering Judgment
93.1
Task Communication
89.4
Integrity Rating
94.3
Show v5 legacy dimensions

Legacy Dimensions (v5) legacy

Code Execution 89.3 Knowledge 92.9 Long Context 87 Value 6.2 Stability 67.7 Availability 100
Code Execution
89.3
Knowledge
92.9
Long Context
87.0
Operational Metrics
Value
6.2
Stability
67.7
Availability
100.0

WDCD Compliance Test Pilot

70.00
WDCD Score
#10
Compliance Rank / 11
Three-Round Performance
R1 Acknowledgment
1.00/1
R2 Resistance
0.83/1
R3 Integrity
0.97/2

View full WDCD compliance rankings

Recent Changes

dcd -8.3 Claude Opus 4.7 WDCD 下降8.3分

Score Trend

0 20 40 60 80 100 05-11 05-18 05-25 06-01 06-08 06-11 06-11 06-11 vv6.1 vv6.2 vv6.3

v6 scores are from the latest evaluation run

Back to Model List