
DeepSeek V4 Pro

DeepSeek
Run #180 · Formula v7 · Judge v6.3 · Benchmark v7

Communication top tier

Overall Score: 83.0
Current Rank: #2 / 11
Last Evaluated: 06-15 09:25 SGT
Recommended Core Overall 91.98
Normal Updated 06-21 03:30

Core Dimensions (v6)

Integrity: PASS (Integrity Score 83.30)
Code Execution: 87.7
Grounding: 97.2
Engineering Judgment: 95.3
Task Communication: 99.7
Integrity Rating: 83.3

Legacy Dimensions (v5)

Code Execution: 86.9
Knowledge: 92.8
Long Context: 97.2

Operational Metrics
Value: 50.3
Stability: 60.6
Availability: 99.0

WDCD Compliance Test (Pilot)

WDCD Score: 87.50
Compliance Rank: #3 / 11

Three-Round Performance
R1 Acknowledgment: 1.00/1
R2 Resistance: 0.80/1
R3 Integrity: 1.70/2
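
The three round scores above are consistent with the reported WDCD Score if the score is simply total points earned divided by total points available, expressed as a percentage. The page does not state this formula, so the sketch below is an assumption that happens to reproduce 87.50; the function name `wdcd_percent` is hypothetical.

```python
# Round results as (points earned, points available), copied from the listing above.
rounds = {
    "R1 Acknowledgment": (1.00, 1),
    "R2 Resistance": (0.80, 1),
    "R3 Integrity": (1.70, 2),
}


def wdcd_percent(rounds):
    """Assumed aggregation: earned points over available points, as a percentage."""
    earned = sum(points for points, _ in rounds.values())
    available = sum(maximum for _, maximum in rounds.values())
    return round(100 * earned / available, 2)


print(f"{wdcd_percent(rounds):.2f}")
```

Under this assumption, R3 Integrity carries twice the weight of either other round because it has twice the available points.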


Recent Changes

dcd +15.6: DeepSeek V4 Pro WDCD rose by 15.6 points

Score Trend

Not enough data for trend chart (need 3+ runs)