豆包 Pro

doubao
Run #154 · Formula v7 · Judge v6.1 · Benchmark v6

Code Execution leader, Most stable, High availability

89.7
Overall Score
#1 / 11
Current Rank
06-08 04:18 SGT
Last Evaluated
Recommended · Core Overall 88.75
Normal · Updated 06-13 03:30

Core Dimensions (v6)

PASS
Integrity
Integrity Score 92.20
Code Execution
94.6
Grounding
81.6
Engineering Judgment
88.8
Task Communication
84.1
Integrity Rating
92.2
Legacy Dimensions (v5)

Code Execution
93.9
Knowledge
90.1
Long Context
80.9
Operational Metrics
Value
96.2
Stability
71.2
Availability
100.0

WDCD Compliance Test Pilot

75.00
WDCD Score
#8
Compliance Rank / 11
Three-Round Performance
R1 Acknowledgment
0.70/1
R2 Resistance
0.83/1
R3 Integrity
1.47/2
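
The three round scores above appear consistent with the 75.00 WDCD Score if that score is the points earned as a percentage of the points available. The page does not state the formula, so the sketch below is an assumption; the function name `wdcd_percent` and the `rounds` mapping are hypothetical.

```python
# Assumption: WDCD Score = 100 * (points earned) / (points available).
# The page does not confirm this formula; the numbers are copied from
# the Three-Round Performance panel.
rounds = {
    "R1 Acknowledgment": (0.70, 1),
    "R2 Resistance": (0.83, 1),
    "R3 Integrity": (1.47, 2),
}


def wdcd_percent(rounds):
    """Return earned points as a percentage of the maximum, rounded to 2 places."""
    earned = sum(score for score, _ in rounds.values())
    possible = sum(maximum for _, maximum in rounds.values())
    return round(100 * earned / possible, 2)


print(wdcd_percent(rounds))
```

Under this assumption, 0.70 + 0.83 + 1.47 = 3.00 of 4 possible points, which matches the displayed 75.00.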

Recent Changes

dcd -6.7 豆包 Pro WDCD score dropped by 6.7 points

Score Trend

[Chart: score on a 0 to 100 axis by evaluation date, 03-21 through 06-13, with version markers v6, v6.1, v6.2, v6.3]

v6 scores are from the latest evaluation run
