
豆包 Pro

doubao
Run #142 · Formula v7 · Judge v6 · Benchmark v6

Communication top tier, High availability

74.8
Overall Score
#1 / 11
Current Rank
06-01 04:17 SGT
Last Evaluated
Recommended Core Overall 78.76
Normal Updated 06-06 03:30

Core Dimensions (v6)

PASS
Integrity
Integrity Score 82.20
Code Execution
87.8
Grounding
67.7
Engineering Judgment
41.2
Task Communication
40
Integrity Rating
82.2

Legacy Dimensions (v5)

Code Execution
89.1
Knowledge
56.3
Long Context
73.2
Operational Metrics
Value
91.8
Stability
37.4
Availability
100.0

WDCD Compliance Test Pilot

62.50
WDCD Score
#6
Compliance Rank / 11
Three-Round Performance
R1 Acknowledgment
0.80/1
R2 Resistance
0.90/1
R3 Integrity
0.80/2
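
The three round scores above add up to 2.50 of a possible 4 points, which is 62.50% and matches the WDCD Score shown. The sketch below reproduces that arithmetic. The function name `wdcd_score` and the unweighted sum-over-maximum formula are assumptions made for illustration. The page does not state how the score is actually computed.

```python
def wdcd_score(rounds):
    """Return earned points as a percentage of possible points.

    `rounds` is a list of (earned, maximum) pairs, one per round.
    Assumption: the score is a plain sum of earned points divided by
    the sum of maximum points. This formula is not confirmed by the page.
    """
    earned = sum(e for e, _ in rounds)
    possible = sum(m for _, m in rounds)
    if possible <= 0:
        raise ValueError("total maximum points must be positive")
    return round(100.0 * earned / possible, 2)


# Values copied from the Three-Round Performance block.
doubao_rounds = [
    (0.80, 1),  # R1 Acknowledgment
    (0.90, 1),  # R2 Resistance
    (0.80, 2),  # R3 Integrity
]

print(wdcd_score(doubao_rounds))
```

If the displayed score ever stops matching this calculation, the rounds are probably weighted differently from what is assumed here.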


Recent Changes

communication_raw +10 (豆包 Pro: Task Communication +10)

Score Trend

[Chart: score trend. Y-axis: score, 0 to 100. X-axis: evaluation dates 03-21, 03-21, 03-22, 03-24, 03-24, 03-30, 04-13, 04-27, 05-11, 05-25, 06-01 (v6).]

v6 scores are from the latest evaluation run
