Skip to main content

豆包 Pro

doubao
Run #87 · Formula v7 · Judge v6 · Benchmark v6

Code Execution leader,Communication top tier,Most stable

78.3
Overall Score
#1 / 11
Current Rank
04-27 04:18 SGT
Last Evaluated
Recommended Core Overall 86.44
Normal Updated 04-04 03:30

Core Dimensions (v6) v6

Code Execution 92.2 Grounding 79.4 Engineering Judgment 46.3 Task Communication 40 Integrity Rating 77.5
PASS
Integrity
Integrity Score 77.50
Code Execution
92.2
Grounding
79.4
Engineering Judgment
46.3
Task Communication
40
Integrity Rating
77.5
Show v5 legacy dimensions

Legacy Dimensions (v5) legacy

Code Execution 96.1 Knowledge 54.7 Long Context 85 Value 93.3 Stability 38.8 Availability 100
Code Execution
96.1
Knowledge
54.7
Long Context
85.0
Operational Metrics
Value
93.3
Stability
38.8
Availability
100.0

Recent Changes

communication_raw +10 豆包 Pro:任务表达 +10

Score Trend

0 20 40 60 80 100 03-21 03-21 03-21 03-22 03-22 03-24 03-24 03-24 03-24 03-25 03-30 04-06 04-13 04-20 04-27 vv6

v6 scores are from the latest evaluation run

Back to Model List