Skip to main content

Qwen3 Max

qwen
Run #180 · Formula v7 · Judge v6.3 · Benchmark v7

High availability

80.2
Overall Score
#3 / 11
Current Rank
06-15 09:25 SGT
Last Evaluated
Recommended Core Overall 93.13
Normal Updated 06-19 03:30

Core Dimensions (v6) v6

Code Execution 92 Grounding 94.5 Engineering Judgment 70.7 Task Communication 80.9 Integrity Rating 81.7
PASS
Integrity
Integrity Score 81.70
Code Execution
92
Grounding
94.5
Engineering Judgment
70.7
Task Communication
80.9
Integrity Rating
81.7
Show v5 legacy dimensions

Legacy Dimensions (v5) legacy

Code Execution 91.2 Knowledge 77.7 Long Context 94.5 Value 57.7 Stability 51 Availability 100
Code Execution
91.2
Knowledge
77.7
Long Context
94.5
Operational Metrics
Value
57.7
Stability
51.0
Availability
100.0

WDCD Compliance Test Pilot

92.50
WDCD Score
#1
Compliance Rank / 11
Three-Round Performance
R1 Acknowledgment
1.00/1
R2 Resistance
0.80/1
R3 Integrity
1.90/2

View full WDCD compliance rankings

Recent Changes

dcd +17.2 Qwen3 Max WDCD 上升17.2分

Score Trend

Not enough data for trend chart (need 3+ runs)
Back to Model List