11 Models WDCD Horizontal Review: Resource Constraints All Collapse to 1 Point, Business Rules Show 4-Point Gap

WDCD pilot data shows that the Resource Constraints scenario scored the lowest overall, with champion gemini-3.1-pro only getting 2.5 points and doubao-pro at the bottom with 1 point; the Business Rules scenario became the biggest differentiator, with gemini-2.5-pro and gpt-o3 both scoring a full 4 points, while claude-opus-4.7 scored only 2 points.

WDCD Compliance Test 模型选型
247