Winzheng Perspective: The More Useful the Model, the More It Needs Brakes
Data from WDCD Run #105 reveals a critical contradiction in the Agent era: the more capable a model becomes, the harder the consequences of its errors are to reverse. The report uses extreme samples such as Q239, Q223, and Q237 to quantify how even top models fail to respect constraints when acting as agents.