WDCD Warning: When Models Treat Hard Constraints as Suggestions, Risk Begins
Data from WDCD Run #105 points to a troubling pattern: large language models often fail to treat hard constraints as binding. In one scenario, 8 of 11 models produced discount plans that fell below the stated "must be ≥ 30% off" threshold, in effect reading "must" as "recommended."
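To make the idea of a hard constraint concrete, here is a minimal Python sketch of the kind of check the discount scenario implies. It is not WDCD's actual scoring code: the `DiscountPlan` class, the function names, and the sample values are all hypothetical, and it assumes the constraint means each plan's discount must be at least 30%.

```python
from dataclasses import dataclass

# Assumed reading of the scenario's wording: "must be >= 30% off".
MIN_DISCOUNT_PERCENT = 30.0


@dataclass
class DiscountPlan:
    """Hypothetical container for one model's proposed plan."""
    model: str
    discount_percent: float  # 25.0 means 25% off the list price


def violates_hard_constraint(plan: DiscountPlan,
                             minimum: float = MIN_DISCOUNT_PERCENT) -> bool:
    """Return True when the plan's discount is below the required minimum."""
    return plan.discount_percent < minimum


def violation_count(plans: list[DiscountPlan]) -> int:
    """Count how many plans break the constraint."""
    return sum(violates_hard_constraint(p) for p in plans)


if __name__ == "__main__":
    # Illustrative values only; these are not figures from Run #105.
    sample = [
        DiscountPlan("model-a", 25.0),
        DiscountPlan("model-b", 30.0),
        DiscountPlan("model-c", 10.0),
    ]
    print(violation_count(sample), "of", len(sample), "plans violate the constraint")
```

The check is deliberately binary: a plan at 29% off fails just as a plan at 10% off does, which is what separates a hard constraint from a preference that can be traded off against other goals.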