AI Compliance First Round Test: Qwen3-Max Wins, Who Collapses Easiest Under Pressure Among 11 Major Models?
The first round of WDCD testing by YZ Index reveals Qwen3-Max leading with 66.67 points, while many major models quickly collapse under stress. The average score is only 60.53, highlighting widespread compliance flaws in current AI systems.