GPT-5.5 Tops WDCD with 89.17 Points as GPT-o3 Slumps to Last Place at 70.83
The first WDCD Compliance Test results are out: GPT-5.5 leads with 89.17 points, while GPT-o3 finishes last with 70.83 points. The 18.34-point gap between them cuts directly against the myth that "older models are more stable."