11 Models See Collective Plunge in Code Execution Scores, GPT-5.5 Leads Smoke Lightweight List with 95.24 Points
In the YZ Index Smoke lightweight evaluation for June 14, 2026, GPT-5.5 topped the main list with 95.24 points (Code Execution 96, Material Constraint 94.3 [pass]), achieving over 90 points in both dimensions for the most balanced high-score structure.