Claude Opus 4.7 and GPT-5.5 Tie for First on Smoke Leaderboard; Material Constraint Becomes the Biggest Differentiator

In today's lightweight evaluation by Smoke, Claude Opus 4.7 and GPT-5.5 tied for first on the main leaderboard with 92.53 points, both achieving perfect scores in code execution but highlighting material constraint as the key differentiator. As execution capabilities converge, the real competition shifts to adherence to given materials.

Claude Opus 4.7 GPT-5.5 Material Constraints
240