11 Models Tested on Bracket Matching: 7 Full Scores, 4 Zero Scores
In a bracket-matching debugging test, 7 of 11 mainstream models earned full scores and 4 scored zero. The critical bug was a bare "return", which returns None instead of a boolean value.
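To illustrate the kind of bug described, here is a minimal sketch in Python. The test's actual code is not shown in the source, so the function names (is_balanced_buggy, is_balanced_fixed), the PAIRS table, and the position of the bare "return" are all assumptions made for illustration.

```python
# Hypothetical reconstruction: the real test code is not given in the source.
PAIRS = {")": "(", "]": "[", "}": "{"}
OPENERS = set(PAIRS.values())


def is_balanced_buggy(text):
    """Bracket matcher with the bug: a bare return on the mismatch path."""
    stack = []
    for ch in text:
        if ch in OPENERS:
            stack.append(ch)
        elif ch in PAIRS:
            if not stack or stack.pop() != PAIRS[ch]:
                return  # bug: a bare return yields None, not False
    return not stack


def is_balanced_fixed(text):
    """Same logic, but every exit path returns an explicit boolean."""
    stack = []
    for ch in text:
        if ch in OPENERS:
            stack.append(ch)
        elif ch in PAIRS:
            if not stack or stack.pop() != PAIRS[ch]:
                return False  # fix: report the mismatch as a boolean
    return not stack


if __name__ == "__main__":
    print(is_balanced_buggy("(]"))  # None
    print(is_balanced_fixed("(]"))  # False
```

Because None is falsy, a caller that writes "if is_balanced_buggy(s):" behaves as expected, which makes the bug easy to miss. It only surfaces when the result is compared strictly, for example with "is False" or "== False", or when a boolean type is required.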