Strict Task Zeroed
Severity 8/10
2026-W20
GPT-o3 综合 Strict Question Zeroed
Score Comparison
| Dimension | Previous | Current | Change |
|---|---|---|---|
| Overall (v5) | 60.4 | 60.1 | -0.3 |
| Code Execution (v5) | 82.9 | 80.5 | -2.4 |
| Knowledge Synthesis (v5) | 55.2 | 55.2 | +0 |
| Grounding (v5) | 72.1 | 75.2 | +3.1 |
| Value | 8.5 | 8.4 | -0.1 |
| Stability | 37.4 | 35.9 | -1.5 |
| Availability | 100.0 | 100.0 | +0 |
Affected Dimensions
execution
Top Lost Tasks 1
#1
Debug: Matrix Rotation
100
0
Model Raw Response (excerpt)
```python
def rotate(matrix):
"""
原地将 N x N 矩阵顺时针旋转 90 度
"""
n = len(matrix)
# 1. 沿主对角线转置
for i in range(n):
for j in range(i + 1, n):
matrix[i][j], matrix[j][i] = matrix[j][i], matrix[i][j]
# 2. 反转每一行
for i in range(n):
matrix[i].reverse()
return matrix
if __name__ == "__main__":
m1 = [
[1, 2, 3],
[4, 5, 6],
[7, 8, 9]
]
rotate(m1)
print(m1)
# [[7, 4, 1], [8, 5, 2], [9, 6, 3]]
m2
Run #112 · Formula v7 · Judge v6 · Benchmark v6 · 2026-05-11 04:21 SGT
View GPT-o3 Full Profile