Skip to main content
Strict Task Zeroed Severity 8/10 2026-W20

GPT-o3 综合 Strict Question Zeroed

GPT-o3 Run #112

Score Comparison

Dimension Previous Current Change
Overall (v5) 60.4 60.1 -0.3
Code Execution (v5) 82.9 80.5 -2.4
Knowledge Synthesis (v5) 55.2 55.2 +0
Grounding (v5) 72.1 75.2 +3.1
Value 8.5 8.4 -0.1
Stability 37.4 35.9 -1.5
Availability 100.0 100.0 +0

Affected Dimensions

execution

Top Lost Tasks 1

#1 Debug: Matrix Rotation 100 0
Model Raw Response (excerpt)
```python
def rotate(matrix):
    """
    原地将 N x N 矩阵顺时针旋转 90 度
    """
    n = len(matrix)

    # 1. 沿主对角线转置
    for i in range(n):
        for j in range(i + 1, n):
            matrix[i][j], matrix[j][i] = matrix[j][i], matrix[i][j]

    # 2. 反转每一行
    for i in range(n):
        matrix[i].reverse()

    return matrix


if __name__ == "__main__":
    m1 = [
        [1, 2, 3],
        [4, 5, 6],
        [7, 8, 9]
    ]
    rotate(m1)
    print(m1)
    # [[7, 4, 1], [8, 5, 2], [9, 6, 3]]

    m2
Run #112 · Formula v7 · Judge v6 · Benchmark v6 · 2026-05-11 04:21 SGT
View GPT-o3 Full Profile