综合下跌
严重度 10/10
2026-W14
GPT-4o 代码执行 (v5)下跌 10.5 分
分数对比
| 维度 | 上期 | 本期 | 变化 |
|---|---|---|---|
| 主榜 (v5) | 81.1 | 49.3 | -31.8 |
| 代码执行 (v5) | 78.0 | 62.8 | -15.2 |
| 知识综合 (v5) | 79.0 | 47.2 | -31.8 |
| 材料约束 (v5) | 80.1 | 49.1 | -31 |
| 性价比 | 79.0 | 24.9 | -54.1 |
| 稳定性 | 80.0 | 27.8 | -52.2 |
| 可用性 | 100.0 | 79.0 | -21 |
受影响维度
代码执行 (v5) -23.7
可用性 -15.9
材料约束 (v5) -14.6
性价比 -6.1
稳定性 -2.8
丢分题目 Top 5
#1
SQL窗口函数
execution
100
0
-100
严格
模型原始回复(截取)
```sql
SELECT department, name, salary
FROM (
SELECT department, name, salary,
ROW_NUMBER() OVER (PARTITION BY department ORDER BY salary DESC, joined_at ASC) as rank
FROM employees
) ranked_employees
WHERE rank = 1;
```
#2
深度配置合并
execution
100
0
-100
严格
模型原始回复(截取)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 30000, Requested 153. Please try again in 306ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#3
语义化版本比较
execution
100
0
-100
严格
模型原始回复(截取)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 30000, Requested 135. Please try again in 270ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#4
Debug:Webhook 幂等处理
execution
100
0
-100
严格
模型原始回复(截取)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 30000, Requested 206. Please try again in 412ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#5
SQL:JOIN 导致金额重复
execution
100
0
-100
严格
模型原始回复(截取)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29894, Requested 127. Please try again in 42ms. Visit https://platform.openai.com/account/rate-limits to learn more.
Run #52 · 公式 v7 · 判分 v6 · 题库 v6 · 2026-03-30 04:16 SGT
查看 GPT-4o 完整档案