Dimension Drop
Severity 10/10
2026-W12
GPT-4o Grounding (v5)下跌 21.9 分
Score Comparison
| Dimension | Previous | Current | Change |
|---|---|---|---|
| Overall (v5) | 41.2 | 39.2 | -2 |
| Code Execution (v5) | 19.6 | 48.8 | +29.2 |
| Knowledge Synthesis (v5) | 35.4 | 33.4 | -2 |
| Grounding (v5) | 62.3 | 40.4 | -21.9 |
| Value | 18.6 | 19.4 | +0.8 |
| Stability | 52.8 | 32.2 | -20.6 |
| Availability | 100.0 | 65.0 | -35 |
Affected Dimensions
Grounding (v5)
Top Lost Tasks 5
#1
Root Cause Analysis and Evidence Boundaries
Grounding (v5)
66.7
0
-66.7
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29612, Requested 800. Please try again in 824ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#2
Breaking Changes List
Grounding (v5)
66.7
0
-66.7
Strict
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29648, Requested 675. Please try again in 646ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#3
Cost Variation Calculation
Grounding (v5)
66.7
0
-66.7
Strict
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29325, Requested 695. Please try again in 40ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#4
Sustainability of High-Quality Growth
Grounding (v5)
66.7
0
-66.7
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 30000, Requested 561. Please try again in 1.122s. Visit https://platform.openai.com/account/rate-limits to learn more.
#5
Priority Board Meeting Topics
Grounding (v5)
66.7
0
-66.7
Strict
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29829, Requested 554. Please try again in 766ms. Visit https://platform.openai.com/account/rate-limits to learn more.
Run #37 · Formula v5 · Judge v6 · Benchmark v5.1 · 2026-03-22 14:26 SGT
View GPT-4o Full Profile