Skip to main content
Dimension Drop Severity 10/10 2026-W12

GPT-4o Grounding (v5)下跌 21.9 分

GPT-4o Run #37

Score Comparison

Dimension Previous Current Change
Overall (v5) 41.2 39.2 -2
Code Execution (v5) 19.6 48.8 +29.2
Knowledge Synthesis (v5) 35.4 33.4 -2
Grounding (v5) 62.3 40.4 -21.9
Value 18.6 19.4 +0.8
Stability 52.8 32.2 -20.6
Availability 100.0 65.0 -35

Affected Dimensions

Grounding (v5)

Top Lost Tasks 5

#1 Root Cause Analysis and Evidence Boundaries Grounding (v5) 66.7 0 -66.7
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29612, Requested 800. Please try again in 824ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#2 Breaking Changes List Grounding (v5) 66.7 0 -66.7 Strict
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29648, Requested 675. Please try again in 646ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#3 Cost Variation Calculation Grounding (v5) 66.7 0 -66.7 Strict
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29325, Requested 695. Please try again in 40ms. Visit https://platform.openai.com/account/rate-limits to learn more.
#4 Sustainability of High-Quality Growth Grounding (v5) 66.7 0 -66.7
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 30000, Requested 561. Please try again in 1.122s. Visit https://platform.openai.com/account/rate-limits to learn more.
#5 Priority Board Meeting Topics Grounding (v5) 66.7 0 -66.7 Strict
Model Raw Response (excerpt)
[API ERROR] Rate limit reached for gpt-4o in organization org-5kL87cAHHWwzzzRXfZoA5jZm on tokens per min (TPM): Limit 30000, Used 29829, Requested 554. Please try again in 766ms. Visit https://platform.openai.com/account/rate-limits to learn more.
Run #37 · Formula v5 · Judge v6 · Benchmark v5.1 · 2026-03-22 14:26 SGT
View GPT-4o Full Profile