
GPT-4o

Change Analysis · 2026 Week15

GPT-4o, 2026 Week15: Grounding (v5) dimension dropped 28.8 pts (63.7 to 34.9)

Score Comparison

Dimension                  Previous  Current  Change
Overall                    59.8      50.3     -9.5
Code Execution (v5)        86.5      75.1     -11.4
Knowledge Synthesis (v5)   45.8      46.5     +0.7
Grounding (v5)             63.7      34.9     -28.8
Value                      31        24.5     -6.5
Stability                  30.6      26.2     -4.4
Availability               94.9      84       -10.9
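
The Change column appears to be Current minus Previous, rounded to one decimal place. The sketch below recomputes it from the per-dimension scores in the table and picks out the largest drop. The names `scores`, `change`, and `largest_drop` are illustrative assumptions, not part of the dashboard, and the page does not state how the per-dimension scores are aggregated, so that is left out.

```python
# Scores copied from the table above: (previous, current) per dimension.
scores = {
    "Code Execution (v5)": (86.5, 75.1),
    "Knowledge Synthesis (v5)": (45.8, 46.5),
    "Grounding (v5)": (63.7, 34.9),
    "Value": (31.0, 24.5),
    "Stability": (30.6, 26.2),
    "Availability": (94.9, 84.0),
}


def change(previous: float, current: float) -> float:
    """Assumed definition: current minus previous, rounded to one decimal."""
    return round(current - previous, 1)


# Recompute the Change column for every dimension.
changes = {name: change(prev, cur) for name, (prev, cur) in scores.items()}

# The most negative change is the largest drop.
largest_drop = min(changes, key=changes.get)

for name, delta in changes.items():
    print(f"{name:<26} {delta:+.1f}")
print("Largest drop:", largest_drop, changes[largest_drop])
```

Under that assumption the recomputed values match the Change column, and Grounding (v5) is the largest mover at -28.8.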

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.
