GPT-o3

Change Analysis · 2026 Week17

GPT-o3, 2026 Week17: Stability dimension rose 5.4 pts

Score Comparison

Dimension                   Previous   Current   Change
Overall                     55.0       51.1      -3.9
Code Execution (v5)         84.7       75.9      -8.8
Knowledge Synthesis (v5)    47.2       47.8      +0.6
Grounding (v5)              56.9       47.3      -9.6
Value                       7.7        7.0       -0.7
Stability                   29.0       34.4      +5.4
Availability                93.9       85.7      -8.2
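
The Change column in the comparison above is simply Current minus Previous. A minimal sketch that recomputes it from the listed scores follows; the names `SCORES` and `score_changes` are illustrative, and treating the 55.0 → 51.1 figures as an overall score is an assumption, since the page does not label them.

```python
# Scores copied from the comparison: (previous, current).
# "Overall" is an assumed label for the 55.0 -> 51.1 figures.
SCORES = {
    "Overall": (55.0, 51.1),
    "Code Execution (v5)": (84.7, 75.9),
    "Knowledge Synthesis (v5)": (47.2, 47.8),
    "Grounding (v5)": (56.9, 47.3),
    "Value": (7.7, 7.0),
    "Stability": (29.0, 34.4),
    "Availability": (93.9, 85.7),
}


def score_changes(scores):
    """Return {dimension: current - previous}, rounded to one decimal."""
    return {name: round(cur - prev, 1) for name, (prev, cur) in scores.items()}


changes = score_changes(SCORES)

# Print each change with an explicit sign, as in the table.
for name, delta in changes.items():
    print(f"{name:<28}{delta:+.1f}")

# Dimensions that improved week over week.
gainers = [name for name, delta in changes.items() if delta > 0]
print("Improved:", ", ".join(gainers))
```

Run this way, only Knowledge Synthesis (v5) and Stability come out positive, which is consistent with the headline singling out Stability while the other dimensions declined.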

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.