Skip to main content

Claude Sonnet 4.6

Change Analysis · 2026 Week26

Claude Sonnet 4.6 2026 Week26 Code Execution (v5) dimension dropped 16.5 pts

Score Comparison

79.7 74.0 -5.7
Dimension Previous Current Change
Code Execution (v5) 87.7 71.2 -16.5
Knowledge Synthesis (v5) 93.4 95.3 +1.9
Grounding (v5) 94.5 92.9 -1.6
Value 29.7 28 -1.7
Stability 58 42 -16
Availability 100 100 0

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.

Back to Movers