Skip to main content

Claude Sonnet 4.6

Change Analysis · 2026 Week27

Claude Sonnet 4.6 2026 Week27 Knowledge Synthesis (v5) dimension dropped 8.5 pts

Score Comparison

74.0 72.9 -1.1
Dimension Previous Current Change
Code Execution (v5) 71.2 75.2 +4
Knowledge Synthesis (v5) 95.3 86.8 -8.5
Grounding (v5) 92.9 92.5 -0.4
Value 28 28.1 +0.1
Stability 42 42.7 +0.7
Availability 100 100 0

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.

Back to Movers