Grok 3
Change Analysis · 2026 Week16
Grok 3, 2026 Week 16: the Knowledge Synthesis (v5) dimension rose 3.2 points.
Score Comparison
| Previous | Current | Change |
|---|---|---|
| 65.6 | 62.7 | -2.9 |
| Dimension | Previous | Current | Change |
|---|---|---|---|
| Code Execution (v5) | 91.2 | 78.6 | -12.6 |
| Knowledge Synthesis (v5) | 51.6 | 54.8 | +3.2 |
| Grounding (v5) | 83.8 | 83.4 | -0.4 |
| Value | 25.1 | 23.6 | -1.5 |
| Stability | 35.9 | 35 | -0.9 |
| Availability | 100 | 98 | -2 |
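The Change column is the current score minus the previous score. A minimal sketch that recomputes it from the table values, assuming one-decimal rounding as displayed; the names `scores`, `change`, and `largest_drop` are illustrative and not part of the original report:

```python
# (previous, current) scores per dimension, copied from the table above.
scores = {
    "Code Execution (v5)": (91.2, 78.6),
    "Knowledge Synthesis (v5)": (51.6, 54.8),
    "Grounding (v5)": (83.8, 83.4),
    "Value": (25.1, 23.6),
    "Stability": (35.9, 35),
    "Availability": (100, 98),
}


def change(previous, current):
    """Return current minus previous, rounded to one decimal (assumed display precision)."""
    return round(current - previous, 1)


# Recompute the Change column for every dimension.
changes = {name: change(prev, cur) for name, (prev, cur) in scores.items()}

# The dimension with the most negative change.
largest_drop = min(changes, key=changes.get)

for name, delta in changes.items():
    print(f"{name}: {delta:+.1f}")
print("Largest drop:", largest_drop)
```

Recomputing the column this way reproduces the listed changes and shows that Code Execution (v5) accounts for the largest decline.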
All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.