Grok 3
Change Analysis · 2026 Week 20
Score Comparison
65.6 → 0
| Dimension | Previous | Current | Change |
|---|---|---|---|
| Code Execution (v5) | 91.2 | 0 | -91.2 |
| Knowledge Synthesis (v5) | 51.6 | 0 | -51.6 |
| Grounding (v5) | 83.8 | 0 | -83.8 |
| Value | 25.1 | 0 | -25.1 |
| Stability | 35.9 | 0 | -35.9 |
| Availability | 100 | 0 | -100 |
All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.