Qwen3 Max

Change Analysis · 2026 Week26

Qwen3 Max, 2026 Week26: Code Execution (v5) dimension dropped 6.6 pts

Score Comparison

Dimension                  Previous   Current   Change
Overall                    80.2       77.6      -2.6
Code Execution (v5)        91.2       84.6      -6.6
Knowledge Synthesis (v5)   77.7       79        +1.3
Grounding (v5)             94.5       90.7      -3.8
Value                      57.7       56.2      -1.5
Stability                  51         46.9      -4.1
Availability               100        100       0
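
The Change column appears to be the current score minus the previous score, rounded to one decimal place. The following is a minimal sketch that recomputes those differences from the table and picks out the largest drop. The names `SCORES`, `score_changes`, and `largest_drop` are hypothetical and used only for illustration; they are not part of any published tooling.

```python
# Per-dimension scores copied from the table: (previous, current).
SCORES = {
    "Code Execution (v5)": (91.2, 84.6),
    "Knowledge Synthesis (v5)": (77.7, 79.0),
    "Grounding (v5)": (94.5, 90.7),
    "Value": (57.7, 56.2),
    "Stability": (51.0, 46.9),
    "Availability": (100.0, 100.0),
}


def score_changes(scores):
    """Return {dimension: current - previous}, rounded to one decimal."""
    return {
        name: round(current - previous, 1)
        for name, (previous, current) in scores.items()
    }


def largest_drop(changes):
    """Return the dimension with the most negative change."""
    return min(changes, key=changes.get)


if __name__ == "__main__":
    changes = score_changes(SCORES)
    for name, delta in changes.items():
        print(f"{name}: {delta:+.1f}")
    print("Largest drop:", largest_drop(changes))
```

Under that assumption, the recomputed differences match the Change column, and Code Execution (v5) is the largest drop at 6.6 points.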

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.