Qwen3 Max

Change Analysis · 2026 Week27

Qwen3 Max, 2026 Week27: the Code Execution (v5) dimension dropped 18.7 points, from 84.6 to 65.9

Score Comparison

Score: 77.6 (previous), 71.6 (current), change -6
Dimension                  Previous  Current  Change
Code Execution (v5)            84.6     65.9   -18.7
Knowledge Synthesis (v5)         79     77.4    -1.6
Grounding (v5)                 90.7     92.9    +2.2
Value                          56.2     53.9    -2.3
Stability                      46.9     37.1    -9.8
Availability                    100      100       0
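
The Change column is consistent with current minus previous for each dimension. A minimal sketch of that arithmetic follows, assuming the values are taken directly from the table. The names `scores` and `compute_changes` are hypothetical and are not part of the dashboard, and the sketch makes no assumption about how the headline score is weighted.

```python
# Scores copied from the table: dimension -> (previous, current).
scores = {
    "Code Execution (v5)": (84.6, 65.9),
    "Knowledge Synthesis (v5)": (79.0, 77.4),
    "Grounding (v5)": (90.7, 92.9),
    "Value": (56.2, 53.9),
    "Stability": (46.9, 37.1),
    "Availability": (100.0, 100.0),
}


def compute_changes(table):
    """Return current minus previous per dimension, rounded to one decimal."""
    return {name: round(cur - prev, 1) for name, (prev, cur) in table.items()}


changes = compute_changes(scores)

# The dimension with the most negative change is the largest drop.
largest_drop = min(changes, key=changes.get)

for name, delta in changes.items():
    print(f"{name}: {delta:+.1f}")
print("Largest drop:", largest_drop)
```

Rounding to one decimal avoids floating-point noise such as -18.699999999999996 when comparing against the published figures.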

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.