Skip to main content

Qwen Max

Change Analysis · 2026 Week20

Score Comparison

64.8 - 0
Dimension Previous Current Change
Code Execution (v5) 82.2 0 -82.2
Knowledge Synthesis (v5) 46.4 0 -46.4
Grounding (v5) 81.6 0 -81.6
Value 49 0 -49
Stability 30.6 0 -30.6
Availability 100 0 -100

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.

Back to Movers