Qwen Max Stability Plummets by 22.8 Points: Model Update Triggers Output Quality Volatility
Qwen Max exhibits extreme duality in this week's evaluation, with significant improvements in programming and long-context tasks, but a catastrophic decline in stability metrics. This "fire and ice" performance warrants in-depth analysis.