Skip to main content

文心一言 4.0

Change Analysis · 2026-03-24-Same-Day Compare

文心一言 4.0 2026-03-24-Same-Day CompareCode Execution (v5) dimension rose 9.8 pts

Score Comparison

71.0 75.3 +4.3
Dimension Previous Current Change
Code Execution (v5) 84 93.8 +9.8
Knowledge Synthesis (v5) 41.8 46.3 +4.5
Grounding (v5) 77.9 82.4 +4.5
Value 98.6 99.1 +0.5
Stability 31.2 30.6 -0.6
Availability 100 100 0

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.

Back to Movers