Skip to main content

文心一言 4.0

Change Analysis · 2026 Week12

文心一言 4.0 2026 Week12 Code Execution (v5) dimension rose 41.4 pts

Score Comparison

49.5 64.2 +14.7
Dimension Previous Current Change
Code Execution (v5) 20.2 61.6 +41.4
Knowledge Synthesis (v5) 28.2 38 +9.8
Grounding (v5) 62.3 78.1 +15.8
Value 86.6 97.1 +10.5
Stability 52.1 30 -22.1
Availability 99 100 +1

All matched tasks had no score changes, or no tasks could be matched to the previous evaluation.

Back to Movers