Dimension Drop
Severity 10/10
2026-W12
文心一言 4.0 Stability dropped 22.1 points
Score Comparison
| Dimension | Previous | Current | Change |
|---|---|---|---|
| Overall (v5) | 49.5 | 64.2 | +14.7 |
| Code Execution (v5) | 20.2 | 61.6 | +41.4 |
| Knowledge Synthesis (v5) | 28.2 | 38.0 | +9.8 |
| Grounding (v5) | 62.3 | 78.1 | +15.8 |
| Value | 86.6 | 97.1 | +10.5 |
| Stability | 52.1 | 30.0 | -22.1 |
| Availability | 99.0 | 100.0 | +1.0 |
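
The Change column can be checked with a few lines of Python. This is a minimal sketch that assumes Change is simply Current minus Previous, rounded to one decimal place; the names `scores`, `score_changes`, and `dropped_dimensions` are hypothetical and are not part of the benchmark's own tooling.

```python
# (previous, current) scores copied from the Score Comparison table.
scores = {
    "Overall (v5)": (49.5, 64.2),
    "Code Execution (v5)": (20.2, 61.6),
    "Knowledge Synthesis (v5)": (28.2, 38.0),
    "Grounding (v5)": (62.3, 78.1),
    "Value": (86.6, 97.1),
    "Stability": (52.1, 30.0),
    "Availability": (99.0, 100.0),
}


def score_changes(table):
    """Return current minus previous per dimension, rounded to one decimal."""
    return {name: round(cur - prev, 1) for name, (prev, cur) in table.items()}


def dropped_dimensions(changes):
    """Return the dimensions whose score decreased (assumed rule: change < 0)."""
    return [name for name, delta in changes.items() if delta < 0]


changes = score_changes(scores)
dropped = dropped_dimensions(changes)

for name, delta in changes.items():
    print(f"{name}: {delta:+.1f}")
print("Dropped:", ", ".join(dropped))
```

Under that assumption, Stability is the only dimension with a negative change, which matches the Affected Dimensions list. How the severity rating is derived from the drop is not stated in the report, so the sketch does not attempt to reproduce it.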
Affected Dimensions
Stability
Run #37 · Formula v5 · Judge v6 · Benchmark v5.1 · 2026-03-22 14:26 SGT