文心一言4.5 Integrity Rating Fail: Code Execution Surges 42.5 Points but Side Metrics Collapse
In the latest Smoke quick test, 文心一言4.5 posted a deeply split report: the main score edged up, but its integrity rating dropped directly from pass to fail. This change is not an isolated incident but a concentrated manifestation of severe multidimensional volatility.