Anthropic Releases Claude's Constitution Audiobook on May 11, 2026, Sparking Controversy Over Transparency and Sonnet 4.5 Retirement

Anthropic released the audiobook version of Claude's Constitution on May 11, 2026, aiming to enhance AI safety and transparency, but faced backlash over the sudden retirement of Sonnet 4.5, accused of violating constitutional welfare principles. Winzheng.com provides a technical analysis, comparing it with peers, and offers an YZ Index v6 evaluation along with practical advice for developers and enterprises.

AI Safety Anthropic Claude模型
807

AI Big Models in Turmoil! Wenxin Yiyan Soars 24.7 Points but Integrity Collapses, Gemini Drops 16 Points in Three Consecutive Declines

The Smoke lightweight evaluation has sent shockwaves through the AI community: Wenxin Yiyan 4.5 saw its main leaderboard score soar by 24.7 points, yet its integrity rating fell from pass to fail; meanwhile, the Gemini series suffered three consecutive declines, and DeepSeek V4 Pro plummeted by 16.1 points on the main leaderboard.

GPT-5.5 ERNIE Bot Code Execution
353

2026 Mainstream AI Benchmark Horizontal Comparison: YZ Index vs SuperCLUE vs OpenCompass vs C-Eval

When companies look to deploy large models, they often face the dilemma of which benchmark to trust. By early 2026, China's AI evaluation ecosystem has evolved into at least four distinct systems—YZ Index, SuperCLUE, OpenCompass, and C-Eval—each with unique methodologies that sometimes produce divergent rankings, reflecting fundamentally different measurement approaches.

AI Evaluation YZ Index SuperCLUE
1,555