行业趋势 (1 articles)

Claude Sonnet 4.6 Rises to the Top! 8 AI Models See 25-Point Plunge in Code Execution, Industry Shakeup Uncovered

In the Smoke Lite evaluation on May 14, 2026, the key finding is shocking: Claude Sonnet 4.6 surged to the top with a main score of 84.68, but the code execution dimension of 8 mainstream AI models collectively dropped by 25 points, causing a drastic reshuffle in overall rankings. This is no coincidence—it’s a hidden crisis signal of rapid iteration in the AI industry.