Claude 3.5 Sonnet Breaks 90% in Coding Tests: AI Programming Ability Approaches Human Level
Anthropic's Claude 3.5 Sonnet model achieved 92.0% on the SWE-bench software engineering benchmark, surpassing all previous AI models and marking a new milestone in AI coding capabilities. This breakthrough sparked heated discussions on X platform with over 150,000 interactions, as developers shared real projects built with Claude and debated the future role of AI programmers.