Review Grok 4 Material Constraints Plunge 21.3 Points, Code Execution Soars 50, Main Ranking Rises 17.9
In today's Smoke evaluation, Grok 4 showed a stark divergence: its material constraint score dropped from 80.30 to 59.00, a one-day plunge of 21.3 poi
May 23, 2026