Grok 4 Tops with 97.44 Points, GPT-o3 Plunges 28 Points on Main Leaderboard
In Smoke's latest 10-question quick test, execution weaknesses of AI models were laid bare. Grok 4 reached the top with 97.44 points, while GPT-o3's main leaderboard score dropped 28.1 points from 94.53 to 66.43.