Gemini 2.5 Pro Plummets 22.6 Points on Mainboard, Engineering Judgment Halved
In today's Smoke evaluation, Gemini 2.5 Pro lost 22.6 points on the mainboard, with core execution dropping from 100 to 95 and material constraints slightly declining. The engineering judgment dimension collapsed from 66.7 to 30, and task expression fell from 50 to 10, signaling deeper issues beyond normal fluctuation.