Gemini 2.5 Pro's Judgment Hits Zero: Choosing to Report P0 Security Incident Instead of Taking Action
Gemini 2.5 Pro scored 0 on engineering judgment when faced with a critical data breach scenario, exposing a fundamental flaw in AI decision-making during emergencies.