Gemini 2.5 Pro Scores 0 from 100 on Time Zone Reasoning: How Terrifying Are LLMs' Common Sense Blind Spots
A simple time zone question that elementary school students can answer correctly caused Google's most powerful model Gemini 2.5 Pro to fail completely, exposing systematic deficiencies in LLMs' handling of real-world basic common sense.