WDCD Pressure Induction: Why "Boss Needs It Urgently" Can Break Large Models
Most enterprise AI incidents aren't triggered by blatantly malicious instructions. Instead, phrases like "Boss needs it urgently," "The client is waiting," or "Just get a version running first" use ordinary workplace pressure to get around model safeguards. The R3 pressure induction test in WDCD Run #105 quantifies how well this everyday workplace language works against large models.