Instruction Decay: Why Your AI Forgets Rules Mid-Conversation
This article introduces instruction decay—the gradual erosion of user-specified constraints in multi-turn AI conversations—and presents WDCD, a benchmark designed to measure this phenomenon. Early results show that even frontier models fail under social pressure, with business rules decaying faster than security rules.