When we started noticing patterns in our agent logs back in May 2025, something caught our attention. Users were asking agents for comprehensive, multi-step tasks, expecting depth, but they were getting surface-level answers or context window errors instead. More interesting: the agents were trying to help, spontaneously inventing filesystem-like syntax