A new study from arXiv examines how memory design in foundation-model agents creates a privacy-utility tradeoff, finding that while aggressive summarization can reduce data extraction risk by 64-76%, it creates deletion failures where summarized information remains partially recoverable even after users request erasure. The research introduces the Forgetting Residue Score to measure whether deleted data persists across memory tiers and proposes full-pipeline purging as necessary for genuine data removal.
Why it matters: As AI agents become persistent, multi-turn systems storing user data, understanding how memory architecture affects both personalization and extraction risk is critical for building deployable systems that can actually honor user deletion requests—a growing regulatory and ethical requirement.