My latest paper discusses new tools @afedercooper.bsky.social and I, with others, are using to identify not just verbatim memorization in AI models but text that is extractable with only minor changes. We find significantly more memorization once we include non-exact copies.
arxiv.org/abs/2603.24917
Recent work shows that standard greedy-decoding extraction methods for quantifying memorization in LLMs miss how extraction risk varies across sequences. Probabilistic extraction -- computing the prob...