Extracting books from production language models (LLMs):
https://arxiv.org/pdf/2601.02671v1