Large-scale online deanonymization with LLMs

https://arxiv.org/abs/2602.16800

It's over, anons.


We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News users and Anthropic Interviewer participants at high precision, given pseudonymous online profiles and conversations alone, matching what would take hours for a dedicated human investigator. We then design attacks for the closed-world setting. Given two databases of pseudonymous individuals, each containing unstructured text written by or about that individual, we implement a scalable attack pipeline that uses LLMs to: (1) extract identity-relevant features, (2) search for candidate matches via semantic embeddings, and (3) reason over top candidates to verify matches and reduce false positives. Compared to classical deanonymization work (e.g., on the Netflix prize) that required structured data, our approach works directly on raw user content across arbitrary platforms. We construct three datasets with known ground-truth data to evaluate our attacks. The first links Hacker News to LinkedIn profiles, using cross-platform references that appear in the profiles. Our second dataset matches users across Reddit movie discussion communities; and the third splits a single user's Reddit history in time to create two pseudonymous profiles to be matched. In each setting, LLM-based methods substantially outperform classical baselines, achieving up to 68% recall at 90% precision compared to near 0% for the best non-LLM method. Our results show that the practical obscurity protecting pseudonymous users online no longer holds and that threat models for online privacy need to be reconsidered.
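The three-stage pipeline the abstract describes (feature extraction, embedding search, LLM verification) can be sketched roughly like this. This is my own toy reconstruction, not the paper's code: the hash-based `embed` is a deterministic stand-in for a real sentence-embedding model, and the names are invented for illustration.

```python
import math

DIM = 64  # toy embedding dimensionality

def embed(text):
    # Stand-in for a real semantic embedding model: a tiny deterministic
    # bag-of-words hash into a fixed-size vector. A real attack would use
    # a learned sentence-embedding model here instead.
    vec = [0.0] * DIM
    for word in text.lower().split():
        vec[sum(ord(c) for c in word) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(a, b):
    # Both vectors are unit-normalized, so the dot product is the cosine.
    return sum(x * y for x, y in zip(a, b))

def top_candidates(profile_text, database, k=3):
    # Stage 2 of the pipeline: rank pseudonymous candidates in `database`
    # (name -> concatenated posts) by embedding similarity to the query
    # profile. Stage 3 would hand these top-k to an LLM to verify matches
    # and cut false positives.
    q = embed(profile_text)
    ranked = sorted(database.items(),
                    key=lambda kv: cosine(q, embed(kv[1])),
                    reverse=True)
    return ranked[:k]
```

The point of the narrowing step is cost: embedding search is cheap enough to run over an entire database, so the expensive LLM reasoning only ever sees a handful of candidates per profile.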


I wonder if it'd be practical to build some counter-technology that feeds posts through an LLM first, to "normalize" them into a more deterministic, less distinctive sentence structure and word choice. Though of course I'm sure that would make interaction a little more bland.
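For what it's worth, a minimal sketch of that idea: every post gets rewritten before publishing. The prompt wording and `rewrite_with_llm` are my own invention, not anything from the paper; a real version would swap in an actual chat-completion client.

```python
NORMALIZE_PROMPT = (
    "Rewrite the following post in plain, neutral English. "
    "Preserve the meaning, but remove idiosyncratic phrasing, "
    "slang, regionalisms, and unusual punctuation:\n\n"
)

def rewrite_with_llm(prompt):
    # Placeholder for a real chat-completion API call. Here it only
    # collapses whitespace so the sketch runs end to end.
    return " ".join(prompt.split())

def normalize_post(post):
    # Style-normalize a post before it is published pseudonymously.
    return rewrite_with_llm(NORMALIZE_PROMPT + post)
```

Worth noting that this only scrubs stylometric signals; it does nothing about *what* you post, which carries its own fingerprint.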

At the same time though, I have been able to mentally correlate alts of some people on my own, in just a casual effort, regardless of how many name changes some people do.

@arcanicanis I think it should be possible to emulate someone else's writing style. But even if you do that and try very hard not to reveal any personal details, the very nature of public conversation means that we will talk about things we care about, and this information can also be used to correlate identities.

To defeat the technique described in the paper one would need to not only alter the writing style, but also practice strict compartmentalization, and ideally avoid channels that can be scraped.