0 Followers
0 Following
1 Posts

Large-scale online deanonymization with LLMs

https://sh.itjust.works/post/57571819

Large-scale online deanonymization with LLMs - sh.itjust.works

Paper by, > Simon Lermen, Daniel Paleka, Joshua Swanson, Michael Aerni, Nicholas Carlini, Florian Tramèr It talks about deanonymizing those who writes under a pseudonym. Sites like reddit, lemmy would be that type. From the paper, > Given two databases of pseudonymous individuals, each containing unstructured text written by or about that individual, we implement a scalable attack pipeline that uses LLMs to: (1) extract identity-relevant features, (2) search for candidate matches via semantic embeddings, and (3) reason over top candidates to verify matches and reduce false positives. > Our results show that the practical obscurity protecting pseudonymous users online no longer holds and that threat models for online privacy need to be reconsidered. They can match writing styles, interests, details to infer a job or city, or other unstructured information. That allows to match unrelated pseudonyms to the same person. Like, FooFighterGroupie and Yolanda43905 are the same human, despite they never said it. It can allow also, to match a pseudonym to a real identity across sites. Like someone posted on LinkedIn with a real name. It takes less info than most people expect, to figure out Julia Greenberg of Cedarville, NH is FooFighterGroupie. You can protect yourself by never giving away much info. But ofc sometimes that’s the whole point! Think talking about specific hobbies or w/e, gives away info. Also change up writing styles + vocab use, b/c it is a unique fingerprint. I doubt this technique is used in a dragnet way… YET! But no reason it can’t scale, if the cost of resources goes low eonugh. We could eventually see it become standard, analysis to link people across sites and identities.

He also gave his famous opinion about Facebook users. Deep down, he agrees with privacy advocates. The diff is that he’s a shitty enough person to take advantage of the less techy people out there even if his society will be damaged badly in the process. Most of us are not that shitty.

they trust me

dumb fucks

I think we can move beyond Facebook here. Trusting big tech with your data never works out well.

I think that’s what it is.

What it is today. But these things tend to slip-slope their way to worse privacy violations over time. Oh, children are getting around the setting? Well, we better tie it to a government ID.

I’m more afraid of what it becomes than what it starts as.

The article:

financial service providers increasingly utilize AI features, such as automated social media screening, to determine risk scores for their customers.

I wonder how long until the absence of a discoverable social media trail will be considered a “red flag” used to deny essential services required for remedial participation in society.

Gotcha, thanks. I had a browse over the GIT issues too for Lemmy. I didn’t stumble into the right search terms to find much about this though.
That’s just replies to my comments though, isn’t it? What I am after is to see just the new posts to a thread, which mostly are from other people.

Depends on how you access Lemmy.

D’oh! I should have specified, sorry!

I use the web interface through sh.itjust.works.

sh.itjust.works - A bilingual (EN/FR) general-purpose instance located in eastern Canada! Powered by 99% renewable energy! Everyone is welcome eh.

Lemmy

Is it possible to see only new post replies?

https://sh.itjust.works/post/55755646

Is it possible to see only new post replies? - sh.itjust.works

I’m new to Lemmy, days not weeks. Liking it so far and I’m trying to contribute in a positive way to the instance. I have one usability issue, trying to figure out which replies in a post are new since I last read it. I see the number like (4 New) telling me how many, but not which. Sorting by “New” hardly helps because of the threaded display. Threading is a good thing, IMO, since it preserves the flow of the conversation. But new replies to older replies get buried with a “New” sort. When the post has only a few replies total, I can keep up simply by re-scanning the whole thread. On more popular posts that becomes infeasible. Please don’t beat me up too bad if I’m missing an obvious thing! I saw the user settings, “Show Read Posts”, but that seems to be post level, not reply level.

Donated.

I almost didn’t see this post or even this whole group. I found it in order to ask an unrelated question. Otherwise, I may never have known about the donation option.

All things considered, the budget feels very reasonable for running something like this. The donation seemed even cheaper since I was paying in $USD and it looks like it’s currently around 1:1.37 to $CAD.

I second this. Local is the way to go for medical information.

I always recommend remaining highly skeptical about apps promising to help with anything like this. There were period tracking apps sending women’s period information to data broker companies, who would then sell it onward. That’s creepy as hell! Doing everything locally avoids intrusive data collection.