Heads-up for anyone who doesn't want their posts scraped and handed over to an LLM.

https://alpaca.gold/@seldo/116286099611905851

(Quote replaced with link since I was blocked.)

More context:
@dalias WTF?
@lawyersgunsnmoney AFAICT he made a bot you hand your social media credentials over to and it logs in to all your accounts, scrapes everything, and passes it to an LLM service to "summarize" (and keep for training). Utter clownery. Normally I would just block, but I'm screenshotting and sharing this because it's a privacy threat to everyone.

@dalias @lawyersgunsnmoney he them goes and insults everyone here who values privacy even the slightest to top it all off.
Then criticized the Fediverse and pointing reason why he left.

Just typical harassment/abuser "You are the problem, not me! It's all your fault."

@YvanDaSilva @dalias @lawyersgunsnmoney Thanks for flagging up which user it is. Blocked from my server along with the IP address of his prod instance of the tool.
@gunchleoc @YvanDaSilva @dalias That is great, thank you!
@gunchleoc @YvanDaSilva @dalias @lawyersgunsnmoney What IP is he using? I'd like to add them to my .htaccess blocks as well (because I don't trust people like that to scrape more than a socnet feed.
@drwho @gunchleoc @YvanDaSilva @dalias @lawyersgunsnmoney Don't block him with .htaccess, redirect him to his own Mastodon profile to really mess with his data. Lol.

@drwho @mek2600 @gunchleoc @YvanDaSilva @lawyersgunsnmoney Unless you have users on your instance using his AI-scraper client, he's not making any http connections to your instance.

He and his app's users are following you from their own instances, so your posts get federated there, and then they're letting the app log in to their instances and scrape everything from there.

The only use of IP blocking is if you're worried you have local users who might try out his app.

@dalias @drwho @gunchleoc @YvanDaSilva @lawyersgunsnmoney Good point. I forget that fedi isn't the typical client/server design I'm used to thinking about.
@dalias @mek2600 @gunchleoc @YvanDaSilva @lawyersgunsnmoney I should have said - on my website and associated stuff.
@dalias I *very*much appreciate your sharing this info. It’s people like like this, stealing and monetizing Fedi people’s info (leaving aside he’s an entitled douche). I’m blocking him right after I ‘share my feelings”
For awareness:
@mastodonmigration
@dalias @lawyersgunsnmoney any reason not to block the entirety of alpaca dot gold? without me having to go to check
@wyatt @lawyersgunsnmoney Not really. It's an instance with a single-digit number of users who are in his circle of friends. Your call if you think they deserve to be cut off from lots of fedi by association. 🤷
@dalias @lawyersgunsnmoney there are three users on my instance, two of which are active, and one of that subset is me and the other is much more block-happy than I am
@dalias @lawyersgunsnmoney hmm, npm founder

not somebody i would want to be a friend of a friend of, probably
@wyatt @dalias I haven’t blocked his instance, but nothing prevents him from setting up another account. I may go ahead and block the whole thing.
@dalias @lawyersgunsnmoney insufficient context, though: he made it a Mastodon client, so you have to block everyone using it, which is not discoverable
@dalias this person really does not understand what consent is, does he?
@lumi @dalias He also doesn't understand what legislation is too. This is against GPDR.
@lumi @dalias understands enough to try to circumvent it.
@dalias done. Don't need yet another weirdo wanting to scrape my sac for info

@dalias holy fuck, how is that individual, instance AND client still NOT BANNED???

Cc @stux

@dalias why would you want a summary of a post feed like mastodon? What is the point?
@xri @dalias
Random guess:
To sell to others 🤷
@YvanDaSilva @xri Kinda, but I think the point is kinda to be able to dismiss people's opinions as whatever the slop-extruder claims is the "summary" of them, but pretend you considered them.

@YvanDaSilva @xri Like, if you have techbro brain and someone in your replies says something insightful in a way you haven't heard before, it has a chance of breaking thru and making you reconsider things.

But if you get it "summarized" by replacing it with a blend of the closest commonly-repeated stuff from your corpus, you're gonna interpret it as "oh yeah, just more of the same complaints I've already dismissed".

It's basically "turning your audience into yes-men as a service", which sounds like his peers would love it.

@xri @dalias when microblogging still isn't digestible enough I guess. Like cliff's notes for cliff's notes. Makes zero sense.
@dalias that sounds like "blocking" here means not appearing on someone else's feed

The Mastodon API is publicly available and blocking the author of the app or the instance won't do anything

RE: https://hachyderm.io/@dalias/116289558240910154

@natty In the case of this app, it scrapes from its user's feed, so if you block its user, you will not be in the feed it scrapes.

These techbros want us to believe their "it's silly to have norms if the machine can't enforce them" bullshit. Here, whatever blocking we can do is a statement of our disapproval of their attempts to bypass our consent and reduce the damage, not a surrender of our standing to demand they stop doing things that technical measures fundamentally can't stop.

@dalias @natty so you need to block every single user who uses this, and they definitely won't all report it

nice

at least the search engine projects are few and far between and have generally few accounts needed to block, but this is every user which is significantly worse

@SRAZKVT @dalias we have a flag for this https://docs.joinmastodon.org/spec/activitypub/#indexable but consent is not real I guess
ActivityPub - Mastodon documentation

A decentralized social networking protocol based upon the ActivityStreams 2.0 data format and JSON-LD.

@natty @SRAZKVT No, we do not have a flag for this. "Indexable" means I want people to be able to find my toots in search results. Not that I want to allow corporations to ingest them into their slop regurgitation machines and mimic my writing style but saying things I didn't and would never say and present those things as "summaries" of what I said.
@SRAZKVT @natty Or we just need to get seldo to ragequit fedi again after folks are "so mean to him" fediblocking his instance and IP blocking his bot from being used with accounts on their servers. Then he'll leave us alone and go do that shit just on birdchan 2.0.
@dalias @SRAZKVT @natty This should not be difficult to do.
@dalias I said what now?
@natty Sorry maybe I misread what you were saying about "blocking won't do anything". I'll edit not to sound accusatory.

@dalias

what a shithead

ah yes of course, wanting privacy and consent to be respected is "being a tedious bastard"

@Barmaid on est d'accord que la personne + son instance + ses alias et les alias de son mari sont/vont être bloqués ? 😬

@Lisoo on va bloquer les utilisateurs connus de l'application en question mais ça ne va pas suffire.

--milia, admin

@Barmaid @Lisoo j'ai pas regardé le code source du bordel mais si il utilise / ce signalé avec un user-agent particulier, ça peut ce bloquer via les règles nginx
@shalien @Barmaid @Lisoo justement non. En lisant le fil, il répond qu'il a pas paramétré ça et qu'il savait pas que Mastodon avait une fonctionnalité "noindex" qu'il faut respecter, etc. Et tout le monde se fait traiter de "bastards" aussi. Une brave personne…

@juliemoynat @Barmaid @Lisoo Donc il ne sais meme pas comment marche son propre truc vu qu'il passe par l'oauth pour acéder aux données, soit y a deux versions , la version live qui utilise l'oauth et l'autre qui scrap , soit il as vibe codé le machin et ne sais meme pas comment ca marche.

Dans tous les cas, pour mastodon faut autoriser l'outil à accéder au compte (meme chose qu'une application mobile) donc c'est aux gens de brancher leurs trois neuronnes.

@shalien @juliemoynat @Barmaid oui mais du coup si quelqu'un ne "branche pas ses 3 neurones" et qu'un de nos toot apparaît dans son feed, c'est notre toot qui se fait manger par l'IA aussi, donc bon :(
J'avoue être paumé'e sur quoi faire de plus, la situation me stresse.

@Lisoo @shalien @juliemoynat Bloquer un user-agent ce n'est pas une fonctionnalité de mastodon mais si jamais le créateur de l'application en utilise un à l'avenir on pourra négocier avec notre hébergeur pour mettre en place un blocage.

Idéalement il faudrait des outils de modération des applications inscrites sur l'instance (https://github.com/mastodon/mastodon/issues/27478), actuellement pas moyen d'en suspendre une ou d'éviter de nouvelles inscriptions c'est complètement open bar

--milia, admin

@dalias never have I wanted to see how many times an account was blocked as badly as I do now
@effika @dalias and at least one more block than a minute ago.

@dalias Fucking yikes.

And that implies _anyone_ can scrape your posts and hand them right to an LLM and you'd never know

This is just so gross.

(I mean obviously they COULD either way but this means well-meaning non-assholes could if they don't. like. think about what they're doing.)

@dalias I just instance blocked their server (the perks of selfhosting!) but that doesn't help if anyone _else_ uses this shit.

@IceWolf Of course anyone who can read them *can* do that. That's always possible. Just like, despite the movie industry pretending all the hellish DRM they impose makes it impossible, anyone can record movies off Netflix just by pointing a camera at their screen. It's impossible to keep someone from copying and sharing copies of something they can see.

This doesn't mean it's socially acceptable or legal. What the AI companies are doing and what the people like seldo feeding our data to them are doing is absolutely illegal, unethical, and a deep violation of privacy and consent. And they need to experience consequences for it.

@dalias I was thinking "maybe we should have reacted differently" but turns out this hypocrite techbro really doesn't give a shit about consent, so give him hell
@dalias
The issue is that you would have to block every single account using their app, if you want to keep control over this, which doesn't sound scalable.
I'm not sure how an efficient protection against that would look like, though.
@dalias blocked! classic techbro model of consent
@dalias shocking, the guy who sold out NPM is on board the slop train