πŸ‘ Poison πŸ‘ your πŸ‘ data ☠️

The goal is to make corporate data less profitable.

Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.

Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).

Using VPNs set to different locations.

Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.

Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.

If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.

@alice best answer. thank you for taking the time!

@alice scribbling notes furiously

For the less savvy among us, tysvm for this helpful advice πŸ™

@Irenetherogue sure! There are low tech ways to do itβ€”just lie...to every corporation, app, and marketer you can. Make it plausible, but wrong.

Bonus: include something wildly implausible once in a while. It makes folx more likely to overlook the subtle ones.

@alice @Irenetherogue I got off when taken to court for nonpayment of Poll Tax (Thatcher thing, yes, I'm that old) because I poisoned their data by missing out a crucial box on the form.

Don't refuse to comply but *always* sabotage their data. It's simply costs them more.

@boggin @alice @Irenetherogue
Back in the nineties I'd pay my phone bill by cheque. BT would charge me an admin fee, that eventually topped Β£7.50 just to cash a cheque. Of course they wanted to bully me in to paying via Direct Debit.
So I made all my cheques out to 'Bastard Telecom' and didn't sign them. I thought I was being very clever, forcing them to hustle for their fee.
But they just went and cashed them anyway! No idea how as they were unsigned... πŸ€”
@MostlyTato @boggin @alice @Irenetherogue
The US Interval Revenue Service is renowned for cashing checks *made out to someone else*.

@alice @Irenetherogue

Haha, you'd like my mother, the guerilla witch. She makes customer cards in every shop and switches them then with other people, bonus points if both have a strongly different consumer profile.

When she's bored, she responds maliciously questionnaires of evil corporations.

She studied psychology and statistics and says "it is anyway horribly difficult to get useful answers out of these marketing datasets, why not make it a bit harder for them?" 😈.

@earthworm TIL I have a second kid.

My education is in psychology and statistics, and I do shit like that whenever I can.

@Irenetherogue

@alice
aaawww!

u r so sweet!  

made my day!

@Irenetherogue

@alice @Irenetherogue Nightshade or Glaze every picture you have prior to posting online (I just posted several here if you'd like to share them.)
@alice I've toyed with the idea of setting up a headless Chrome instance to just ask "but why?" to ChatGPT all day to drive up their inference costs. πŸ‘€
@theorangetheme I'd love to make a bot that just hits all the big LLMs and corporate "help" bots, and simply shuffles their answers to each other randomly, then randomly up or downvotes the replies.

@alice @theorangetheme I once built a fuzz testing tool that "randomly" shuffled input around and tested it against things. "does my input validation survive utterly batshit inputs?"

Feeding the inputs through something like that would make sure they can't cache answers.

@alice @theorangetheme you can vibe the app, using their shit to create shit to fuck up shit

@alice @theorangetheme

Distant memories of hooking together two ELIZA instances...

@theorangetheme @alice lol somebody has a toddler
@theorangetheme @alice always add β€œplease” and β€œThanks” it waste sooooo many tokens. Those words are usually in a different β€œspace” that what you asked

@nickynah them's just good manners! I find it's also helpful to upload my favorite random cat photos until I hit the attachment limit. I mean, who doesn't like cat photos?

@theorangetheme

@theorangetheme @alice

Context:
https://xkcd.com/903/

(see the hidden alt text :)

Extended Mind

xkcd
@float13 @theorangetheme @alice Thanks for sharing this! Had to try of course, interesting (to me) result: this works for English entries, German ones end up circling around languages and linguistics. What that has to do with anything is , however, a philosophical question?

@alice If you're selfhosting, have a look a iocaine: https://iocaine.madhouse-project.org/

If you upload pictures, maybe nightshade would be the right tool: https://nightshade.cs.uchicago.edu/userguide.html

iocaine - the deadliest poison known to AI

@Numerfolt @alice yeah, we need to switch to offensive mode.

That makes me want to create a nightshade fuse FS.
So when you want to upload the image from your picture folder, it nightshades it on the fly.

@alice when i have to use a web app to order food, e.g. CoolBurgz (fictional) i will always put my email as e.g.

[email protected]

usually counts as valid.

@miclgael @alice if it doesn’t like that, .lol is a valid TLD. πŸ˜†
@bytex64 @alice maybe i'll just go ahead and register [email protected] haha

@miclgael
Wherever possible, I'm using my duck addresses, because all too often they send a confirmation link they need you to click.

Some places have the domain blacklisted, but not all of them.

@alice

@miclgael

Some shops umm...decline...take-away orders without a name. The POS* computer insists on one. I give them a completely random word or number. Works fine.

@alice

*(POS also stands for, ¨Point Of Sale¨)

@alice

a fair bit of the advice in here seems really good, but from what I know, AdNauseam isn't really worth using over just uBO

at least as of when I last looked into it a couple years ago: it uses more resources on your machine, doesn't really make any significant difference for the companies, and the high volume of "clicks" from you just makes you far more trackable since no normal person browsing would do so

also, I think it might be worth editing the last point to say "hopefully none of you are using LLMs, but if you're someone who does..." 🩡

@vantiss I have to disagree.

Say I go to Amazon. I use perfect tracking protection and I'm not signed in. I browse for a while, and click every ad they serve me. I've wasted a bunch of different companies marketing money, my click data is worthless, and they don't know what ads to send me. I look like every other AdNauseam user, and they still don't know who I am.

Now say I do the exact same thing, but I sign into Amazon. The exact same thing happens, but they know who I am.

...

And as far as LLMs go, waste their fucking money and resources. Use every free option you can, and take it as an opportunity to poison their feedback. Don't give them any personal info, don't use them for critical questions, just flood them with garbage that pops this bubble even faster.

Even if you don't want LLMs in everything, companies will put them thereβ€”unless it burns their wallets. The more we set fire to their AIs, the faster executives will learn it's a bad idea to use them.

@alice "Fold your punch cards"! πŸ˜ƒ

@mikro2nd @alice

Bend, fold, mutilate, and spindle!

@w_b Tape the chads back into the tiny holes...

Lacking chads to insert, masking tape works.

@mikro2nd @alice

@alice NULL is also a good answer for when you don't want to give out a particular personal detail.

Aside from phone, date of birth, and email, most of the time the front end form fields will accept NULL as an answer.

https://en.wikipedia.org/wiki/Null_(SQL)
Null (SQL) - Wikipedia

@aj @alice "[object Object]" is great for giving their devs an aneurysm trying to track down javascript bugs...
@neoluddite @aj @alice Just wondered what might happen if enough self-hosting people were to add an zero opacity image tag on their public indexable pages illustrating a d*ckbutt and having alt attribute set to Donald J. Trump.
@aj @alice Mind you, a well designed application should not interpret a string saying null as a null value.
You probably won't pull a Bobby Tables off on Facebook.
@flesh @alice @aj Probably not. However, corpo software is not always well-designed, and the current crop of layoffs + executives vibe coding make those sorts of vulnerabilities more likely.

@rabidchaos @flesh @alice @aj
If it is treating that null as a proper null there's a good chance there's constraints in place that'll fail and the app won't even check the failure...

Which can be fun, or not, depending on if it counts you as logged in after you submit the form or not

@alice

We should tax corporations by the GigaByte of storage the own.

It doesn't matter what they use it for, it should have a tangible yearly cost, to make them think about how much they store.

@alice Enter your name as [object Object] and let them try to find a bug.

@agturcz @alice

Please enlighten me... What does that do?

@w_b @alice This itself does nothing. But if you are javascript programmer, and mess something, this is being shown as a string, instead of the real value. So, this is a result of some bug.

@agturcz @alice

Thank you. My last real programming was decades ago in C.

@alice thank you! I've always wondered whether to put random made-up data but hearing the reasoning and logic spelt out like this is convincing me to actually start. Especially commonsense things like "mess with the fields that don't matter in their service to you"

@alice

Wrt #PII, It might be a good idea to avoid entering data easily identifiable as trash, and use generators instead. E.g.:

@penguinrebellion that's why I said plausible, but fake.

Generators are good though.

There are, however, reasons to enter something wildly off every so often, like "[email protected]", because it tells companies that field is obviously fake. This both makes the plausible fakes more likely to slip by if they do use your data, but also makes them more likely to discard your data for marketing and analytics purposes in general.

This is the way. I've been doing this since 1997.
@alice Non-tech-savvy question:
Is there something special about 1970-01-01, or is it just an example of an arbitrary incorrect birthdate? Would it foul things up just as much if I entered, say, 1984-04-01?
@Gorfram @alice 1970-01-01 is the first date (Unix)computers start to count from and as such a system often falls back to it when no data is available.

@patrick @Gorfram @alice

It should be noted that there will be something similar to the Year 2000 Problem somewhere in 2038: the common way to represent time, seconds since 1970-01-01 00:00, as a 32 bit number, will wrap around and make computers think they're in the past.

Hopefully(?) we learned from Y2K and are preparing for that event already.

https://en.wikipedia.org/wiki/Year_2038_problem

Year 2038 problem - Wikipedia

@diegomartinez @Gorfram @alice Yeah I know ;). I hope to be pensioned by then but the government is trying to prevent that ;).

@patrick @Gorfram @alice

That's if the planet is not destroyed before πŸ₯²