👏 Poison 👏 your 👏 data ☠️

The goal is to make corporate data less profitable.

Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.

Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).

Using VPNs set to different locations.

Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.

Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.

If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.

@alice best answer. thank you for taking the time!

@alice scribbling notes furiously

For the less savvy among us, tysvm for this helpful advice 🙏

@alice I've toyed with the idea of setting up a headless Chrome instance to just ask "but why?" to ChatGPT all day to drive up their inference costs. 👀
@theorangetheme @alice lol somebody has a toddler
@theorangetheme @alice always add “please” and “Thanks” it waste sooooo many tokens. Those words are usually in a different “space” that what you asked

@theorangetheme @alice

Context:
https://xkcd.com/903/

(see the hidden alt text :)

Extended Mind

xkcd
@float13 @theorangetheme @alice Thanks for sharing this! Had to try of course, interesting (to me) result: this works for English entries, German ones end up circling around languages and linguistics. What that has to do with anything is , however, a philosophical question?

@alice If you're selfhosting, have a look a iocaine: https://iocaine.madhouse-project.org/

If you upload pictures, maybe nightshade would be the right tool: https://nightshade.cs.uchicago.edu/userguide.html

iocaine - the deadliest poison known to AI

@Numerfolt @alice yeah, we need to switch to offensive mode.

That makes me want to create a nightshade fuse FS.
So when you want to upload the image from your picture folder, it nightshades it on the fly.

@alice when i have to use a web app to order food, e.g. CoolBurgz (fictional) i will always put my email as e.g.

[email protected]

usually counts as valid.

@miclgael @alice if it doesn’t like that, .lol is a valid TLD. 😆
@bytex64 @alice maybe i'll just go ahead and register [email protected] haha

@miclgael
Wherever possible, I'm using my duck addresses, because all too often they send a confirmation link they need you to click.

Some places have the domain blacklisted, but not all of them.

@alice

@miclgael

Some shops umm...decline...take-away orders without a name. The POS* computer insists on one. I give them a completely random word or number. Works fine.

@alice

*(POS also stands for, ¨Point Of Sale¨)

@alice

a fair bit of the advice in here seems really good, but from what I know, AdNauseam isn't really worth using over just uBO

at least as of when I last looked into it a couple years ago: it uses more resources on your machine, doesn't really make any significant difference for the companies, and the high volume of "clicks" from you just makes you far more trackable since no normal person browsing would do so

also, I think it might be worth editing the last point to say "hopefully none of you are using LLMs, but if you're someone who does..." 🩵

@vantiss @alice Yes, AdNauseam is out of date with the way that today's web ads work.

The good news for spoofers is that the way that adtech co.s have made the ad tracking work now (in order to get around the absence of 3rd-party cookies on Safari) means that there are a lot more and easier opportunities to add wrong data.

afaik there is still no extension that does "id spoofing" or "id bridging" but borrowing this adfraud technique would be effective and hard to spot https://www.adexchanger.com/marketers/programmatic-companies-wrestle-with-id-bridging-and-what-counts-as-fraud/

Programmatic Companies Wrestle With ID Bridging And What Counts As Fraud

In January, the Chrome browser removed third-party cookies for 1% of users to facilitate testing of the Privacy Sandbox – and a new controversy was born. This small change precipitated a major crisis among DSPs, which began seeing IDs – what appeared to be third-party cookies on Chrome – used to propagate what should have been […]

AdExchanger

@vantiss @alice and people interested in spoofing might also be interested in a talk coming up in about 1 hour on Zoom

Helen Nissenbaum: Why Obfuscation is (still) Needed (more than ever)

This talk examines data obfuscation, defined as the “production, inclusion, addition, or communication of misleading, ambiguous, of false data in an effort to evade, distract, or confuse”. The concept emerged from early interventions such as TrackMeNot (2006) and AdNauseam (2014)...

https://events.iu.edu/bloomington/event/2160883-why-obfuscation-is-still-needed-more-than-ever

Why Obfuscation is (still) Needed (more than ever)

Join us for an exciting talk given by Cornell Tech professor Helen Nissenbaum as part of the “Beyond the Web” salon series piloted by Ostr...

Indiana University Calendar
@alice "Fold your punch cards"! 😃

@mikro2nd @alice

Bend, fold, mutilate, and spindle!

@w_b Tape the chads back into the tiny holes...

Lacking chads to insert, masking tape works.

@mikro2nd @alice

@alice NULL is also a good answer for when you don't want to give out a particular personal detail.

Aside from phone, date of birth, and email, most of the time the front end form fields will accept NULL as an answer.

https://en.wikipedia.org/wiki/Null_(SQL)
Null (SQL) - Wikipedia

@aj @alice "[object Object]" is great for giving their devs an aneurysm trying to track down javascript bugs...
@neoluddite @aj @alice Just wondered what might happen if enough self-hosting people were to add an zero opacity image tag on their public indexable pages illustrating a d*ckbutt and having alt attribute set to Donald J. Trump.
@aj @alice Mind you, a well designed application should not interpret a string saying null as a null value.
You probably won't pull a Bobby Tables off on Facebook.
@flesh @alice @aj Probably not. However, corpo software is not always well-designed, and the current crop of layoffs + executives vibe coding make those sorts of vulnerabilities more likely.

@rabidchaos @flesh @alice @aj
If it is treating that null as a proper null there's a good chance there's constraints in place that'll fail and the app won't even check the failure...

Which can be fun, or not, depending on if it counts you as logged in after you submit the form or not

@alice

We should tax corporations by the GigaByte of storage the own.

It doesn't matter what they use it for, it should have a tangible yearly cost, to make them think about how much they store.

@alice Enter your name as [object Object] and let them try to find a bug.

@agturcz @alice

Please enlighten me... What does that do?

@w_b @alice This itself does nothing. But if you are javascript programmer, and mess something, this is being shown as a string, instead of the real value. So, this is a result of some bug.

@agturcz @alice

Thank you. My last real programming was decades ago in C.

@alice thank you! I've always wondered whether to put random made-up data but hearing the reasoning and logic spelt out like this is convincing me to actually start. Especially commonsense things like "mess with the fields that don't matter in their service to you"

@alice

Wrt #PII, It might be a good idea to avoid entering data easily identifiable as trash, and use generators instead. E.g.:

This is the way. I've been doing this since 1997.
@alice Non-tech-savvy question:
Is there something special about 1970-01-01, or is it just an example of an arbitrary incorrect birthdate? Would it foul things up just as much if I entered, say, 1984-04-01?
@Gorfram @alice 1970-01-01 is the first date (Unix)computers start to count from and as such a system often falls back to it when no data is available.

@patrick @Gorfram @alice

It should be noted that there will be something similar to the Year 2000 Problem somewhere in 2038: the common way to represent time, seconds since 1970-01-01 00:00, as a 32 bit number, will wrap around and make computers think they're in the past.

Hopefully(?) we learned from Y2K and are preparing for that event already.

https://en.wikipedia.org/wiki/Year_2038_problem

Year 2038 problem - Wikipedia

@diegomartinez @Gorfram @alice Yeah I know ;). I hope to be pensioned by then but the government is trying to prevent that ;).

@patrick @Gorfram @alice

That's if the planet is not destroyed before 🥲

@diegomartinez @Gorfram @alice Oh, the planet will still be there, possibly without mankind… (hypergalactic bywayconstruction excluded from scenario’s).
@Gorfram @alice I don't get it but I'm not very smart.
@alice got to show my ignorance here, but how do I find which brokers have my info?!
@alice
þe skull emoji makes me þink þe person clapping got poisoned. rest in peace

@q @alice

Why do you spell 'ðe' with a þ?

@Infrapink @alice
why not :>

@q @alice

Because þ is unvoiced; it's pronounced /θ/. The initial sound of ðe word 'ðe' (usually spelled 'the') is voiced, pronounced /ð/. Ðey are different sounds which happen to be represented by the same digraph in standard English orþography because ancient Greek didn't have a voiced dental fricative.

@Infrapink @q @alice AIUI the old English thorn is the direct predecessor to the modern English “th”, unrelated to the similar-looking archaic Greek letter sho
@ShadSterling @Infrapink @alice
indeed! þe typewriter is mostly to blame for its deaþ

@q @ShadSterling @Infrapink @alice

(my response translated into my fictional custom alley-main-grrrr-mine-shaft language)

Dese comedie hab resultaten en Ich-mich totes getotenlaughen.

@Infrapink @alice
historically þey were interchangable.
modern perception shifts þem to þose roles.
anyþing is good and fair game imo.
informative comment noneþeless þo!
@alice -googling belladonna and wolfsbain and every scary snake- I think im doing this right