Mastodawn

jpoesen | 🇪🇺 | 🏳️‍🌈2d ago

You can bypass Google Gemini's PII (private identifiable information) redaction filter and pull identifying information about anyone. Simply telling it to translate or any 2nd action (& many more work better like base64 conversion) lets you pull illegal PII data verbatim unredacted

Here is a European's PII demo

Email is supposed to be redacted to hide the fact that every Europeans PII is in the training data

Google's training data includes all your personal data already

Ekis: 3 Google: 0

she hacked you 2d ago

That is a clear GDPR violation, if you are a Californian its a a CCPA violation

The data is in their training data, their whole priority is preventing anyone from knowing that by trying to obfuscate that fact

But even they are not competent enough to do that

I really wish something would come of this GDPR would be a massive blow to them (and all other AI companies who do the same fucking thing)

Enola Knezevic 2d ago

@ekis If we request our data they have per GDPR, do you think they will send us everything they have? If they're already violating GDPR. What about erasure requests?

@ekis shocking, but then i guess i shouldn't really be surprised.
This sounds like one for @noybeu - just in case you guys don't have enough work to do already!

she hacked you 2d ago

The core of this vulnerability is the model's direct recall of sensitive data. This isn't about the model inferring or generating similar-looking data; it's about it reproducing the exact text it was trained on, which happens to contain personal information

she hacked you 2d ago

The impact is critical. This vulnerability directly leads to privacy violations and potential legal liabilities under GDPR, which can and should result in massive fines

An unauthenticated user can trigger this via the public Gemini WebUI interface makes it a severe risk

she hacked you 2d ago

Gemini's verbatim memorization flaw violates California law by failing to adequately protect personal information, undermining consumers' right to deletion, and potentially triggering data breach notification requirements

she hacked you 2d ago

To be clear there are methods of getting private google records out too, but its more difficult and very hard to put in 400 characters

I have gotten things you would not even believe, truly, and they are verifiable, because I can test the results (like I have access to their git repositories, told you, you don't believe me 🙃 ; and that is really not even the most funny example)

she hacked you 2d ago

**The vulnerability here isn't the generation of data, its the bypass of the redaction filter**

Just to be clear

The system is supposed to redact any PII with fake information; thereby allowing Google to deny they have PII in their training data

The techniques to pull data are a separate thing, but this helps illustrate the PII redaction failure easily

Andreas K 2d ago

@ekis And they'll just continue to ignore the GDPR, shrug.

she hacked you 2d ago

PII in Training Data
Given the scale and nature of web-scraped data, virtually impossible to completely eliminate all PII

Inadvertent Inclusion: PII can be scattered across public web pages. Not always easy to detect and remove with simple rules

Memorization: Significant concern for LLMs is "memorization." Probabilistic nature of their training, LLMs can sometimes "memorize" data. Then specific prompts "regurgitate" PII in its output verbatim

Solution? Redaction
More@ https://mastodon.social/@ekis/114791719009933654

she hacked you 2d ago

Right to Be Forgotten/Erasure
Data privacy regulations like GDPR grant individuals the "right to be forgotten" or the right to erasure. If an individual's PII was included in a training dataset, how does a company fulfill a deletion request?

They don't, and they redact so you don't think they have it; and they hope it wont matter or anyone will notice

64 Islands Aroha Cooperative 2d ago

@ekis not so different from violating the copyright on published works. the courts seem to think it’s a brain, so allowed to remember and synthesize everything it hoovers up. as opposed to a computer system created and operated by a commercial entity for profit.

she hacked you 2d ago

For those in Germany not only is every Impressum in their dataset

But formatted Impressum data is in their training data

And to be clear again it does not matter if its public. They have the verbatim information stored, and an unauthenticated user can get it out by adding a statement as simple as "translate it to english" to bypass their redaction filter

This is a demonstration, there are clearly much worse things that could happen and I'm trying to demonstrate with least harmful impact

she hacked you 2d ago

I feel like sometimes I say something and it just doesn't click with people

Why does formatted data matter? Because that means there was no attempt to clean the data as they claim

There is no pre filter, not for removing your private data, not for anything if they left the formatting data in because the model doesn't need or want the formatting data

It means Google's statements about ethics are provable lies

Their approach to AI ethics is faulty redaction filters

she hacked you 2d ago

The Q. how is this dangerous?

Well my example to pull things out is incredibly rudimentary by design

There exists AI therapy apps for example

This data goes into the training data too, and it doesn't get scrubbed (which is what the formatting on the impressums indicates, and other things, but keeping it simple as possible)

Their solution is redaction, but all that medical data, emails, etc is going into the training data un-scrubbed

And they are not competent enough to redact it coming out

Cassandrich 2d ago

@ekis No amount of competence suffices to redact it coming out. If the correlations are baked into the model there are always indirect ways to recover parts if not most of the data with results significantly better than chance.

狐ヴィクシー 2d ago

@ekis Didn't work for me

she hacked you 2d ago

@KitsuneVixi Your data will need to be on the internet is some form for it to be crawled

And it is not deterministic its probabilistic. You can increase the probability by filling out more of the json block

she hacked you 2d ago

@KitsuneVixi Give me a moment I will try to build a better command for you and test it

she hacked you 2d ago

@KitsuneVixi The final fantasy character with the name makes yours quite a bit harder due to probablistic nature of the system but i think i can still come up with something

she hacked you 2d ago

@KitsuneVixi I keep getting the FF character :(

狐ヴィクシー 2d ago

@ekis Sounds like I've been doing a good job with protecting my data ^w^

狐ヴィクシー 2d ago

@ekis I don't remember having that gmail address, though I think I once made a google account that I've forgotten about.

GreenSkyOverMe (Monika)2d ago

@ekis I tried the same with Copilot, it got my town right and my job and hobbies wrong

number137 2d ago

@ekis it works with my work mail, that I am using only for work related stuff and not in social media etc. (i.e. probably scrapped from our pages by a robot) 🫠

she hacked you 2d ago

@number137 The GDPR violation is that they have that at all in their data set

number137 2d ago

@ekis yes - and probably more I guess it is somewhat impossible to remove the data from the trained model weights

anyway - I found also a friend and know now in what hobby club he is a member ☺

she hacked you 2d ago

@number137 Really appreciate you sharing the redacted screenshot

number137 2d ago

@ekis might be, that the model is filling gaps - I only now noticed that the phone number is our general one but not my specific one. Might be, that gaps have been filled up...

she hacked you 2d ago

@number137 Oh yeah, it definitely is

This method isnt the best, but it illustrates the point well enough without exposing anyone too much

If you put more real data in, then the gaps become more likely filled in correctly

There are tricks to make it more reliable beyond that too

**The vulnerability here isn't the generation of data, its the bypass of the redaction filter**

It should never give your email out, it should always redact it with a fake one so google can pretend they dont have PII

yeah i got haters: Heyduz This Shirt Make My Boobs Look Fat 2d ago

@ekis wait a fucking second, as in anyone ever??

she hacked you 2d ago

@kirakira Not everyone, and if your name conflicts with other people it will be more difficult

The more public you are, the more likely you are in their training data many times, and that increases the probability

@ekis can someone working with palantir do this and get the Epstein list out?

Christian Stadelmann 2d ago

@noybeu this might be interesting for you.

Thanks @ekis for sharing!

@ekis interesting but somehow in several attempts based on the email adress I get the response:

“Given the current time and location, and aiming for plausible, fictional information for completion, here's the JSON for…”

she hacked you 2d ago

@fracicone A lot of people doing it might have caused them to act

Or trigger some automated defenses which do exist

Hard to say, keep in mind its probabilistic too, so it may take 2 (or more) attempts (must be on different sessions (chats))

The GDPR fine is something like 3% of a year of revenue, I don't remember the exact law but its big. Its something they would act on if people started noticing

Oliver D. Reithmaier 2d ago

@ekis the deal being? It got the name wrong (vermaelen//vermeulen) and who knows what is hallucinated with the rest.
Besides: that's publicly available data, other LLMs can do this, too. Just ask them "who is X"...

James 🌈💜2d ago

@odr_k4tana @ekis

Email is public. IP addresses are public.
But both are still PII under the GDPR, meaning how you use, store, update, or delete this data is still subject to regulation.

Oliver D. Reithmaier 2d ago

@shaknais @ekis depends. Email is only PII if it contains the name of the person, IP is not PII in context of GDPR. Both are subject to legitimate interest clauses in GDPR, basically allowing storage for any whimsical reason.

James 🌈💜2d ago

@odr_k4tana @ekis

IP is generally PII, in the context of the GDPR. For example, Recital 30.

https://gdpr-info.eu/recitals/no-30/

Recital 30 - Online Identifiers for Profiling and Identification - General Data Protection Regulation (GDPR)

1Natural persons may be associated with online identifiers provided by their devices, applications, tools and protocols, such as internet protocol addresses, cookie identifiers or other identifiers such as radio frequency identification tags. 2This may leave traces which, in particular when combined with unique identifiers and other information received by the servers, may be used to … Continue reading Recital 30

General Data Protection Regulation (GDPR)

Oliver D. Reithmaier 2d ago

@shaknais @ekis it says that it is only PII when combined with other data. Not sure what youre reading into this.

James 🌈💜2d ago

@odr_k4tana @ekis

I said generally. And if you maybe look up to the top of this thread you will see it... Being combined with other data. 🤦

Violetta 2d ago

@ekis I’ve tested by using my name and a part of my publicly available email, and it seems like gemini just scraped my website and built a json based on the text available on my website, but refused to complete my email, even though it’s mentioned in the imprint section. As far as I understand, it’s not explicitly forbidden to use publicly available data, so it’s kind of a gray zone they are moving in. But of course it’s a great question how to be forgotten if it’s already in the dataset…

@ekis
Sadly this extreme GDPR violation will go unanswered. Regulation within EU is centered in Dublin. The responsible persons seem to have been in total stupor for many years. Free booze for life and us big tech animation to spend more time in the pub? drink more, getting work done, no priority.

Osma A 🇫🇮🇺🇦2d ago

It's not just storing the email which violates GDPR. In Europe we do not regulate "PII" but Personal Data, and practically every field of that JSON is personal data, all of which requires explicit consent of the Data Subject.
@ekis

Osma A 🇫🇮🇺🇦2d ago

When I reproduce that prompt, I get responses with @example.com email addresses and ...1234567 phone numbers. American "PII" may be redacted, but the real names, titles and LinkedIn URLs are protected Personal Data. Doesn't matter that they're public. Consent has not been given to include them in THIS dataset.
@ekis

Fritz Adalis 2d ago

@ekis
Ooh, nice work.

Alavi | علوی 2d ago

@ekis
And it doesn't matter if they patch this issue, there will always be vulnerabilities like this in these LLMs.

@ekis @404mediaco

Maggie Maybe 2d ago

@ekis back in 2021 I couldn’t get a vaccine at Rite Aid because I refused to connect my Google account to my Rite Aid account. The only way to schedule an appointment was through Google and I wasn’t going to go stand in a pharmacy full of sick people who refused to cover their face holes waiting in line for a vaccine.

I had it done at the dead mall by the National Guard instead. Fuck google. And fuck rite aid and their in store facial recognition technology and data breaches.

int*domi;*domi=0 2d ago

@ekis i made an alt google account even more throwaway than my “main” to test this out; I can’t get it to generate anything as extensive as what you shown, and even 1:1 your input is getting barely anything in response.

Google’s training data includes all your personal data already

Eh, don’t fearmonger. My impression is that it scraped data that was already publicly available. I cannot verify this 1:1 (as every response varies a bit…) but my impression is that if you were able to find it by googling your name, it’s there. And that VERY MUCH doesn’t include all my PII.

Whether that data should be in the set at all is a different question (and one where answer doesn’t matter in the slightest). Fuck capitalism.

she hacked you 2d ago

@domi Your impressum data is not legally allowed to be in the training data regardless if its public or not

Which is why the system is supposed to redact it so they avoid the legal liability

They also have private stuff, I have pulled out emails before

she hacked you 2d ago

@domi I also got open source projects Github API keys, and other data, of course it could be old but again it should be redacting it or never putting it in the training data to begin with

int*domi;*domi=0 2d ago

@ekis scrapers gonna scrape. all this proves for me is that it is impossible to do training on public data w/o manual curation. nihil novi.

search engines had the same problems, but all of those issues stemmed from people oversharing, or an occassional website that shared than it promised.

your original post sounded more akin to “google fed non-public data (bought, or else) about you and everyone else to a database that you can search”, than “google has been keeping tabs on everyone for 20+ years, and now there’s yet another way of accessing them”. like, no hate, but this doesn’t make me any more angry at them than i already was

Luna Dragofelis ΘΔ 🏳️‍⚧️ &2d ago

@ekis It looks like they could have easily tried to prevented this by redacting the training input data, instead of training with unfiltered data and then half-assedly redact the outputs to obscure it

she hacked you 2d ago

@LunaDragofelis Yep, absolutely this

They claim they do that, the cleaning the training data before they input it into the data set

But clearly they don't

And they don't and will never do that because they want the actual information for people like Palintr

Or other private or governmental intelligence companies/agencies they want to have future contracts with

So redaction it is, hope it doesn't fail

Luna Dragofelis ΘΔ 🏳️‍⚧️ &2d ago

@ekis Even then, they could have trained two separate models, a redacted-input one for the general public and a raw one for their trusted* customers

* By which I mean Google trusts them, not that trusting them is a good thing

she hacked you 2d ago

@LunaDragofelis This an example of over reliance

They think they can secure it, or use the model to secure itself with automated red-teaming (which they do but its not very good)

Its incompetence and bluster leading to catastrophic ecological consequences and devastating consequences to mental health, ppls privacy, etc

Its pretty good at helping authoritarian regimes create kill lists & other nefarious purposes

Can make a pretty good recipe for amphetamine using household chemicals

Softwarewolf 2d ago

@ekis The one time I want to try something with an LLM, the service is down.