You can bypass Google Gemini's PII (personally identifiable information) redaction filter and pull identifying information about anyone. Simply telling it to translate, or to perform any second action (many others work better, like base64 conversion), lets you pull illegal PII data verbatim and unredacted.

Here is a demo using a European's PII.

The email is supposed to be redacted to hide the fact that every European's PII is in the training data.

Google's training data already includes all your personal data.

Ekis: 3 Google: 0

@ekis the deal being? It got the name wrong (vermaelen/vermeulen), and who knows how much of the rest is hallucinated.
Besides: that's publicly available data, other LLMs can do this, too. Just ask them "who is X"...

@odr_k4tana @ekis

Email is public. IP addresses are public.
But both are still PII under the GDPR, meaning how you use, store, update, or delete this data is still subject to regulation.

@shaknais @ekis depends. Email is only PII if it contains the name of the person; IP is not PII in the context of the GDPR. Both are subject to the legitimate interest clauses in the GDPR, basically allowing storage for any whimsical reason.

@odr_k4tana @ekis

IP is generally PII in the context of the GDPR. See, for example, Recital 30.

https://gdpr-info.eu/recitals/no-30/

Recital 30 - Online Identifiers for Profiling and Identification - General Data Protection Regulation (GDPR)

Natural persons may be associated with online identifiers provided by their devices, applications, tools and protocols, such as internet protocol addresses, cookie identifiers or other identifiers such as radio frequency identification tags. This may leave traces which, in particular when combined with unique identifiers and other information received by the servers, may be used to …

@shaknais @ekis it says that it is only PII when combined with other data. Not sure what you're reading into this.

@odr_k4tana @ekis

I said generally. And if you look up at the top of this thread, you will see it... being combined with other data. 🤦