CASE 2 ❌
It is not enough to only do this:
An AI feature keeps the content of text sent to it for improvement purposes. All directly identifiable information is removed from the text before storage. For example, names, emails, phone numbers, addresses are automatically removed. The rest of the text is stored and used for AI training purposes (some apps are actually doing this by the way, remain vigilant).
This data might not be anonymized at all.
It depends on the content of the text. If the content is a very personal and specific story, this anonymization technique might be useless.
For example, let’s imagine a psychologist puts a report of a patient's consultation in this app for writing optimization. Even without any name or email or address, this story could be so specific the patient is easily identifiable from the content.
For example, let’s imagine:
“The patient REDACTED was anxious about the expensive purchase of this popular social media company REDACTED. He was already having some trouble with his other car company REDACTED, and was hoping things would go better with this new one. He then confided this very personal story that happened on his private jet: ...”
Despite not containing any personal identifiers, this data would NOT be properly anonymized.
3/4 #DataAnonymization #Privacy