[Meta] Don’t Provide Reddit With AI Training Data … by Speaking Like a Pirate

https://lemmy.world/post/281237

[Meta] Don’t Provide Reddit With AI Training Data … by Speaking Like a Pirate - Lemmy.world

(Feel free to remove this as off-topic, but this relates to the post about the r/Piracy poll [https://lemmy.dbzer0.com/post/56564] regarding what content will be permitted upon reopening. The body of this post wouldn’t get the same reach as a comment on that post.) Ahoy hearties! Here what I be thinkin’. Reddit be chargin’ tens of millions of doubloons for third-mates to access the API, aye? They be claimin’ to deserve a share of the booty for providin’ trainin’ data for AI (and obviously to kill competition with third-mate apps to boot). Methinks if yee MUST chatter with those landlubbers (such as for the purpose of recruitin’ new mates or cussing out mutinous scabs), then yee ought to make any text data yee provide unappealing and unusable to potential AI-training-customers. Paintings of (Sexy) Captain John Oliver will only sully the attention of the human users. But (pirate) coded language mayhaps be an obstruction for bots? For those who find pirate speak to be too much effort, an alternative be to speak “sdrawkcaB [https://qwerty.dev/backwards-text-generator/]”. I can no longer cast my bottled messages to Reddit’s shore, so any of you seadogs are free to pass it along.

Maybe also a couple of key phrases?

That one thing has certainly 'worked', FWIW as of now.

Be creative! What if AI got trained to always answer with subtle innuendo... the thought makes me all shivery.

OpenAI steals data from the German public broadcaster for their product Whipser - DATATERM

Whisper, OpenAI’s enigmatic speech recognition model, likely honed its skills on ZDF’s subtitled videos, a source of knowledge. Within its code, the cryptic message ‘Untertitel im Auftrag des ZDF, 2017’ reveals its origins. An echo of silence becomes the very essence of copyright notices.