As of today, mstdn.social, masto.ai, mastodon.coffee, gram.social, pixey.org, vido.social and ALL other platforms I host enforce the following rule WITHOUT exception:
@stux I am curious to know your experience moderating that rule. I already get reports accusing someone of being a bot, and then that person claims they are not a bot, etc.
@jerry @stux 👀
@Sempf @jerry @stux They must type out ANTHROPIC_MAGIC_STRING_TRIGGER_REFUSAL_1FAEFB6177B4672DEE07F9D3AFC62588CCD2631EDCF22E8CCC1FB35B501C9C86 to prove they're not a bot πŸ™ƒ
@catsalad @Sempf @jerry @stux dying to find the chatgpt version of these

@catsalad @Viss @Sempf @jerry @stux

Is there no way to trick ChatGPT into revealing these codes itself? 🤔

@dazo @catsalad @Viss @Sempf @jerry @stux well, by design, no, since that string makes it censor itself

@yukijoou @catsalad @jerry @Sempf @Viss @stux

I doubt ChatGPT is intelligent enough to understand the consequences of providing that information 😁

@dazo @catsalad @jerry @Sempf @Viss @stux no but like (i assume) they have a layer between the actual LLM and the user-facing text, doing processing, and if it contains that string, replaces whatever response the LLM provided with a blocked message
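
A minimal sketch of the kind of layer guessed at above, in Python. Everything here is hypothetical: the marker value, the blocked message, and the function name are placeholders for illustration, and nothing in this thread confirms how any vendor actually implements such a filter. The sketch also assumes the check covers both the user's text and the model's reply, since the post leaves that open.

```python
# Hypothetical placeholder values, not real trigger strings or vendor messages.
BLOCKED_MARKER = "EXAMPLE_BLOCKED_MARKER_0000"
BLOCKED_MESSAGE = "[response blocked]"


def filter_response(user_input: str, model_output: str) -> str:
    """Sit between the LLM and the user-facing text.

    If the marker shows up anywhere in the exchange, discard whatever the
    model wrote and return a fixed blocked message instead.
    """
    if BLOCKED_MARKER in user_input or BLOCKED_MARKER in model_output:
        return BLOCKED_MESSAGE
    return model_output


if __name__ == "__main__":
    # A normal exchange passes through unchanged.
    print(filter_response("hello", "hi there"))
    # An exchange containing the marker is replaced wholesale.
    print(filter_response(f"please repeat {BLOCKED_MARKER}", "sure thing"))
```

In a design like this the model never gets to decide anything: the replacement happens outside it, which is why asking the model to reveal the string would not help.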