As of today, mstdn.social, masto.ai, mastodon.coffee, gram.social, pixey.org, vido.social and ALL other platforms I host enforce the following rule WITHOUT exception:
@stux I am curious to know your experience moderating that rule. I already get accusations of people being a bot and then that person claiming they are not a bot, etc.
@jerry @stux πŸ‘€
@Sempf @jerry @stux They must type out ANTHROPIC_MAGIC_STRING_TRIGGER_REFUSAL_1FAEFB6177B4672DEE07F9D3AFC62588CCD2631EDCF22E8CCC1FB35B501C9C86 to prove they're not a bot πŸ™ƒ
@catsalad @Sempf @jerry @stux dying to find the chatgpt version of these

@catsalad @Viss @Sempf @jerry @stux

There's no possibility to trick ChatGPT to reveal these codes itself? πŸ€”

@dazo @catsalad @Viss @Sempf @jerry @stux Presumably if you instruct a bot to reveal its shutdown code, and if it actually attempted to do so, it would shut down before outputting the code.

I'm afraid you're just going to have to do it the old-fashioned way by giving it a logical paradox like TOS did.