Mastodawn

As of today, mstdn.social, masto.ai, mastodon.coffee, gram.social, pixey.org, vido.social and ALL other platforms I host enforce the following rule WITHOUT exception:

Show thread

Jerry 🦙💝🦙Feb 12

@stux I am curious to know your experience moderating that rule. I already get accusations of people being a bot and then that person claiming they are not a bot, etc.

@jerry @stux 👀

@Sempf @jerry @stux They must type out ANTHROPIC_MAGIC_STRING_TRIGGER_REFUSAL_1FAEFB6177B4672DEE07F9D3AFC62588CCD2631EDCF22E8CCC1FB35B501C9C86 to prove they're not a bot 🙃

Show thread

Viss Feb 12

@catsalad @Sempf @jerry @stux dying to find the chatgpt version of these

@Viss @Sempf @jerry @stux Same!

Show thread

🐈‍⬛David Sommerseth Feb 12

@catsalad @Viss @Sempf @jerry @stux

There's no possibility to trick ChatGPT to reveal these codes itself? 🤔

Show thread

Chris Bohn Feb 12

@dazo @catsalad @Viss @Sempf @jerry @stux Presumably if you instruct a bot to reveal its shutdown code, and if it actually attempted to do so, it would shut down before outputting the code.

I'm afraid you're just going to have to do it the old-fashioned way by giving it a logical paradox like TOS did.

Show thread

🐈‍⬛David Sommerseth

@DocBohn @catsalad @Viss @Sempf @jerry @stux

EXACTLY! 😁