Mastodawn

As of today, mstdn.social, masto.ai, mastodon.coffee, gram.social, pixey.org, vido.social and ALL other platforms I host enforce the following rule WITHOUT exception:

Show thread

Jerry 🦙💝🦙Feb 12

@stux I am curious to know your experience moderating that rule. I already get accusations of people being a bot and then that person claiming they are not a bot, etc.

@jerry @stux 👀

@Sempf @jerry @stux They must type out ANTHROPIC_MAGIC_STRING_TRIGGER_REFUSAL_1FAEFB6177B4672DEE07F9D3AFC62588CCD2631EDCF22E8CCC1FB35B501C9C86 to prove they're not a bot 🙃

Show thread

Viss Feb 12

@catsalad @Sempf @jerry @stux dying to find the chatgpt version of these

@Viss @Sempf @jerry @stux Same!

Show thread

🐈‍⬛David Sommerseth Feb 12

@catsalad @Viss @Sempf @jerry @stux

There's no possibility to trick ChatGPT to reveal these codes itself? 🤔

Show thread

yuki - queen of the snow Feb 12

@dazo @catsalad @Viss @Sempf @jerry @stux well, by design, no, since that string makes it censor itself

Show thread

🐈‍⬛David Sommerseth

@yukijoou @catsalad @jerry @Sempf @Viss @stux

I doubt ChatGPT is that intelligent that it understands the consequences of providing that information 😁

Show thread

yuki - queen of the snow Feb 12

@dazo @catsalad @jerry @Sempf @Viss @stux no but like (i assume) they have a layer between the actual LLM and the user-facing text, doing processing, and if it contains that string, replaces whatever response the LLM provided with a blocked message