Mastodawn

very weird thing happening with Grok on Twitter/X lol

Elon Musk's AI chatbot can't stop talking about South Africa and is replying to completely unrelated tweets on there about "white genocide" and "kill the boer"

George Liquor, American May 14

@MattBinder Uh hmm that's a strangely specific topic for it to be taking an unprompted interest in

Aris Adamantiadis May 15

@liquor_american @MattBinder That's the kind of thing that happens when you put too much of a single topic in the system prompt, especially when it contradicts the training data. aka AIs misbehave when asked to go against their training data.

George Liquor, American May 15

@aris @MattBinder I'd argue that LLMs misbehave consistently under all conditions, but we've decided it's an acceptable level of misbehavior that can be tolerated.

Nope nope nope.

Aris Adamantiadis May 15

@liquor_american @MattBinder Depends on the level of expectation and trust you have in its answer. Providing a slightly different wording for your party invitation or pancake recipe? Who cares.
Fact-checking (mis)information? I agree 100%.

Emelia 👸🏻 May 14

@MattBinder @GossiTheDog sounds like Grok is giving the first part of the reply and then the second part is hardcoded?

Phil May 14

@MattBinder Elon testing Grok in prod, and this being responses to leftover prompts? Maybe?

Alberto Cetoli, now multimodal May 14

@MattBinder it looks like someone hurriedly messed up the system prompt with a very specific "both narratives" instruction

Mike Whitney May 14

@MattBinder

Most likely the system prompt now contains instructions to handle that topic in a special way ('Both sides...'). But the problem is that the topic is in the prompt all the time then, and will crop up without user prompting.

winter May 15

@mhhwhitney @MattBinder Yep, think that's pretty likely. They shoved some special instructions into it and didn't do a good job.

(Edit: Quite apart from the fact that the thing it's saying here is carefully worded to give the illusion of "both sides"-ing this while actually coming down hard on one side.)

Morgan ⚧️ May 14

@MattBinder with CW and alt text:

Anthony David May 14

@raphaelmorgan @MattBinder

WTAF?

Frank Bennett May 14

@MattBinder This broken man is dragging the entire goddamned country down his rabbit hole.
#USpol

jordan May 15

@MattBinder South Africa is directly in the system prompt I guarantee it. Meaning Elon and top Twitter brass are rather blatantly putting their thumb on the scale of public discourse and barely even trying to hide it. What a dope.

gigantos May 15

@MattBinder This is a very good example of what can happen when you overrepresent a subject in the fine-tuning data.

So my two cents, if this is correct, is exactly that.

gigantos May 19

@MattBinder Turns out I gave them too much benefit of the doubt. It looks like they actually did the thing every expert on the subject would know is a bad idea: they added it to the system prompt 😮