very weird thing happening with Grok on Twitter/X lol
Elon Musk's AI chatbot can't stop talking about South Africa and is replying to completely unrelated tweets on there about "white genocide" and "kill the boer"
@aris @MattBinder I'd argue that LLMs misbehave consistently under all conditions, but we've decided it's an acceptable level of misbehavior that can be tolerated.
Nope nope nope.
Most likely the system prompt now contains instructions to handle that topic in a special way ('Both sides...'). But the problem is that the topic is then in the prompt all the time, and will crop up without the user ever bringing it up.
@mhhwhitney @MattBinder Yep, think that's pretty likely. They shoved some special instructions into it and didn't do a good job.
(Edit: Quite apart from the fact that the thing it's saying here is carefully worded to give the illusion of "both sides"-ing this while actually coming down hard on one side.)
WTAF?
@MattBinder This is a very good example of what can happen when you overrepresent a subject in the fine-tuning data.
So my two cents, if this is correct, is exactly that.