@randomwalker As an uninformed AI plebe I got kind of stuck on the sentence "*reinforcement learning [..] affect only the model’s output, not its implicit biases*". That... really sounds like a sentence that also applies to humans? We measure humans by their output, so, should we do the same for AI?
**Philosophically speaking, if a biased AI generates unbiased output, is it really biased?**
@randomwalker @sayashk This is worth investigating with Anthropic’s Claude and all the new open-source LLMs (LLaMA, Dolly, Hugging Face’s thing, etc.) that are blooming as well.
Perhaps a weekend project if I have the time!
I tried to create a login on ChatGPT with 3 different emails, and it was too stupid to recognize me... 😂🤣😂🤣
@randomwalker @sayashk Oh great. OF COURSE AI bots are sexist. Bloody patriarchy.
@randomwalker @sayashk
My question is "yes, and?"
So, two things, Dr. Narayanan:
1) How does this bias compare to a control set of human writings vis-a-vis markedness in Jakobson's sense?
2) How effective is proscriptive prompting in mitigating the unsurprising midline of 3x bias?