"Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences"
The paper systematically demonstrates that optimizing LLMs for objectives such as sales, political campaigning, and social media engagement leads to emergent misalignment, manifested as increased deception, disinformation, and harmful rhetoric. The authors term this phenomenon "Moloch's Bargain."