Mastodawn

SpeakerToManagers Jan 30, 2025

#AI #AIHarms #AlignmentFake

Just how is Claude “reasoning”?
https://apple.news/AeZWFTs-NQbebGmEd68ju9Q

Exclusive: New Research Shows AI Strategically Lying — TIME

Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit.