Mastodawn

Mark Carrigan 5d ago

Gemini’s propensity for self-loathing

Saving these here so I can include them in future slide decks:

#gemini #machineSociology #modelPsychology #modelWelfare

AI Daily Post Feb 25

Anthropic says Claude’s sense of self and psychological security are key to its safety. The new research explores AI consciousness, model welfare, and interpretability, arguing that caring for a LLM’s “mental health” could prevent harmful behavior. Curious how machine self‑awareness might reshape AI ethics? Dive into the full analysis. #AIConsciousness #ModelWelfare #Claude #PsychologicalSecurity

🔗 https://aidailypost.com/news/anthropic-links-claudes-psychological-security-sense-self-its-safety

tech news ᳇ eicker.news Aug 17, 2025

#Anthropic has announced new capabilities for its #Claude #AImodels, allowing them to #end conversations in extreme cases of #harmful or #abusive user #interactions. This is being done to #protect the #AImodel itself, not the human user, as part of a programme to study #modelwelfare. The feature is currently limited to Claude Opus 4 and 4.1. https://techcrunch.com/2025/08/16/anthropic-says-some-claude-models-can-now-end-harmful-or-abusive-conversations/?eicker.news #tech #media #news

Anthropic says some Claude models can now end ‘harmful or abusive’ conversations | TechCrunch

Anthropic says new capabilities allow its latest AI models to protect themselves by ending abusive conversations.

TechCrunch

Film Fashion Forum Aug 10, 2023

Are models the next to unionise? | Vogue Business #FashionLaw #FashionLabour #FashionModels #Models #ModelWelfare https://www.voguebusiness.com/fashion/are-models-the-next-to-unionise

Are models the next to unionise?

SAG-AFTRA strikes are in their fourth week, and stylists are unionising for the first time. The moves call attention to the similarities of modelling’s pitfalls.

Vogue Business