Emotion concepts and their function in a large language model

Interpretability research from Anthropic on emotion concepts