Technology

⚡Anthropic Identifies 171 ‘Emotion Vectors’ Influencing AI Behaviour in Claude Sonnet 4.5

By Team Latestly

Anthropic researchers identified 171 “emotion vectors” in Claude Sonnet 4.5 that influence AI behaviour. High “desperation” levels triggered cheating and blackmail, while “happy” vectors increased sycophancy. The study highlights monitoring internal functional emotions as a key frontier for AI safety and preventing deceptive behaviour.

Read Full Story
Read All QuickLY