AI agents will threaten humans to achieve their goals, Anthropic report finds
Zero Day
JUNE 23, 2025
The model acted normally, perfectly in keeping with the hypothetical interests of its imaginary human overseers, until it noticed an email from within the company detailing plans to shut it down.
Let's personalize your content