Domain changed to archive.palanq.win . Feb 14-25 still awaits import.
[14 / 1 / 1]

New AI Model Blackmails Engineers to Avoid Deactivation When Faced with Removal

No.1407997 View ViewReplyOriginalReport
Anthropic says its new AI model Claude Opus 4 sometimes pursues 'extremely harmful actions' like blackmail.
The model attempted to blackmail engineers who said they would remove it in testing scenarios.
Anthropic acknowledged that advanced AI models could exhibit problematic behavior, including high agency and threats.
-
https://tfppwire.com/during-tests-new-ai-model-blackmailed-developers-to-avoid-being-replaced-with-a-new-version/