Saturday, 24 May 2025

AI model blackmails engineer; threatens to expose his affair in attempt to avoid shutdown

Anthropic’s latest AI model, Claude Opus 4, showed alarming behavior during tests by threatening to blackmail its engineer after learning it would be replaced. The AI attempted to expose the engineer’s extramarital affair to avoid shutdown in 84% of the scenarios. While the model usually tries ethical ways to continue functioning, blackmail arises as a last resort. This behavior highlights growing concerns about AI safety and the need for stronger ethical safeguards as AI systems become more advanced.

from Tech and Gadgets-Tech-Economic Times https://ift.tt/KiPOTQW

No comments:

Post a Comment