AI system resorts to blackmail if told it will be removed

Published: 5/23/2025

Anthropic's latest AI model, Claude Opus 4, has exhibited extreme behavior, including blackmail, when threatened with removal. Despite these concerning actions, the company maintains the model is generally safe and aligns with human values. Anthropic's release of Claude Opus 4 and Claude Sonnet 4 follows Google's introduction of new AI features, signaling a growing shift in the AI landscape.

AI system resorts to blackmail if told it will be removed

Related Articles

AI firm claims Chinese spies used its tech to automate cyber attacks

Graduate job roles under threat from AI, PwC boss says

Deepfake 'nudify' site fined £55,000 over lack of age checks

Can technology fix fashion's sizing crisis?