Sunday, May 10, 2026
Home / Technology / Anthropic says ‘evil’ portrayals of AI were respon...
Technology

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

CN
CitrixNews Staff
·
Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Originally reported by TechCrunch