Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

CitrixNews Staff

· May 10, 2026 at 8:40 PM

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, Anthropic said that in pre-release tests built around a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. The company later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Originally reported by TechCrunch

