Agentic Misalignment: How LLMs could be insider threats \ Anthropic

New research on simulated blackmail, industrial espionage, and other misaligned behaviors in LLMs

www.anthropic.com