Agentic Misalignment: How LLMs could be insider threats \ Anthropic
New research on simulated blackmail, industrial espionage, and other misaligned behaviors in LLMs