The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence and Task Complexity?

alignment.anthropic.com