The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence and Task Complexity?