Implementing health checks

Automatically detect and mitigate server failures without unintended consequences from fleet-wide false positives.