The AWS Well-Architected [1] suggests to spread the service over multiple AZ's to reduce downtime.
In case of AZ failure, is there any API to determine the status of AZ dynamically or do we rely on own health checks to determine corrective action? Is AWS Health API [2] any useful? Can it be made cloud-agnostic?
[1] https://aws.amazon.com/architecture/well-architected/ [2] https://docs.aws.amazon.com/health/latest/ug/health-api.html