Summary

  • Use readiness and liveness probes to monitor the availability of a VM's application.

  • VMs have the spec.running and spec.runStrategy settings, which control the running state of the VM.

  • The watchdog feature detects and restarts unresponsive operating systems, but it does not detect application failures.

  • Eviction strategies determine the appropriate actions to take with orphaned resources, such as pods, persistent volumes, and VMs.

  • Machine health checks automatically remediate an unhealthy machine if the machine belongs to a particular machine pool.

  • As the standard remediation for unhealthy nodes, OpenShift drains the remaining VMs on the failed node, and reschedules them to healthy nodes so that the failed node can register itself again within the cluster and restore its annotations and labels.

  • Power-based remediation uses a power controller to power off an inoperable node.

  • Use the Self Node Remediation Operator to automatically reboot unhealthy nodes.

  • Deleting a node deletes the node object in Kubernetes, but it does not delete the pods on the node.
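
The VM-level settings summarized above (run strategy, probes, watchdog, and eviction strategy) can be sketched together in one VirtualMachine manifest fragment. This is an illustrative sketch rather than a complete or authoritative manifest: the VM name example-vm, the watchdog name, port 8080, the /healthz path, and all timing values are assumptions chosen for the example, and the disks, volumes, and networks are omitted.

```yaml
apiVersion: kubevirt.io/v1
kind: VirtualMachine
metadata:
  name: example-vm              # hypothetical name
spec:
  # runStrategy and running are mutually exclusive; set only one of them.
  runStrategy: Always
  template:
    spec:
      # Live migrate the VM instead of shutting it down when its node is drained.
      evictionStrategy: LiveMigrate
      domain:
        devices:
          # Watchdog device: acts when the guest OS stops responding.
          # It does not detect application failures.
          watchdog:
            name: example-watchdog   # hypothetical name
            i6300esb:
              action: poweroff
        resources:
          requests:
            memory: 1Gi              # assumed size
      # Readiness probe: the VM is treated as not ready while the check fails.
      readinessProbe:
        httpGet:
          port: 8080                 # assumed application port
          path: /healthz             # assumed health endpoint
        initialDelaySeconds: 120
        periodSeconds: 10
        failureThreshold: 3
      # Liveness probe: the VM is restarted when the check keeps failing.
      livenessProbe:
        tcpSocket:
          port: 8080                 # assumed application port
        initialDelaySeconds: 120
        periodSeconds: 20
```

The probes cover application availability, whereas the watchdog covers only an unresponsive guest operating system, so the two mechanisms are typically configured together.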

Revision: do316-4.14-d8a6b80