Health Monitoring and TrackingA health model defines what it means for a system to be healthy (operating within normal conditions) or unhealthy (failed or degraded) and the transitions in and out of such states. Good information on a system's health is necessary for the maintenance and diagnosis of running systems. The contents of the health model become the basis for system events and instrumentation on which monitoring and automated recovery is built. To keep an application up and running, the operations team needs to watch the application's health metrics, detect symptoms of a problem, diagnose the cause, and fix the problem before the application performs unacceptably. This is referred to as health monitoring and tracking. To create a health model, the modeler needs to do the following:
|