Root Cause Analysis

Root cause analysis is the last and most complex step of event management. It consists of analyzing dependencies between events, based, for instance, on a model of the environment and dependency graphs, to detect whether some events can be explained by others. For example, if database D runs on server S and this server gets durably overloaded (CPU used at 100% for a long time), the event “the SLA for database D is no longer fulfilled” can be explained by the event “Server S is durably overloaded”.

Iotellect's approach for a root cause analysis is implemented by setting up advanced Alerts. An alert, that is an event by itself, is always caused by a certain event (if using Event Triggers) or state (if using Variable Triggers). This cause is associated with the alert event and persistently stored, allowing for the root cause of any problem to be found during real-time event monitoring or when analyzing event history.

Was this page helpful?