Date Added: Nov 2010
Problem determination in a large, dynamic IT service is a challenging task. In this paper, the authors propose a framework for problem determination based on monitoring the event streams generated by the different components of an IT service. They give a generic representation of a problem as a spatio-temporal pattern: a graph whose vertices capture the location and time of the matching events, and whose edges represent the spatio-temporal conditions between two matching events. The spatial conditions are based on the underlying system topology graph, and the temporal conditions are based on event timestamps. A practical implementation of this framework will require fast algorithms for detecting such patterns.
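To make the pattern representation concrete, the following is a minimal sketch, not the authors' algorithm. It assumes a spatial condition can be expressed as a maximum hop distance in the topology graph and a temporal condition as a maximum delay between timestamps. All names (`Event`, `Condition`, `hop_distance`, `find_matches`), the sample topology, and the sample events are hypothetical. The matcher is deliberately brute force, which illustrates why a practical implementation would need faster detection algorithms.

```python
from collections import deque
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Event:
    kind: str    # event type, e.g. "db_timeout" (hypothetical)
    node: str    # location: a vertex of the topology graph
    time: float  # event timestamp in seconds


@dataclass(frozen=True)
class Condition:
    src: int          # index of the source pattern vertex
    dst: int          # index of the destination pattern vertex
    max_hops: int     # assumed spatial condition: topology distance bound
    max_delay: float  # assumed temporal condition: 0 <= t_dst - t_src <= max_delay


def hop_distance(topology, start, goal):
    """Breadth-first search; returns the hop count, or None if unreachable."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == goal:
            return dist
        for nxt in topology.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None


def find_matches(topology, events, vertex_kinds, conditions):
    """Brute force: try every assignment of events to pattern vertices."""
    candidates = [[e for e in events if e.kind == k] for k in vertex_kinds]
    matches = []
    for combo in product(*candidates):
        satisfied = True
        for c in conditions:
            a, b = combo[c.src], combo[c.dst]
            hops = hop_distance(topology, a.node, b.node)
            delay = b.time - a.time
            if hops is None or hops > c.max_hops or not 0 <= delay <= c.max_delay:
                satisfied = False
                break
        if satisfied:
            matches.append(combo)
    return matches


# Hypothetical three-tier service: web <-> app <-> db.
topology = {"web": ["app"], "app": ["web", "db"], "db": ["app"]}
events = [
    Event("db_timeout", "db", 10.0),
    Event("http_500", "web", 12.0),
    Event("http_500", "web", 50.0),
]
# Pattern: a db_timeout followed within 5 s by an http_500 at most 2 hops away.
vertex_kinds = ["db_timeout", "http_500"]
conditions = [Condition(src=0, dst=1, max_hops=2, max_delay=5.0)]
matches = find_matches(topology, events, vertex_kinds, conditions)
```

Because the matcher enumerates the full Cartesian product of candidate events, its cost grows exponentially with the number of pattern vertices; it serves only to illustrate the graph representation described above.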