Event Queries on Correlated Probabilistic Streams
A major problem in detecting events in streams of data is that the data can be imprecise (e.g. RFID data). However, current state-of-the-art event detection systems such as Cayuga, SASE or SnoopIB, assume the data is precise. Noise in the data can be captured using techniques such as hidden Markov models. Inference on these models creates streams of probabilistic events which cannot be directly queried by existing systems. To address this challenge the authors propose Lahar, an event processing system for probabilistic event streams. By exploiting the probabilistic nature of the data, Lahar yields a much higher recall and precision than deterministic techniques operating over only the most probable tuples.