Optimizing Probabilistic Query Processing on Continuous Uncertain Data
Uncertain data management is becoming increasingly important in many applications, in particular, in scientific databases and data stream systems. Uncertain data in these new environments is naturally modeled by continuous random variables. An important class of queries uses complex selection and joins predicates and requires query answers to be returned if their existence probabilities pass a threshold. In this paper, the authors optimize threshold query processing for continuous uncertain data by; expediting joins using new indexes on uncertain data, expediting selections by reducing dimensionality of integration and using faster filters, and optimizing a query plan using a dynamic, per-tuple based approach.