Download Now Free registration required
One of the most well-studied problems in data mining is computing association rules from large transactional databases. Often, the rule collections extracted from existing data-mining methods can be far too large to be carefully examined and understood by the data analysts. This paper addresses exactly this issue of overwhelmingly large rule collections by introducing and studying the following problem: Given a large collection R of association rules one wants to pick a subset of them S ? R that best represents the original collection R as well as the dataset from which R was extracted. The paper first quantify the notion of the goodness of a ruleset using two very simple and intuitive definitions.
- Format: PDF
- Size: 279.6 KB