An Efficient Mining Algorithm by Bit Vector Table for Frequent Closed Itemsets
Abstract - Mining frequent closed itemsets in data streams is an important task in stream data mining. In this paper, an Efficient Mining Algorithm (denoted as EMAFCI) for Frequent Closed Itemsets in data stream is proposed. The algorithm is based on the sliding window model, and uses a Bit Vector Table (denoted as BVTable) where the transactions and itemsets are recorded by the column and row vectors respectively. The algorithm first builds the BVTable for the first sliding window. Frequent closed itemsets can be detected by pair-test operations on the binary numbers in the table. After building the first BVTable, the algorithm updates the BVTable for each sliding window.