Multi DIM SEQ-Data Mining with a Parallel Approach

Provided by: Creative Commons
Topic: Data Management
Format: PDF
The PTPSPM algorithm (a Parallel algoriThm based on Prefix tree for Sequence Pattern Mining) is proposed to address the limited speed and effectiveness of sequence pattern mining on massive data. The paper introduces a new prefix-tree structure and an improved PrefixSpan algorithm to mine the local sequences; the global sequences are then obtained by merging all the local sequences. A new prefix-tree pruning technique is presented to delete the global k-sequences which cannot be attended. PTPSPM uses dynamic scheduling driven by an index table of projected-database identifiers so that processors do not sit idle waiting for work.
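
The mine-locally-then-merge idea can be illustrated with a minimal sketch. This is not the PTPSPM algorithm itself: it uses a plain textbook PrefixSpan on single-item sequences, has no prefix tree, no pruning step and no dynamic scheduling, and runs the partitions one after another rather than on separate processors. The function names `prefixspan` and `mine_partitioned`, the `n_parts` parameter, and the choice to mine each partition with a local support of 1 are all assumptions made for illustration.

```python
from collections import Counter


def prefixspan(sequences, min_support):
    """Return {pattern: support} for sequences of single items (textbook PrefixSpan)."""
    results = {}

    def mine(prefix, projected):
        # projected holds the suffix of each sequence that follows the current prefix
        counts = Counter()
        for suffix in projected:
            counts.update(set(suffix))  # count each item at most once per sequence
        for item, support in counts.items():
            if support < min_support:
                continue
            pattern = prefix + (item,)
            results[pattern] = support
            # Project on the first occurrence of item and recurse
            next_projected = [s[s.index(item) + 1:] for s in projected if item in s]
            mine(pattern, next_projected)

    mine((), [list(s) for s in sequences])
    return results


def mine_partitioned(sequences, min_support, n_parts=2):
    """Mine each partition locally, then merge local supports into global ones."""
    parts = [sequences[i::n_parts] for i in range(n_parts)]
    merged = Counter()
    for part in parts:  # assumption: each part could be handed to its own worker
        merged.update(prefixspan(part, 1))  # local support 1 keeps the merge exact
    # Keep only the patterns whose summed support reaches the global threshold
    return {p: s for p, s in merged.items() if s >= min_support}


db = ["abc", "acb", "ab", "bc"]
global_patterns = mine_partitioned(db, min_support=2)
```

Mining every partition with a local support of 1 keeps the merged counts exact but generates many local candidates; reducing that cost is the kind of problem a pruning step would target.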
