Binary Information Press
Web log mining is one of the domains of sequential pattern mining; it is used to find the access patterns of web logs. Web Access Pattern tree (WAP-tree) mining is a sequential pattern mining technique for web log access sequences. In this paper, the array-based and effectively use of the prefix tree technique sequential pattern mining algorithm, WAP-mining is presented. It is based on a novel data structure named W-matrix to store the sequence number, which could greatly reduce the needs to traverse WAP-trees and the number or length of conditional sequence bases.