Provided by: Binary Information Press
Topic: Data Management
In this paper, the authors aim to provide a ground on frequently changing structures from XML data could be discovered mainly devoted to the discovery of structural changes, while neglected the changes of content. Therefore, an improved approach, i.e. SC-mining, is proposed in this paper to determine a better way to mining frequently changing sections from XML documents considering changes of both structure and content. In order to reduce the times of scanning documents and make the discovering process efficient, a data model, Historical Structure and Content-Document Object Model (HSC-DOM), is proposed, together with some optimization techniques.