International Journal of Engineering and Innovative Technology (IJEIT)
The growing volume of XML data available to users makes it increasingly necessary to investigate techniques for extracting knowledge from such data. Data mining is widely applied in database research to extract frequent correlations of values from both structured and semi-structured datasets. In this paper, the authors describe an approach to mining Tree-based Association Rules (TAR) from XML documents. Such rules provide information on both the structure and the content of XML documents; moreover, they can be stored in XML format and queried later.