Extracting information from semistructured documents is a very hard task, and it will become increasingly critical as the amount of digital information available on the Internet grows. Indeed, documents are often so large that the dataset returned in answer to a query may be too big to convey interpretable knowledge. This paper describes an approach based on Tree-based Association Rules (TARs): mined rules that provide approximate, intensional information on both the structure and the contents of XML documents, and that can themselves be stored in XML format.
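
To make the idea of a mined rule that is itself stored as XML more concrete, the following is a minimal Python sketch, not the paper's algorithm. It assumes a heavy simplification: a pattern is a flat set of child tag/text pairs rather than a full subtree, and the sample document, the function names (`matches`, `mine_rule`, `rule_to_xml`), and the `<rule>` output layout are all hypothetical choices made for illustration only.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, invented for this illustration.
DOC = """
<papers>
  <paper><author>Rossi</author><venue>VLDB</venue></paper>
  <paper><author>Rossi</author><venue>VLDB</venue></paper>
  <paper><author>Rossi</author><venue>ICDE</venue></paper>
  <paper><author>Bianchi</author><venue>ICDE</venue></paper>
</papers>
"""


def matches(elem, pattern):
    """Return True if elem has, for every (tag, text) pair in pattern,
    a direct child with that tag and that (stripped) text."""
    return all(
        any(child.tag == tag and (child.text or "").strip() == text
            for child in elem)
        for tag, text in pattern.items()
    )


def mine_rule(root, record_tag, body, head):
    """Compute support and confidence of the rule body => body + head.

    Assumption: support is the fraction of records matching body and head
    together; confidence is that count divided by the count matching body.
    """
    records = root.findall(record_tag)
    full = {**body, **head}
    n_body = sum(matches(r, body) for r in records)
    n_full = sum(matches(r, full) for r in records)
    support = n_full / len(records) if records else 0.0
    confidence = n_full / n_body if n_body else 0.0
    return support, confidence


def rule_to_xml(record_tag, body, head, support, confidence):
    """Serialize the rule as XML (the element layout is a made-up format)."""
    rule = ET.Element("rule",
                      support=f"{support:.2f}",
                      confidence=f"{confidence:.2f}")
    for part_name, part in (("body", body), ("head", head)):
        record = ET.SubElement(ET.SubElement(rule, part_name), record_tag)
        for tag, text in part.items():
            ET.SubElement(record, tag).text = text
    return ET.tostring(rule, encoding="unicode")


root = ET.fromstring(DOC.strip())
body = {"author": "Rossi"}   # antecedent: papers written by Rossi
head = {"venue": "VLDB"}     # consequent: ... appear at VLDB
support, confidence = mine_rule(root, "paper", body, head)
rule_xml = rule_to_xml("paper", body, head, support, confidence)
print(rule_xml)
```

In this toy setting the rule acts as an approximate, intensional summary: instead of returning every matching `<paper>` element, a query about Rossi could be answered with the rule and its confidence. Because the rule is itself XML, it can be stored and queried with the same tools as the original document.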
- Format: PDF
- Size: 362.85 KB