Towards Data Quality and Data Mining Using Constraints in XML
Source: University of South Australia
High-quality data is necessary for data mining techniques and, conversely, data mining techniques can be used to measure the quality of data. Both data mining and data quality have received much attention for relational data in the past. However, as a massive amount of data is now stored and represented on the web in XML, two questions are attracting research interest: the quality of XML data for mining purposes, and the use of data mining techniques to measure that quality. The paper addresses these two interrelated issues: how quality XML data is useful for data mining in XML, and how data mining in XML can be used to measure the quality of XML data.