Declarative XML Data Cleaning with XClean

Provided by: Hasso-Plattner-Institut
Topic: Data Management
Format: PDF
Data cleaning is the process of correcting anomalies in a data source that may for instance be due to typographical errors, or duplicate representations of an entity. It is a crucial task in customer relationship management, data mining, and data integration. With the growing amount of XML data, approaches to effectively and efficiently clean XML are needed, an issue not addressed by existing data cleaning systems that mostly specialize on relational data. The authors present XClean, a data cleaning framework specifically geared towards cleaning XML data.

Find By Topic