Data Cleaning Using Clustering Based Data Mining Technique

Data cleaning is one of the basic tasks performed during knowledge discovery in databases, during the modification and integration of database schemas, and in the creation of data warehouses. Data cleaning, also called data cleansing or scrubbing, deals with detecting and removing errors and inconsistencies from data in order to improve data quality. This paper summarizes data quality problems and implements an algorithm that uses a clustering-based data mining technique for data standardization and data correction.
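
To make the idea of clustering-based standardization concrete, the following is a minimal illustrative sketch in Python. It is not the algorithm implemented in the paper, whose details are not given in this abstract. It assumes a simple greedy clustering of attribute values by string similarity, where each cluster is represented by its most frequent spelling; the function names (`similarity`, `cluster_values`, `standardize`) and the threshold of 0.8 are hypothetical choices made only for illustration.

```python
from collections import Counter
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio in [0, 1], ignoring case and outer whitespace."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def cluster_values(values, threshold=0.8):
    """Greedily group similar strings (assumed scheme, not the paper's).

    Returns a dict mapping each cluster representative to its members.
    """
    counts = Counter(values)
    clusters = {}  # representative -> list of member strings
    # Visit distinct values from most to least frequent so that the most
    # common spelling becomes the representative of its cluster.
    for value, _ in counts.most_common():
        for rep in clusters:
            if similarity(value, rep) >= threshold:
                clusters[rep].append(value)
                break
        else:
            # No existing cluster is close enough: start a new one.
            clusters[value] = [value]
    return clusters


def standardize(values, threshold=0.8):
    """Replace every value with the representative of its cluster."""
    clusters = cluster_values(values, threshold)
    mapping = {m: rep for rep, members in clusters.items() for m in members}
    return [mapping[v] for v in values]


if __name__ == "__main__":
    cities = ["New York", "New York", "new york", "New Yrok",
              "Boston", "Boston", "Bostn"]
    print(standardize(cities))
```

With the sample list above, the variants "new york" and "New Yrok" are mapped to "New York", and "Bostn" is mapped to "Boston". The threshold controls how aggressive the correction is: a lower value merges more variants but risks merging values that are genuinely distinct.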

