An Effective Data Preprocessing Technique for Improved Data Management in a Distributed Environment

Provided by: International Journal of Computer Applications
Topic: Big Data
Format: PDF
With the evolution of distributed computing, the databases are inherently distributed across the globe and therefore data analysis from various data sources is very essential in decision making. The core need in the current industrial environment is hence to extract information from the huge, complex and dynamic data through data mining techniques. Integrating data from multiple data sources and analyzing the large, complex dynamic data is a tedious and complex work. Additionally, database consists of inconsistent and noisy data. Further, with the decrease in quality of data to be mined the quality of knowledge model obtained from it also decrease which in-turn affects the decision making process.

Find By Topic