Provided by: IRPN Press
Topic: Big Data
In this paper, the authors have study the performance of k-Mean data-mining algorithm (k-Mean),which is implemented on the heterogeneous compute cluster with the multi core programming. The multi-core program is implemented with MPI and C for the parallel computing and utilizing the maximum compute power of the heterogeneous cluster. The heterogeneous cluster is established with the help of MPICH2. They have also analyzed the efficiency and performance of k-Mean data mining algorithm for the large dataset. The dataset, which they have used, is chess.txt. The dataset is divided into the number of cores and core compute the dataset independently and makes a data cluster of similar dataset on each processor core.