An efficient method to improve the clustering performance for high dimensional data by Principal Component Analysis and modified K-means

Clustering analysis is one of the main analytical methods in data mining. K-means is the most popular and partition based clustering algorithm. But it is computationally expensive and the quality of resulting clusters heavily depends on the selection of initial centroid and the dimension of the data. Several methods have been proposed in the literature for improving performance of the k-means clustering algorithm. Principal Component Analysis (PCA) is an important approach to unsupervised dimensionality reduction technique. This paper proposed a method to make the algorithm more effective and efficient by using PCA and modified k-means.

Provided by: Dr.N.G.P. Institute of Technology Topic: Big Data Date Added: Feb 2011 Format: PDF

Find By Topic