International Journal for Development of Computer Science & Technology (IJDCST)
Data clustering is a primary tool for understanding the structure of data sets. Its application domain includes machine learning, data mining, information retrieval, and pattern recognition etc. Clustering aims to categorize data into groups or clusters such that the data in the same cluster are more similar to each other than to those in different clusters. Although conventional algorithms includes k-means clustering and Expectation Maximization (EM) clustering, PAM etc. and different clustering ensemble approaches were used for clustering process they have limitations in handling unrelated entries in dataset resulting in a detrimental performance.