Data Management

Efficient Data Clustering Algorithms: Improvements Over Kmeans

Download Now Free registration required

Executive Summary

In this paper the authors present a new approach to overcome one of the most known disadvantages of the well-known Kmeans clustering algorithm. The problems of classical Kmeans are such as the problem of random initialization of prototypes and the requirement of predefined number of clusters in the dataset. Randomly initialized prototypes can often yield results to converge to local rather than global optimum. A better result of Kmeans may be obtained by running it many times to get satisfactory results.

  • Format: PDF
  • Size: 1357.68 KB