Improved K-Means Clustering Technique Using Distance Determination Approach
Powerful systems for collecting data and managing it in large databases are in place in all large and mid-range organizations. The value of raw data (collected over a long time) is on the ability to extract high-level information: information useful for decision support, for exploration, and for better understanding of the phenomena generating the data. Traditionally this task of extracting information was done with the help of analysis where one or more analysts with the help of statistical techniques provide summaries and generate reports. Such an approach fails as the volume and dimensionality of the data increase.