Big Data

Determining Cluster Boundaries Using Particle Swarm Optimization

Date Added: Jun 2009
Format: PDF

Self-Organizing Map (SOM) is a well known data reduction technique used in data mining. Data visualization can reveal structure in data sets that is otherwise hard to detect from raw data alone. However, interpretation through visual inspection is prone to errors and can be very tedious. There are several techniques for the automatic detection of clusters of code vectors found by SOMs, but they generally do not take into account the distribution of code vectors; this may lead to unsatisfactory clustering and poor definition of cluster boundaries, particularly where the density of data points is low. In this paper, the authors propose the use of a generic Particle Swarm Optimization (PSO) algorithm for finding cluster boundaries directly from the code vectors obtained from SOMs.