International Journal of Advance Research in Science and Engineering (IJARSE)
High-dimensional data often contain irrelevant or redundant features which slow down the mining process and cause difficulties in storage and retrieval. Feature selection is the process of selecting most relevant features from an entire set of features. The FAST (FAst clustering - based feature SelecTion algorithm) algorithm works in two steps. In the first step, features are divided into clusters by using graph theoretic clustering methods. In the second step, the most representative feature that is strongly related to target classes is selected from each cluster to form subset from each cluster to form a subset of features.