High Dimensional Data Clustering Using Fast Cluster Based Feature Selection

Download Now
Provided by: Creative Commons
Topic: Big Data
Format: PDF
Feature selection involves identifying a subset of the most useful features that produces compatible results as the original entire set of features. A feature selection algorithm may be evaluated from both the efficiency and effectiveness points of view. While the efficiency concerns the time required to find a subset of features, the effectiveness is related to the quality of the subset of features. Based on these criteria, a fast clustering-based feature selection algorithm is proposed and experimentally evaluated in this paper. In the first step, features are divided into clusters by using graph-theoretic clustering methods. In the second step, the most representative feature that is strongly related to target classes is selected from each cluster to form a subset of features.
Download Now

Find By Topic