Feature Selection and Clustering Approcahes to the KNN Text Categorization
Automatic text classification is a discipline at the cross roads of information retrieval machine learning and computational linguistics and consists in the realization of text classifiers. (i.e.) software systems capable of assigning text to one or more categories or classes, from a pre-defined set. This paper will focus on the feature selection for, reducing the dimensionality of the vectors, after that the authors apply one pass clustering for group the related data's and then they apply classification technique like KNN for categorizations the data and finally evaluate the results by using precision, etc.