International Journal of Computer Science and Information Technology & Security (IJCSITS)
Text classification also known as text categorization is the task of automatically allocating unlabeled documents into predefined categories. Text classification means allocating a document to one or more categories or classes. The ability to accurately perform a classification task depends on the representations of documents to be classified. Text representations transform the textural documents into a compact format. Text classification plays an important role in information mining, summarization, text recovery and question-answering. It uses several tools from Information Retrieval (IR) and machine learning.