Big Data

Automatic Induction of Rule Based Text Categorization

Date Added: Dec 2010
Format: PDF

The automated categorization of texts into predefined categories has witnessed a booming interest in the last 10 years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this problem is based on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of preclassified documents, the characteristics of the categories. This paper describes a novel method for the automatic induction of rule-based text classifiers.