Date Added: Nov 2009
The standard method for combating spam, either in email or on the web, is to train a classifier on manually labeled instances. As the spammers change their tactics, the performance of such classifiers tends to decrease over time. Gathering and labeling more data to periodically retrain the classifier is expensive. This paper presents a method based on an ensemble of classifiers that can detect when its performance might be degrading and retrain itself, all without manual intervention.