Spam Filtering With Several Novel Bayesian Classifiers
This paper presents spam filtering with three novel bayesian classification methods: Aggregating One-Dependence Estimators (AODE), Hidden Naïve Bayes (HNB), Locally Weighted learning with Naïve Bayes (LWNB). Other four traditional classifiers: Naïve Bayes, k Nearest Neighbor (kNN), Support Vector Machine (SVM), C4.5 are also performed for comparison. Four feature selection methods: Gain Ratio, Information Gain, Symmetrical Uncertainty and ReliefF, are used to select relevant words for spam filtering. Results of experiments on two corpora show the promising capabilities of bayesian classifiers for spam filtering, especial for that of AODE.