Association for Computing Machinery
In recent years, spam email has become a major tool for criminals to conduct illegal business on the Internet. Therefore, in this paper, the authors describe a new research approach that uses data mining techniques to study spam emails with the focus on law enforcement forensic analysis. After they retrieve useful attributes from spam emails, they use a connected components clustering algorithm to form relationships between messages. These initial clusters are then refined by using a weighted edges model where membership in the cluster requires the weight to exceed a chosen threshold.