Download now Free registration required
High ranking of a Web site in search engines can be directly correlated to high revenues. This amplifies the phenomenon of Web spamming which can be defined as preparing or manipulating any features of Web documents or hosts to mislead search engines' ranking algorithms to gain an undeservedly high position in search results. Web spam remarkably deteriorates the information quality available on the Web and thus affects the whole Web community including search engines. The struggle between search engines and spammers is ongoing: both sides apply increasingly sophisticated techniques and counter-techniques against each other. This paper first presents a general background concerning the Web spam phenomenon. They then explain why the machine learning approach is so attractive for Web spam combating.
- Format: PDF
- Size: 722.83 KB