Lexical Feature Based Phishing URL Detection Using Online Learning
Phishing is a form of cybercrime where spammed emails and fraudulent websites entice victims to provide sensitive information to the phishes. The acquired sensitive information is subsequently used to steal identities or gain access to money. This paper explores the possibility of utilizing confidence weighted classification combined with content based phishing URL detection to produce a dynamic and extensible system for detection of present and emerging types of phishing domains. The authors' system is capable of detecting emerging threats as they appear and subsequently can provide increased protection against zero hour threats unlike traditional blacklisting techniques which function reactively.