A New Algorithm to Find Best Splitting Criteria for Web Page Classification

Date Added: Nov 2011
Format: PDF

The Web is popular because of the vast amount of data available on it. It contains several billion HTML documents, pictures, and other multimedia files. These documents reside on Internet servers, and information is exchanged over HTTP. An individual HTML file with a unique address is called a web page, and a collection of such pages sharing the same electronic address is called a web site. The title is the name of a web page or web site, and the quality of a web page is judged by its title. Giving a web page an appropriate, descriptive title is therefore very important.