International Journal of Application or Innovation in Engineering & Management (IJAIEM)
In this paper, the authors use boilerplate code to classify the individual text elements in a web page. They extend it to extract the date as well as the content of a web page through a text analysis process. Classification of web page content is likewise essential to many tasks in web information retrieval, such as maintaining web directories, organizing web information, and focused crawling. The algorithm and code have also been improved for web page categorization.
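To make the idea of classifying individual text elements concrete, the following is a minimal sketch, not the authors' implementation. It assumes that a page is split into blocks at common block-level tags and that each block is labeled from two shallow text features, word count and link density. The names `BlockCollector` and `classify_blocks`, the tag set, and the thresholds `min_words` and `max_link_density` are hypothetical choices made for illustration only.

```python
from html.parser import HTMLParser

# Hypothetical set of tags treated as block boundaries (an assumption).
BLOCK_TAGS = {"p", "div", "li", "td", "h1", "h2", "h3", "h4", "h5", "h6"}
SKIP_TAGS = {"script", "style"}  # text inside these is never page content


class BlockCollector(HTMLParser):
    """Splits a page into text blocks and counts words and linked words."""

    def __init__(self):
        super().__init__()
        self.blocks = []      # finished blocks: (text, words, link_words)
        self._text = []       # words of the block being built
        self._link_words = 0  # words of the current block inside <a>
        self._in_link = 0     # nesting depth of <a> tags
        self._skip = 0        # nesting depth of script/style tags

    def _flush(self):
        # Close the current block, keeping it only if it has any text.
        if self._text:
            self.blocks.append(
                (" ".join(self._text), len(self._text), self._link_words)
            )
        self._text, self._link_words = [], 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip += 1
        elif tag in BLOCK_TAGS:
            self._flush()
        elif tag == "a":
            self._in_link += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip:
            self._skip -= 1
        elif tag in BLOCK_TAGS:
            self._flush()
        elif tag == "a" and self._in_link:
            self._in_link -= 1

    def handle_data(self, data):
        if self._skip:
            return
        words = data.split()
        self._text.extend(words)
        if self._in_link:
            self._link_words += len(words)

    def close(self):
        super().close()
        self._flush()  # emit any trailing text after the last tag


def classify_blocks(html, min_words=10, max_link_density=0.33):
    """Label each block 'content' or 'boilerplate' (illustrative thresholds)."""
    collector = BlockCollector()
    collector.feed(html)
    collector.close()
    results = []
    for text, words, link_words in collector.blocks:
        density = link_words / words  # words >= 1 for every stored block
        is_content = words >= min_words and density <= max_link_density
        results.append(("content" if is_content else "boilerplate", text))
    return results
```

Calling `classify_blocks(page_html)` returns a list of `(label, text)` pairs in document order. Under these assumptions, short or link-heavy blocks such as navigation bars and footers tend to be labeled as boilerplate, while longer blocks of running text are kept as content. Date extraction and page categorization, which the paper also addresses, are outside the scope of this sketch.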