Data Extraction and Web page Categorization using Text Mining

Provided by: International Journal of Application or Innovation in Engineering & Management (IJAIEM)
Topic: Big Data
Format: PDF
In this paper, the authors use boilerplate code to classify the individual text elements in a web page. They extend it to extract the date as well as the content of a web page through text analysis. Classification of web page content is likewise essential to many tasks in web information retrieval, such as maintaining web directories, organizing web information, and focused crawling. The authors also improve the algorithm and code for web page categorization.
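
The abstract does not give the authors' algorithm. As a rough illustration of what classifying individual text elements can look like, the following minimal sketch labels each text block of a page as "content" or "boilerplate" from two shallow features, word count and link density. The class `BlockCollector`, the function `classify_blocks`, the tag list, and both thresholds are assumptions made for this example and are not taken from the paper.

```python
from html.parser import HTMLParser

# Tags treated as block boundaries (an assumption for this sketch).
BLOCK_TAGS = {"p", "div", "li", "td", "h1", "h2", "h3"}


class BlockCollector(HTMLParser):
    """Splits a page into text blocks and counts words and linked words."""

    def __init__(self):
        super().__init__()
        self.blocks = []      # (text, word_count, link_word_count)
        self._text = []
        self._words = 0
        self._link_words = 0
        self._in_link = 0

    def flush(self):
        # Close the current block and store it if it holds any text.
        text = " ".join(self._text)
        if text:
            self.blocks.append((text, self._words, self._link_words))
        self._text, self._words, self._link_words = [], 0, 0

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self.flush()
        elif tag == "a":
            self._in_link += 1

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.flush()
        elif tag == "a" and self._in_link:
            self._in_link -= 1

    def handle_data(self, data):
        words = data.split()
        if not words:
            return
        self._text.append(" ".join(words))
        self._words += len(words)
        if self._in_link:
            self._link_words += len(words)


def classify_blocks(html, min_words=10, max_link_density=0.3):
    """Label each block; the thresholds are illustrative, not tuned values."""
    collector = BlockCollector()
    collector.feed(html)
    collector.close()
    collector.flush()
    labeled = []
    for text, words, link_words in collector.blocks:
        density = link_words / words
        is_content = words >= min_words and density <= max_link_density
        labeled.append(("content" if is_content else "boilerplate", text))
    return labeled
```

Short, link-heavy blocks such as navigation bars come out as boilerplate under this heuristic, while longer runs of plain text come out as content. A categorization step could then operate on the content blocks only.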
