Extraction of Template Using Clustering From Heterogeneous Web Documents

Provided by: International Journal of Computer Applications
Topic: Data Management
Format: PDF
In general, a common template or layout is used to generate a set of pages on a website. For example, Google Books lays out details such as the author name, book name, and reviews or comments in the same way on all of its book pages. A database supplies the different values used to generate each page. This paper studies the problem of automatically extracting those database values from web pages without any human input. A template is defined as the framework that describes how the values are inserted into the pages.
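
To illustrate the idea of a template with value slots, here is a minimal sketch in Python. It is not the paper's method: it skips the clustering step named in the title and assumes the pages are already known to share one template and align token for token. The function names and the sample pages are hypothetical.

```python
import re


def tokenize(html):
    """Split a page into tags and the text runs between them."""
    tokens = re.findall(r"<[^>]+>|[^<]+", html)
    return [t.strip() for t in tokens if t.strip()]


def split_template_and_values(pages):
    """Separate shared template tokens from per-page values.

    Assumption: every page was generated from the same template and
    yields the same number of tokens, so position i in one page
    corresponds to position i in every other page.
    """
    token_lists = [tokenize(p) for p in pages]
    length = len(token_lists[0])
    if any(len(tokens) != length for tokens in token_lists):
        raise ValueError("pages do not align token for token")

    template = []                    # None marks a value slot
    values = [[] for _ in pages]     # one list of extracted values per page
    for column in zip(*token_lists):
        if all(tok == column[0] for tok in column):
            # Identical on every page: treat as part of the template.
            template.append(column[0])
        else:
            # Differs between pages: treat as a database value.
            template.append(None)
            for row, tok in zip(values, column):
                row.append(tok)
    return template, values


# Hypothetical book pages that share one layout.
pages = [
    "<h1>Dune</h1><p>Author: </p><span>Frank Herbert</span>",
    "<h1>Emma</h1><p>Author: </p><span>Jane Austen</span>",
]
template, values = split_template_and_values(pages)
print(template)
print(values)
```

In this sketch, tokens that are identical across all pages are kept as the template, and positions that differ are treated as slots filled from the database. Real pages rarely align this neatly, which is why grouping heterogeneous documents by template is a problem in its own right.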
