Mining Structured Objects (Data Records) Based on Maximum Region Detection by Text Content Comparison From Website

Provided by: The International Journals of Engineering & Sciences (IJENS)
Topic: Data Management
Format: PDF
At present, a great amount of information on the web is presented in regularly structured objects. These are known as data records. A list of such objects in a web page often describes a list of similar items; such as, a list of products to provide their value-added services. Therefore, it has become increasingly necessary to develop an effective process for extracting information from them. In this paper, the authors present a more effective method to perform the task. The proposed method is able to mine data records not only from a single web page but also from an entire web site.

Find By Topic