Download now Free registration required
The longstanding problem of automatic table interpretation still illudes people. Its solution would not only be an aid to table processing applications such as large volume table conversion, but would also be an aid in solving related problems such as information extraction, semantic annotation, and semi-structured data management. In this paper, the authors offer a solution for the common special case in which so-called sibling pages are available. The sibling pages the authors consider are pages on the hidden web, commonly generated from underlying databases. Their system compares them to identify and connect nonvarying components (category labels) and varying components (data values).
- Format: PDF
- Size: 881.2 KB