Combining Tag and Value Similarity for Data Extraction and Alignment

Provided by: PicoSoft Technologies
Topic: Big Data
Format: PDF
Web databases generate query result pages in response to a user's query. Many applications, such as data integration, which must cooperate with multiple web databases, need to extract the data from these query result pages automatically. The authors present a data extraction and alignment method called CTVS, which combines both tag and value similarity. CTVS automatically extracts data from query result pages by first identifying and segmenting the Query Result Records (QRRs) in the pages, and then aligning the segmented QRRs into a table in which the data values of the same attribute are placed in the same column.
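
To make the alignment idea concrete, here is a minimal Python sketch of how tag similarity and value similarity might be combined to place record fields into table columns. It is not the CTVS algorithm from the paper. The field representation as (tag path, value) pairs, the function names, the 0.5 weight, the 0.6 threshold, and the greedy matching against the first record are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def value_type(value):
    """Return a coarse data type for a value: 'number' or 'text' (assumed scheme)."""
    stripped = value.replace("$", "").replace(",", "").strip()
    try:
        float(stripped)
        return "number"
    except ValueError:
        return "text"


def combined_similarity(field_a, field_b, tag_weight=0.5):
    """Weighted mix of tag-path similarity and value-type similarity."""
    tag_a, val_a = field_a
    tag_b, val_b = field_b
    # Tag similarity: string similarity of the two tag paths.
    tag_sim = SequenceMatcher(None, tag_a, tag_b).ratio()
    # Value similarity: 1.0 if both values share a coarse data type, else 0.0.
    value_sim = 1.0 if value_type(val_a) == value_type(val_b) else 0.0
    return tag_weight * tag_sim + (1 - tag_weight) * value_sim


def align_records(records, threshold=0.6):
    """Align records into rows, using the first record as the column template.

    Each field goes to the free column whose template field it resembles most;
    fields that match no column above the threshold are dropped, and columns
    with no matching field stay None.
    """
    template = records[0]
    table = []
    for record in records:
        row = [None] * len(template)
        for field in record:
            best_col, best_sim = None, threshold
            for col, ref in enumerate(template):
                if row[col] is not None:
                    continue  # column already filled for this record
                sim = combined_similarity(field, ref)
                if sim > best_sim:
                    best_col, best_sim = col, sim
            if best_col is not None:
                row[best_col] = field[1]
        table.append(row)
    return table


# Hypothetical records: the second one has no author field.
records = [
    [("td/a", "Data Mining"), ("td/span", "Han"), ("td/b", "$59.99")],
    [("td/a", "Web Data"), ("td/b", "$35.00")],
]
for row in align_records(records):
    print(row)
```

In this toy input the second record lacks an author, so its price lands in the third column and the second column stays empty, which is the kind of misalignment that a purely positional approach would get wrong.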
