Machine Learning Based Annotating Search Results from Web Databases
Deep web is a database based, i.e., for many search engines, data encoded in the returned result pages come from the underlying structured databases. Such type of search engines is often referred as Web DataBases (WDB). A typical result page returned from a WDB has multiple Search Result Records (SRRs). Unfortunately, the semantic labels of data units are often not provided in result pages. Having semantic labels for data units is not only important for the above record linkage task, but also for storing collected SRRs into a database table.