Rank-Aware Crawling of Hidden Web Sites

A growing amount of valuable information on the Web is stored in online databases and is accessible only after a user issues a query through a search interface. Such information is collectively called the "Hidden Web" and is mostly inaccessible to traditional search engine crawlers, which discover pages by following links. Since the only way to reach Hidden Web pages is to submit queries to Hidden Web sites, previous work has focused on automatically generating queries that incrementally retrieve and cover as much of a Hidden Web site as possible.
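To make the idea of query-based crawling concrete, here is a minimal sketch of a greedy crawl loop over a toy in-memory site. Everything in it is an assumption for illustration: the `TOY_SITE` data, the `search` and `crawl` helpers, and the "most frequent harvested term" heuristic are hypothetical and are not taken from the paper or from the previous work it refers to.

```python
from collections import Counter

# Hypothetical toy "Hidden Web" site: documents are reachable only via search().
TOY_SITE = {
    1: "rank aware crawling of hidden web",
    2: "hidden web databases and search interfaces",
    3: "query generation for deep web crawling",
    4: "databases need query optimization",
}


def search(site, keyword):
    """Simulate the site's search interface: return ids of matching documents."""
    return {doc_id for doc_id, text in site.items() if keyword in text.split()}


def crawl(site, seed, max_queries):
    """Issue a query, harvest terms from new results, then pick the next query."""
    retrieved = set()       # document ids downloaded so far
    issued = []             # queries already sent, in order
    term_counts = Counter() # number of retrieved documents containing each term
    query = seed
    while query is not None and len(issued) < max_queries:
        issued.append(query)
        new_docs = search(site, query) - retrieved
        retrieved |= new_docs
        for doc_id in new_docs:
            term_counts.update(set(site[doc_id].split()))
        # Assumed heuristic: next query is the most frequent term not yet issued
        # (ties broken by taking the lexicographically largest term).
        candidates = [(n, t) for t, n in term_counts.items() if t not in issued]
        query = max(candidates)[1] if candidates else None
    return retrieved, issued
```

Calling `crawl(TOY_SITE, "hidden", 10)` starts from the seed keyword and keeps issuing harvested terms until the query budget runs out or no unissued terms remain. A real crawler would replace `search` with requests to the site's search form and would likely use a more careful query-selection policy.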

Provided by: University of Athens; Topic: Software; Date Added: Jun 2011; Format: PDF
