Business Intelligence

Building Query Optimizers for Information Extraction: The SQoUT Project

Free registration required

Executive Summary

Text documents often embed data that is structured in nature. This structured data is increasingly exposed using information extraction systems, which generate structured relations from documents, introducing an opportunity to process expressive, structured queries over text databases. This paper discusses the SQoUT project, which focuses on processing structured queries over relations extracted from text databases. The authors show how, in the extraction-based scenario, query processing can be decomposed into a sequence of basic steps: retrieving relevant text documents, extracting relations from the documents, and joining extracted relations for queries involving multiple relations.

  • Format: PDF
  • Size: 205.5 KB