Experiments on Element and Document Statistics for XML Retrieval

Download Now Free registration required

Executive Summary

This paper presents an information retrieval model on XML documents based on tree matching. Queries and documents are represented by extended trees. An extended tree is built starting from the original tree, with additional weighted virtual links between each node and its indirect descendants allowing to directly reach each descendant. Therefore only one level separates between each node and its indirect descendants. This allows to compare the user query and the document with flexibility and with respect to the structural constraints of the query. The content of each node is very important to decide whether a document element is relevant or not, thus the content should be taken into account in the retrieval process.

  • Format: PDF
  • Size: 357.2 KB