Networking

A Framework for Hierarchical Clustering Based Indexing in Search Engines

Free registration required

Executive Summary

Granting efficient and fast accesses to the index is a key issue for performances of Web Search Engines. In order to enhance memory utilization and favor fast query resolution, WSEs use Inverted File (IF) indexes that consist of an array of the posting lists where each posting list is associated with a term and contains the term as well as the identifiers of the documents containing the term. Since the document identifiers are stored in sorted order, they can be stored as the difference between the successive documents so as to reduce the size of the index.

  • Format: PDF
  • Size: 370.85 KB