Data Management

Optimized Method for Indexing the Hidden Web Data

Free registration required

Executive Summary

Published Methods of Indexing the hidden web database use different indexing techniques like distributed indexing and noble indexing techniques. The goal of this paper is to extract the data from various hidden web databases and this data in integrated form will be stored in large repository with no duplicate records. Here, the authors propose an optimized method for indexing the hidden web database. This research uses Map-Reduce Framework for indexing the Data downloaded by the Siphone++. Basically, the idea behind using this Map-Reduce framework as Indexer is to make strong links between the WebPages using clustering of nodes.

  • Format: PDF
  • Size: 341.43 KB