Inverted Indexing in Big Data Using Hadoop Multiple Node Cluster

An inverted index is an efficient, standard data structure that is well suited to search over very large data sets. Much of this data is unstructured and does not fit into traditional database categories. Processing it at scale needs a distributed framework such as Hadoop, where computational resources can easily be shared and accessed. A search engine is to be implemented in Hadoop over millions of Wikipedia documents, using an inverted index to make search operations more efficient.
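
To make the data structure concrete, the following is a minimal single-process sketch in Python of how an inverted index can be built in a map/reduce style and then queried. It is not the paper's implementation and does not use the Hadoop API; the function names (map_phase, reduce_phase, build_index, search), the tokenization rule, and the three sample documents are all assumptions chosen for illustration.

```python
import re
from collections import defaultdict


def map_phase(doc_id, text):
    """Emit one (term, doc_id) pair per distinct term in a document."""
    # Assumed tokenization: lowercase runs of letters and digits.
    terms = set(re.findall(r"[a-z0-9]+", text.lower()))
    return [(term, doc_id) for term in terms]


def reduce_phase(pairs):
    """Group document ids by term into sorted posting lists."""
    postings = defaultdict(set)
    for term, doc_id in pairs:
        postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def build_index(docs):
    """Run the map step over every document, then reduce the emitted pairs."""
    pairs = []
    for doc_id, text in docs.items():
        pairs.extend(map_phase(doc_id, text))
    return reduce_phase(pairs)


def search(index, query):
    """Return ids of documents that contain every term in the query."""
    terms = re.findall(r"[a-z0-9]+", query.lower())
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))
    return sorted(result)


# Hypothetical sample documents, keyed by document id.
docs = {
    1: "Hadoop stores data in HDFS",
    2: "An inverted index maps terms to documents",
    3: "Hadoop can build an inverted index",
}
index = build_index(docs)
hadoop_hits = search(index, "hadoop")
phrase_hits = search(index, "inverted index")
```

A lookup touches only the posting lists of the query terms rather than scanning every document, which is why the structure suits search over large collections. In a distributed setting the map step would run in parallel across nodes and the framework would group the emitted pairs by term before the reduce step; here that grouping is done in memory.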

Provided by: SAI Consulting
Topic: Data Management
Date Added: Dec 2013
Format: PDF
