Distributed Lucene: A Distributed Free Text Index for Hadoop

Source: Hewlett-Packard

This paper describes Distributed Lucene, a parallel, distributed free-text index written at HP Labs. It is based on two Apache open-source projects, Lucene and Hadoop, and was written to gain a better understanding of the Apache Hadoop architecture, which is derived from work at Google on building large, distributed, high-availability systems from commodity components.
Format: PDF
Size: 172.30
Date: Jun 2008