Distributed Lucene : A Distributed Free Text Index for Hadoop
Source: Hewlett-Packard
This paper describes a parallel, distributed free text index written at HP Labs called Distributed Lucene. Distributed Lucene is based on two Apache open source projects, Lucene and Hadoop. It was written to gain a better understanding of the Apache Hadoop architecture, which is derived from work at Google on creating large distributed, high availability systems from commodity components.
| Format: | Size: | 172.30 | |
| Date: | Jun 2008 |
People who downloaded this item also downloaded
- Building a Compute Grid on Apache Hadoop Using Cloud Computing: A Case Study at the University of Pretoria
- Evaluating Storage Technologies for Virtual Server Environments
- Java Garbage Collection Characteristics and Tuning Guidelines for Apache Hadoop TeraSort Workload
- Behind the Cloud - A Closer Look at the Infrastructure of Cloud and Grid Computing
- Performance Evaluation of Hadoop on Virtual Machines



