A XML Keyword Search Algorithm Based on MapReduce

Provided by: AICIT
Topic: Big Data
Format: PDF
Growth in the number of internet subscribers and the expansion of XML application fields make massive XML data processing a future trend in XML research. The Hadoop Distributed File System (HDFS) can be deployed on a cluster of inexpensive personal computers running the MapReduce framework. It divides a document into a series of data blocks distributed across the nodes for parallel computation, which makes it suitable for computing on and processing massive data. An XML keyword search algorithm adapted to large-scale datasets in HDFS is proposed to achieve keyword search over massive numbers of XML documents; it covers XML data partitioning, encoding, indexing, and searching for the Smallest Lowest Common Ancestor (SLCA).
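
To make the SLCA step concrete, here is a minimal single-machine sketch in Python. It is not the paper's algorithm: the abstract does not specify the encoding or index layout, so this sketch assumes Dewey-style labels (a tuple of child positions from the root) and a simple keyword-to-labels inverted index. The function names `dewey_index`, `map_phase`, `reduce_phase`, and `slca`, and the `SAMPLE` document, are hypothetical and exist only for illustration. The map and reduce functions mimic the MapReduce pattern in plain Python rather than calling any Hadoop API.

```python
from collections import defaultdict
import xml.etree.ElementTree as ET

# Hypothetical sample document, used only for illustration.
SAMPLE = (
    "<bib>"
    "<book><title>XML</title><author>Smith</author></book>"
    "<book><title>Hadoop</title><author>Jones</author></book>"
    "<article><title>XML</title><author>Jones</author></article>"
    "</bib>"
)

def dewey_index(xml_text):
    """Build an inverted index: keyword -> list of Dewey labels (tuples).

    Assumption: a node matches a keyword if the keyword appears in its
    tag name or its leading text. Tail text is ignored in this sketch.
    """
    root = ET.fromstring(xml_text)
    index = defaultdict(list)

    def visit(node, label):
        words = set((node.tag + " " + (node.text or "")).lower().split())
        for word in words:
            index[word].append(label)
        for position, child in enumerate(node):
            visit(child, label + (position,))

    visit(root, (0,))
    return index

def map_phase(index, keywords):
    """Emit (ancestor_or_self_label, keyword) for every keyword match."""
    for keyword in keywords:
        for label in index.get(keyword, []):
            for depth in range(1, len(label) + 1):
                yield label[:depth], keyword

def reduce_phase(pairs, keywords):
    """Group by label; keep labels whose subtree covers all keywords."""
    seen = defaultdict(set)
    for label, keyword in pairs:
        seen[label].add(keyword)
    wanted = set(keywords)
    return {label for label, found in seen.items() if found == wanted}

def slca(index, keywords):
    """Return common ancestors that have no common-ancestor descendant."""
    keywords = [k.lower() for k in keywords]
    common = reduce_phase(map_phase(index, keywords), keywords)
    return sorted(
        c for c in common
        if not any(len(o) > len(c) and o[:len(c)] == c for o in common)
    )
```

For the sample document, `slca(dewey_index(SAMPLE), ["XML", "Smith"])` returns `[(0, 0)]`, the label of the first `book` element: the root also contains both keywords, but it is discarded because a descendant already covers them. Emitting every ancestor prefix is simple but produces many intermediate pairs; a real large-scale implementation would need a more economical scheme, which the abstract does not describe.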
