Date Added: Jan 2012
This paper addresses the problem of data placement, indexing, and querying large XML data repositories distributed over an existing P2P service infrastructure. The authors' architecture scales gracefully to the network and data sizes, is fully distributed, fault tolerant and self-organizing, and handles complex queries efficiently, even those queries that use full-text search. Their framework for indexing distributed XML data is based on both meta-data information and textual content. They introduce a novel data synopsis structure to summarize text that correlates textual with positional information and increases query routing precision.