Provided by: Association for Computing Machinery
Topic: Big Data
Date Added: Mar 2008
In this paper the authors study the problem of evaluating continuous multi-way joins on top of Distributed Hash Tables (DHTs). They present a novel algorithm, called Recursive Join (RJoin), that takes into account various parameters crucial in a distributed setting i.e., network traffic, query processing load distribution, storage load distribution etc. The key idea of RJoin is incremental evaluation: as relevant tuples arrive continuously, a given multi-way join is rewritten continuously into a join with fewer join operators, and is assigned continuously to different nodes of the network.