Association for Computing Machinery
Hardware failures in current data centers are common partly due to the higher data scales supported. Data replication is the common approach for improving availability. However, mostly static replication approaches have been proposed, i.e. the number of replicas and their locations are fixed. More-over, the geographical diversity of data locations has not explicitly been considered. In this paper, the authors propose a cost-efficient replication scheme across data centers that dynamically adapt the number of replicas employed per partition to the query load, while maintaining availability guarantees in case of failures.