Cooperative Pipelined Regeneration in Distributed Storage Systems
In distributed storage systems, a substantial volume of data is spread across a large number of storage nodes. To maintain data integrity, when existing storage nodes fail, lost data are regenerated at replacement nodes. Regenerating multiple data losses in batches can reduce bandwidth consumption. However, existing schemes achieve lower bandwidth consumption only by involving a large number of participating nodes. In this paper, the authors propose a cooperative pipelined regeneration process that regenerates multiple data losses cooperatively with far fewer participating nodes. They show that cooperative pipelined regeneration not only maintains optimal data integrity but also further reduces bandwidth consumption.