In-Network Redundancy Generation for Opportunistic Speedup of Data Backup
Erasure coding is a storage-efficient alternative to replication for achieving reliable data backup in distributed storage systems. During the storage process, traditional erasure codes require a single source node to create all the redundant data and upload it to the different storage nodes. However, such a source node may have limited communication and computation capabilities, which constrain the throughput of the storage process. Moreover, the source node and the different storage nodes might not be able to send and receive data simultaneously (e.g., nodes might be busy in a datacenter setting, or simply be offline in a peer-to-peer setting), which can further threaten the efficacy of the overall storage process.
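As a rough illustration of the source-node bottleneck described above, the following sketch compares how much data a single source must upload under replication and under an (n, k) erasure code. The function names, the object size, the bandwidth, and the parameters (3 replicas, a (16, 12) code) are illustrative assumptions, not values taken from this work. The sketch assumes an MDS-style code in which the object is split into k equal blocks and encoded into n blocks of the same size, and it assumes that the source itself uploads every replica or encoded block.

```python
from fractions import Fraction


def replication_upload(object_size, replicas):
    """Data the source uploads if it sends every replica itself (assumption)."""
    if replicas < 1:
        raise ValueError("need at least one replica")
    return object_size * replicas


def erasure_upload(object_size, n, k):
    """Data the source uploads for an (n, k) code: n blocks of size object_size / k."""
    if not 0 < k <= n:
        raise ValueError("need 0 < k <= n")
    return Fraction(object_size * n, k)


def upload_time(total_data, source_bandwidth):
    """Lower bound on storage time when the source uplink is the only limit."""
    if source_bandwidth <= 0:
        raise ValueError("bandwidth must be positive")
    return Fraction(total_data) / source_bandwidth


if __name__ == "__main__":
    size_mb = 1200          # hypothetical object size, in MB
    uplink_mb_per_s = 100   # hypothetical source uplink, in MB/s

    rep = replication_upload(size_mb, replicas=3)
    ec = erasure_upload(size_mb, n=16, k=12)

    print("replication upload (MB):", rep)                          # 3600
    print("erasure-code upload (MB):", ec)                          # 1600
    print("replication time (s):", upload_time(rep, uplink_mb_per_s))  # 36
    print("erasure-code time (s):", upload_time(ec, uplink_mb_per_s))  # 16
```

Under these assumptions the erasure code uploads less data than replication, yet all n/k times the object size still flows through the source's uplink, so the source's bandwidth and availability bound the throughput of the storage process.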