GHOST: GPGPU-Offloaded High Performance Storage I/O Deduplication for Primary Storage System
Data de-duplication has been an effective way to eliminate redundant data mainly for backup storage systems. Since the recent primary storage systems in cloud services are expected to have the redundancy, the de-duplication technique can also bring significant cost saving for the primary storage. However, the primary storage system requires high performance requirement about several GBs per second. Most conventional de-duplication techniques targeted the performance requirement of 200-300MB/s. In an attempt to achieve a high performance storage de-duplication system at the primary storage, the authors thoroughly analyze the performance bottleneck of previous de-duplication systems to enhance the system to meet the requirement of the primary storage.