SAM: A Semantic-Aware Multi-Tiered Source De-duplication Framework for Cloud Backup

Download Now
Provided by: Institute of Electrical & Electronic Engineers
Topic: Cloud
Format: PDF
Existing de-duplication solutions in cloud backup environment either obtain high compression ratios at the cost of heavy de-duplication overheads in terms of increased latency and reduced throughput, or maintain small de-duplication overheads at the cost of low compression ratios causing high data transmission costs, which results in a large backup window. In this paper, the authors present a Semantic-Aware Multi-tiered (SAM) source de-duplication framework that first combines the global file-level de-duplication and local chunk-level deduplication, and further exploits file semantics in each stage in the framework, to obtain an optimal tradeoff between the deduplication efficiency and de-duplication overhead and finally achieve a shorter backup window than existing approaches.
Download Now

Find By Topic