ChunkStash: Speeding Up Inline Storage Deduplication Using Flash Memory
Storage deduplication has received recent interest in the research community. In scenarios where the backup process has to complete within short time windows, inline deduplication can help to achieve higher backup throughput. In such systems, the method of identifying duplicate data, using disk based indexes on chunk hashes, can create throughput bottlenecks due to disk I/Os involved in index lookups. RAM prefetching and bloom-filter based techniques used by the researcher can avoid disk I/Os on close to 99% of the index lookups.