International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering (IJAREEIE)
Data deduplication describes approach that reduces the storage capacity needed to store data or the data has to be transfer on the network. Cloud storage has received increasing attention from industry as it offers infinite storage resources that are available on demand. Source deduplication is useful in cloud backup that saves network bandwidth and reduces network space deduplication is the process by breaking up an incoming stream into relatively large segments and deduplicating each segment against only a few of the most similar previous segments. To identify similar segments use block index technique.