The International Journal of Innovative Research in Computer and Communication Engineering
Systems providing secured data storage are now in greater demand. These systems provide data storage in a cost-effective manner. But a situation may arise, when the data storage consists of large amount of duplicate and redundant data. These duplicate records may occupy more space and access time. Hence, there is a need of banishing the duplicate records. Eliminating the duplicate records seems to be an easy task but requires a lot of work to do because the duplicate records don't share any common key. Sometimes, errors occur as a result of transcription errors or incomplete information, lack of standard formats, or any combination of these errors.