International Journal of Emerging Science and Engineering (IJESE)
Record linkage is an important process in data integration; it is used to merge and match records, and to remove duplicates, across several databases that refer to the same entities. Deduplication is the process of removing duplicate records within a single database. In recent years, data cleaning and standardization have become important steps in data mining tasks. Because of the complexity of today's databases, finding matching records in a single database is a crucial task. Indexing techniques are used to implement record linkage and deduplication efficiently.
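
To illustrate why indexing makes deduplication efficient, the following is a minimal sketch of standard blocking, one common indexing technique. The abstract does not name a specific technique, so the choice of blocking, the field names (`surname`, `postcode`), the blocking key, and the function names are all assumptions made for illustration. Records are grouped by a key, and only records that share a key are compared, which avoids comparing every record with every other record.

```python
from collections import defaultdict
from itertools import combinations


def blocking_key(record):
    # Hypothetical key: first three letters of the surname plus the postcode.
    return (record["surname"][:3].lower(), record["postcode"])


def candidate_pairs(records):
    # Group record indices by blocking key.
    blocks = defaultdict(list)
    for idx, rec in enumerate(records):
        blocks[blocking_key(rec)].append(idx)

    # Only records inside the same block become candidate pairs.
    pairs = set()
    for ids in blocks.values():
        pairs.update(combinations(ids, 2))
    return pairs


# Illustrative records (made up for this sketch).
records = [
    {"surname": "Smith", "postcode": "2000"},
    {"surname": "Smithe", "postcode": "2000"},
    {"surname": "Jones", "postcode": "2000"},
    {"surname": "Smith", "postcode": "3000"},
]

pairs = candidate_pairs(records)
```

With four records, an exhaustive comparison would examine six pairs; under this assumed key only the first two records fall into the same block, so a single candidate pair is passed on to the detailed comparison step. The trade-off is that a poorly chosen key can place true duplicates in different blocks, where they are never compared.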