An Efficient way of Record Linkage System and Deduplication using Indexing techniques, Classification and FEBRL Framework

Provided by: International Journal of Emerging Science and Engineering (IJESE)
Topic: Data Management
Format: PDF
Record linkage is an important process in data integration, which is used in merging, matching and duplicate removal from several databases that refer to the same entities. Deduplication is the process of removing duplicate records in a single database. In recent years, data cleaning and standardization becomes an important process in data mining task. Due to complexity of today's database, finding matching records in single database is a crucial one. Indexing techniques are used to efficiently implement record linkage and deduplication.

Find By Topic