A Generic Statistical Machine Learning and Data Mining Framework for Record Classification and Linkage

Provided by: AICIT
Topic: Big Data
Format: PDF
Quality of data residing in a database gets degraded and leads to misinterpretation due to a multitude of factors. Such factors vary from poor database design, lack of standards for recording to typing mistakes, leading to redundant information, for example. In such situations, it is important to identify duplicates and merge them into a single entity. This process is known as record linkage. Although numerous attempts are being made to address the issue of duplicate identification and record linkage, many inherent drawbacks are found in those approaches.

Find By Topic