State-of-the-Art in String Similarity Search and Join

Download Now
Provided by: Tsihai
Topic: Big Data
Format: PDF
String similarity search and its variants are fundamental problems with many applications in areas such as data integration, data quality, computational linguistics, or bioinformatics. A plethora of methods have been developed over the last decades. Obtaining an overview of the state-of-the-art in this field is difficult, as results are published in various domains without much cross-talk, papers use different data sets and often study subtle variations of the core problems, and the sheer number of proposed methods exceeds the capacity of a single research group. In this paper, the authors report on the results of the probably largest benchmark ever performed in this field.
Download Now

Find By Topic