A Novel Class-Based Data Fusion Technique for Information Retrieval

Data fusion in information retrieval combines the results of multiple retrieval models or document representations. The success of a data fusion technique depends on the quality of its inputs: classical data fusion techniques fail to improve retrieval when the quality of the input results varies widely, from low to high. To tackle this problem, in this paper the authors address the high variation among retrieval strategies or document representations, which affects the combination of their outputs. Their investigation on the MALACH speech collection, in which different segment representations are available, shows that neither classical data fusion (CombSUM) nor its weighted version (WCombSum) improves retrieval.
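For readers unfamiliar with the two baselines named above, the following is a minimal sketch of CombSUM and a weighted variant, not the authors' implementation. It assumes each retrieval run is a mapping from document ID to score, that scores are min-max normalized per run before summation (a common choice, but an assumption here), and that the weights are supplied by the caller. The function names, document IDs, scores, and weights are all hypothetical.

```python
from collections import defaultdict


def min_max_normalize(scores):
    """Rescale one run's scores to [0, 1]; a run with constant scores maps to 0.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def comb_sum(runs, weights=None):
    """Fuse runs by summing normalized scores per document.

    With weights=None every run counts equally (CombSUM). With one weight
    per run, each run's scores are scaled first (a weighted CombSUM).
    Documents missing from a run contribute nothing for that run.
    """
    if weights is None:
        weights = [1.0] * len(runs)
    if len(weights) != len(runs):
        raise ValueError("need exactly one weight per run")
    fused = defaultdict(float)
    for run, weight in zip(runs, weights):
        for doc, score in min_max_normalize(run).items():
            fused[doc] += weight * score
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(fused.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical scores from two runs over the same query.
run_a = {"d1": 10.0, "d2": 5.0, "d3": 0.0}
run_b = {"d2": 2.0, "d3": 1.0, "d4": 0.0}

unweighted = comb_sum([run_a, run_b])
weighted = comb_sum([run_a, run_b], weights=[0.2, 0.8])

print([doc for doc, _ in unweighted])
print([doc for doc, _ in weighted])
```

In this toy example, shifting weight toward the second run moves d3 ahead of d1. Both variants treat each run as a whole, so a single weight per run cannot distinguish the queries or documents on which that run is reliable from those on which it is not.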