Tuning Schema Matching Systems using Parallel Genetic Algorithms on GPU

Executive Summary

Most recent schema matching systems combine multiple components, each of which applies a particular matching technique and exposes several knobs. This multi-component design creates a tuning problem: domain users must determine which components to execute and how to adjust the knobs of those components (e.g., thresholds and weights). In this paper, the authors present an approach that tunes schema matching systems automatically using genetic algorithms. A given schema S is matched against generated matching scenarios for which the ground-truth matches are known, and the search finds a configuration that effectively improves the performance of matching S against real schemas.
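
The tuning loop described in the summary can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy schema, the name-perturbation rules used to generate scenarios, the two similarity components, the two-gene configuration (a component weight and an acceptance threshold), the F1-based fitness, and the genetic operators. It also runs sequentially on the CPU; the fitness evaluations are independent of one another, which is the part a parallel implementation would distribute.

```python
import random
from difflib import SequenceMatcher

# Hypothetical schema S (attribute names only), used purely for illustration.
SCHEMA_S = ["customer_name", "order_date", "ship_address", "total_price", "phone_number"]

def perturb(name, rng):
    """Create a synthetic variant of an attribute name; the true match stays known."""
    choice = rng.randrange(3)
    if choice == 0:  # drop vowels after the first character
        return name[0] + "".join(c for c in name[1:] if c not in "aeiou")
    if choice == 1:  # abbreviate each token to four characters
        return "_".join(tok[:4] for tok in name.split("_"))
    return "".join(name.split("_"))  # drop separators

def make_scenario(schema, rng):
    """Return (target_attributes, ground_truth) for one generated scenario."""
    truth = {attr: perturb(attr, rng) for attr in schema}
    targets = list(truth.values())
    rng.shuffle(targets)
    return targets, truth

def edit_sim(a, b):
    return SequenceMatcher(None, a, b).ratio()

def prefix_sim(a, b):
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n / max(len(a), len(b))

def match(schema, targets, config):
    """Toy two-component matcher. config = (weight, threshold), both in [0, 1]."""
    weight, threshold = config
    predicted = {}
    for attr in schema:
        scored = [(weight * edit_sim(attr, t) + (1 - weight) * prefix_sim(attr, t), t)
                  for t in targets]
        score, best = max(scored)
        if score >= threshold:  # accept only sufficiently confident matches
            predicted[attr] = best
    return predicted

def f1(predicted, truth):
    tp = sum(1 for a, t in predicted.items() if truth[a] == t)
    if tp == 0:
        return 0.0
    precision, recall = tp / len(predicted), tp / len(truth)
    return 2 * precision * recall / (precision + recall)

def fitness(config, schema, scenarios):
    """Mean F1 of the configuration over all generated scenarios."""
    return sum(f1(match(schema, targets, config), truth)
               for targets, truth in scenarios) / len(scenarios)

def tune(schema, generations=20, pop_size=12, seed=0):
    rng = random.Random(seed)
    scenarios = [make_scenario(schema, rng) for _ in range(5)]
    population = [(rng.random(), rng.random()) for _ in range(pop_size)]
    history = []  # best fitness seen in each generation
    for _ in range(generations):
        ranked = sorted(population, key=lambda c: fitness(c, schema, scenarios),
                        reverse=True)
        history.append(fitness(ranked[0], schema, scenarios))
        next_pop = ranked[:2]  # elitism: carry the two best over unchanged
        while len(next_pop) < pop_size:
            # Tournament selection: ranked is sorted, so the lowest index wins.
            p1 = ranked[min(rng.sample(range(pop_size), 3))]
            p2 = ranked[min(rng.sample(range(pop_size), 3))]
            child = (p1[0], p2[1]) if rng.random() < 0.5 else (p2[0], p1[1])
            # Gaussian mutation, clamped to the valid range.
            child = tuple(min(1.0, max(0.0, g + rng.gauss(0, 0.1))) for g in child)
            next_pop.append(child)
        population = next_pop
    best = max(population, key=lambda c: fitness(c, schema, scenarios))
    return best, fitness(best, schema, scenarios), history

if __name__ == "__main__":
    best, score, _ = tune(SCHEMA_S)
    print("weight=%.2f threshold=%.2f mean F1=%.2f" % (best[0], best[1], score))
```

Because the scenarios are generated from S itself, the correct matches are known without manual labeling, which is what makes the fitness computable. A real system would have many more knobs per component and a richer scenario generator than the three renaming rules assumed here.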

  • Format: PDF
  • Size: 489.71 KB