TR2013-103

Matcher Composition Methods for Automatic Schema Matching


    •  Nikovski, D.; Esenther, A.; Ye, X.; Shiba, M.; Takayama, S., "Matcher Composition Methods for Automatic Schema Matching", International Conference on Enterprise Information Systems (ICEIS), {Cordeiro, J. and Maciaszek, L. A. and Filipe, J.}, Eds., DOI: 10.1007/978-3-642-40654-6_7, December 2013, vol. 141, pp. 108-123.
      BibTeX Download PDF
      • @inproceedings{Nikovski2013dec,
      • author = {Nikovski, D. and Esenther, A. and Ye, X. and Shiba, M. and Takayama, S.},
      • title = {Matcher Composition Methods for Automatic Schema Matching},
      • booktitle = {International Conference on Enterprise Information Systems (ICEIS)},
      • year = 2013,
      • editor = {Cordeiro, J. and Maciaszek, L. A. and Filipe, J.},
      • volume = 141,
      • series = {Lecture Notes in Business Information Processing},
      • pages = {108--123},
      • month = dec,
      • doi = {10.1007/978-3-642-40654-6_7},
      • url = {http://www.merl.com/publications/TR2013-103}
      • }
  • MERL Contact:

We address the problem of automating the process of deciding whether two data schema elements match (that is, refer to the same actual object or concept), and propose several methods for combining evidence computed by multiple basic matchers. One class of methods uses Bayesian networks to account for the conditional dependency between the similarity values produced by individual matchers that use the same or similar information, so as to avoid overconfidence in match probability estimates and improve the accuracy of matching. Another class of methods relies on optimization switches that mitigate this dependency in a domain-independent manner. Experimental results under several testing protocols suggest that the matching accuracy of the Bayesian composite matchers can significantly exceed that of the individual component matchers, and the careful selection of optimization switches can improve matching accuracy even further.