VersaMatch: Ontology Matching with Weak Supervision

被引:2
|
作者
Furst, Jonathan [1 ]
Argerich, Mauricio Fadel [2 ]
Cheng, Bin [3 ]
机构
[1] Zurich Univ Appl Sci, NEC Labs Europe, Zurich, Switzerland
[2] Univ Politecn Madrid, NEC Labs Europe, Madrid, Spain
[3] Springer Nat, NEC Labs Europe, Berlin, Germany
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023年 / 16卷 / 06期
关键词
D O I
10.14778/3583140.3583148
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ontology matching is crucial to data integration for across-silo data sharing and has been mainly addressed with heuristic and machine learning (ML) methods. While heuristic methods are often inflexible and hard to extend to new domains, ML methods rely on substantial and hard to obtain amounts of labeled training data. To overcome these limitations, we propose VersaMatch, a flexible, weakly-supervised ontology matching system. VersaMatch employs various weak supervision sources, such as heuristic rules, pattern matching, and external knowledge bases, to produce labels from a large amount of unlabeled data for training a discriminative ML model. For prediction, VersaMatch develops a novel ensemble model combining the weak supervision sources with the discriminative model to support generalization while retaining a high precision. Our ensemble method boosts end model performance by 4 points compared to a traditional weak-supervision baseline. In addition, compared to state-of-the-art ontology matchers, VersaMatch achieves an overall 4-point performance improvement in F1 score across 26 ontology combinations from different domains. For recently released, in-the-wild datasets, VersaMatch beats the next best matchers by 9 points in F1. Furthermore, its core weak-supervision logic can easily be improved by adding more knowledge sources and collecting more unlabeled data for training.
引用
收藏
页码:1305 / 1318
页数:14
相关论文
共 50 条
  • [21] Semantic matching of ontology instances
    Liu, Miao
    Guo, He-Qing
    Su, Jin-Dian
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 2959 - +
  • [22] An ontology for software component matching
    Pahl C.
    International Journal on Software Tools for Technology Transfer, 2007, 9 (2) : 169 - 178
  • [23] Interactive biomedical ontology matching
    Xue, Xingsi
    Hang, Zhi
    Tang, Zhengyi
    PLOS ONE, 2019, 14 (04):
  • [24] An ontology for software component matching
    Pahl, C
    FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING, PROCEEDINGS, 2003, 2621 : 6 - 21
  • [25] Ontology Matching with Word Embeddings
    Zhang, Yuanzhe
    Wang, Xuepeng
    Lai, Siwei
    He, Shizhu
    Liu, Kang
    Zhao, Jun
    Lv, Xueqiang
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 34 - 45
  • [26] A Study on Ontology Structure Matching
    Li, Li-Hua
    Hsu, Rong-Wang
    Hung, Shao-Shin
    Chou, Yu-Chien
    Pu, Tsung-Jen
    NEW ASPECTS OF SYSTEMS THEORY AND SCIENTIFIC COMPUTATION, 2010, : 234 - +
  • [27] ONTOLOGY MATCHING FOR COLLABORATIVE ENGINEERING
    Roche, Christophe
    ECEC/FUBUTEC'2009:16TH EUROPEAN CONCURRENT ENGINEERING CONFERENCE: 6TH FUTURE BUSINESS TECHNOLOGY CONFERENCE, 2009, : 105 - 109
  • [28] Special issue: Ontology matching
    Shvaiko, Pavel
    Euzenat, Jerome
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2007, 3 (02) : I - III
  • [29] The AgreementMakerLight Ontology Matching System
    Faria, Daniel
    Pesquita, Catia
    Santos, Emanuel
    Palmonari, Matteo
    Cruz, Isabel F.
    Couto, Francisco M.
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2013 CONFERENCES, 2013, 8185 : 527 - 541
  • [30] An effective ontology matching technique
    Alasoud, Ahmed
    Haarslev, Volker
    Shiri, Nematollaah
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2008, 4994 : 585 - 590