Rule-based data augmentation for knowledge graph embedding

被引:3
|
作者
Li, Guangyao
Sun, Zequn
Qian, Lei [1 ,2 ]
Guo, Qiang [1 ,2 ]
Hu, Wei [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] State Key Lab Math Engn & Adv Comp, Wuxi, Peoples R China
来源
AI OPEN | 2021年 / 2卷
基金
中国国家自然科学基金;
关键词
Knowledge graph embedding; Data augmentation; Logical rules;
D O I
10.1016/j.aiopen.2021.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human -understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding -based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.
引用
收藏
页码:186 / 196
页数:11
相关论文
共 50 条
  • [41] CONNECTIONIST AND RULE-BASED REPRESENTATIONS OF EXPERT KNOWLEDGE
    HUNT, E
    BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1989, 21 (02): : 88 - 95
  • [42] Outlier Mining in Rule-Based Knowledge Bases
    Nowak-Brzezinska, Agnieszka
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 391 - 396
  • [43] Knowledge and Rule-Based Diacritic Restoration in Serbian
    Krstev, Cvetana
    Stankovic, Ranka
    Vitas, Dusko
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA (CLIB '18), 2018, : 41 - 51
  • [44] ISSUES IN THE VERIFICATION OF KNOWLEDGE IN RULE-BASED SYSTEMS
    NAZARETH, DL
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1989, 30 (03): : 255 - 271
  • [45] From graph transformation to rule-based programming with diagrams
    Hoffmann, B
    APPLICATIONS OF GRAPH TRANSFORMATIONS WITH INDUSTRIAL RELEVANCE, PROCEEDINGS, 2000, 1779 : 165 - 180
  • [46] APPLICATION OF GRAPH-GRAMMARS TO RULE-BASED SYSTEMS
    KORFF, M
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 532 : 505 - 519
  • [47] A rule-based data warehouse model
    Favre, Cecile
    Bentayeb, Fadila
    Boussaid, Omar
    FLEXIBLE AND EFFICIENT INFORMATION HANDLING, 2006, 4042 : 274 - 277
  • [48] Rule-Based Conditioning of Probabilistic Data
    van Keulen, Maurice
    Kaminski, Benjamin L.
    Matheja, Christoph
    Katoen, Joost-Pieter
    SCALABLE UNCERTAINTY MANAGEMENT (SUM 2018), 2018, 11142 : 290 - 305
  • [49] Graph theory for rule-based modeling of biochemical networks
    Blinov, Michael L.
    Yang, Jin
    Faeder, James R.
    Hlavacek, William S.
    TRANSACTIONS ON COMPUTATIONAL SYSTEMS BIOLOGY VII, 2006, 4230 : 89 - 106
  • [50] A Rule-Based Approach to Embedding Techniques for Text Document Classification
    Aubaid, Asmaa M.
    Mishra, Alok
    APPLIED SCIENCES-BASEL, 2020, 10 (11):